
Transformers version bump #2029


Merged: 12 commits merged into main on May 23, 2025
Conversation

@KaelanDt (Contributor) commented May 7, 2025

Transformers version bump for more recent model support

@KaelanDt mentioned this pull request on May 7, 2025
@KaelanDt (Contributor, Author) commented May 7, 2025

Update: unpinning pydantic seems to give more flexibility in how deepspeed is tested, as they seem to use some unusual aliases to keep deprecated features working, see deepspeedai/DeepSpeed#4407. Hugging Face also unpinned pydantic here.

Now the tests stumble on a numerical mismatch for gemma:

 AssertionError: Tensor-likes are not close!
E       
E       Mismatched elements: 3 / 5242880 (0.0%)
E       Greatest absolute difference: 3.3855438232421875e-05 at index (0, 19, 61862) (up to 3e-05 allowed)
E       Greatest relative difference: 0.0006763985147699714 at index (0, 16, 127381) (up to 3e-05 allowed)
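For reference, this failure has the shape of a `torch.testing.assert_close` check. A minimal sketch of how the tolerance could be reproduced or loosened (tensor sizes chosen only to match the reported element count; the values and tolerances are illustrative, not the actual gemma logits):

```python
import torch

# 1 x 20 x 262144 gives the 5,242,880 elements reported above (illustrative shape).
expected = torch.randn(1, 20, 262144)
actual = expected + 3.3e-05 * torch.rand_like(expected)  # perturb below ~3.4e-05

# assert_close's float32 defaults are rtol=1.3e-6, atol=1e-5; the CI test
# apparently allows 3e-5. A slightly looser atol absorbs the mismatch:
torch.testing.assert_close(actual, expected, rtol=3e-05, atol=5e-05)
```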

As well as on some OpenAI client tests:

 AssertionError: Non-streaming chat completion failed with status code 500
E           assert 500 == 200
E            +  where 500 = <Response [500]>.status_code

/__w/9/s/tests/test_serve.py:230: AssertionError
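For reference, the failing assertion is of this form (the endpoint URL and payload below are assumptions for illustration; the real check lives in tests/test_serve.py):

```python
import requests

# Non-streaming chat completion against a locally served OpenAI-spec endpoint
# (hypothetical URL and model name):
response = requests.post(
    "http://127.0.0.1:8000/v1/chat/completions",
    json={"model": "lit", "messages": [{"role": "user", "content": "Hello"}]},
)
assert response.status_code == 200, (
    f"Non-streaming chat completion failed with status code {response.status_code}"
)
```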

Would appreciate a look at these; perhaps @k223kim for gemma and @Borda for the OpenAI API tests.

@Borda (Member) commented May 7, 2025

I think this needs to be fixed:

 pydantic.errors.PydanticUserError: If you use `@root_validator` with pre=False (the default) you MUST specify `skip_on_failure=True`. Note that `@root_validator` is deprecated and should be replaced with `@model_validator`
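For reference, the pydantic v2 migration that this error asks for looks roughly like the sketch below (the config class is hypothetical, not DeepSpeed's actual model):

```python
from pydantic import BaseModel, model_validator

class TrainingConfig(BaseModel):  # hypothetical model, for illustration only
    batch_size: int = 8
    micro_batch_size: int = 8

    # pydantic 1.x style, now deprecated:
    # @root_validator(skip_on_failure=True)
    # def check_batches(cls, values): ...

    @model_validator(mode="after")
    def check_batches(self):
        # Runs after field validation, like root_validator with pre=False.
        if self.micro_batch_size > self.batch_size:
            raise ValueError("micro_batch_size cannot exceed batch_size")
        return self
```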

@KaelanDt (Contributor, Author) commented May 7, 2025

Yes, but this comes from Hugging Face code that calls deepspeed:

/usr/local/lib/python3.10/dist-packages/transformers/models/falcon/modeling_falcon.py:41: in <module>
    from ...modeling_utils import PreTrainedModel
/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py:158: in <module>
    import deepspeed
/usr/local/lib/python3.10/dist-packages/deepspeed/__init__.py:16: in <module>
    from . import module_inject
/usr/local/lib/python3.10/dist-packages/deepspeed/module_inject/__init__.py:6: in <module>
    from .replace_module import replace_transformer_layer, revert_transformer_layer, ReplaceWithTensorSlicing, GroupQuantizer, generic_injection
/usr/local/lib/python3.10/dist-packages/deepspeed/module_inject/replace_module.py:792: in <module>
    from ..pipe import PipelineModule
/usr/local/lib/python3.10/dist-packages/deepspeed/pipe/__init__.py:6: in <module>
    from ..runtime.pipe import PipelineModule, LayerSpec, TiedLayerSpec
/usr/local/lib/python3.10/dist-packages/deepspeed/runtime/pipe/__init__.py:6: in <module>
    from .module import PipelineModule, LayerSpec, TiedLayerSpec
/usr/local/lib/python3.10/dist-packages/deepspeed/runtime/pipe/module.py:19: in <module>
    from ..activation_checkpointing import checkpointing
/usr/local/lib/python3.10/dist-packages/deepspeed/runtime/activation_checkpointing/checkpointing.py:25: in <module>
    from deepspeed.runtime.config import DeepSpeedConfig

My understanding is that they allow this so that old code can still run with an older version of pydantic.
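For context, such compatibility aliases usually take this shape (an illustrative sketch, not DeepSpeed's actual code):

```python
# Prefer the pydantic.v1 namespace that pydantic >= 2 ships as a shim,
# fall back to a real pydantic 1.x install:
try:
    from pydantic.v1 import BaseModel, root_validator
except ImportError:
    from pydantic import BaseModel, root_validator
```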

@Borda (Member) commented May 7, 2025

cc: @t-vi

@ysjprojects mentioned this pull request on May 15, 2025
@Borda (Member) commented May 19, 2025

FAILED tests/test_serve.py::test_serve_with_openai_spec - AssertionError: Non-streaming chat completion failed with status code 500
assert 500 == 200
 +  where 500 = <Response [500]>.status_code

cc: @aniketmaurya

@aniketmaurya (Contributor)

> FAILED tests/test_serve.py::test_serve_with_openai_spec - AssertionError: Non-streaming chat completion failed with status code 500
> assert 500 == 200
>  +  where 500 = <Response [500]>.status_code
>
> cc: @aniketmaurya

Does this pass now?

@bhimrazy (Contributor) commented May 19, 2025

It seems like the issue might be related to the Pydantic version. Under the GPU test, the environment somehow falls back to a much older version of Pydantic (1.10.17) that doesn't support the model_copy method, which is used in the OpenAI spec.

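For reference, model_copy only exists in pydantic v2; v1 spells it copy (a minimal sketch with a hypothetical model):

```python
from pydantic import BaseModel

class ChatMessage(BaseModel):  # hypothetical model, for illustration only
    role: str
    content: str

msg = ChatMessage(role="user", content="hi")

# pydantic >= 2:
updated = msg.model_copy(update={"content": "hello"})

# pydantic 1.x (e.g. 1.10.17) has no model_copy; the old spelling is:
# updated = msg.copy(update={"content": "hello"})
```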

cc: @aniketmaurya @Borda

@aniketmaurya (Contributor)

Thanks for investigating this @bhimrazy! @Borda @KaelanDt, maybe set a minimum version for pydantic; it's not worth supporting an ancient version.
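A floor like the following would rule out pydantic 1.x (the file and the exact minimum here are assumptions, not necessarily what this PR settled on):

```
# requirements.txt (illustrative)
pydantic >= 2.0
```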

@Borda (Member) commented May 23, 2025

We did some major updates to the dev env on master, so let's update this branch.

@Borda marked this pull request as ready for review on May 23, 2025 13:17
@Borda requested review from lantiga and t-vi as code owners on May 23, 2025 13:17
@Borda enabled auto-merge (squash) on May 23, 2025 14:11
@Borda merged commit f021d88 into main on May 23, 2025
24 of 36 checks passed
@Borda deleted the kaelan/version-bump branch on May 23, 2025 14:11