
[Model] [Bugfix] Fix loading of fine-tuned models based on Phi-3-Small #12689

Merged
4 commits merged into vllm-project:main on Feb 4, 2025

Conversation

mgtk77 (Contributor) commented Feb 3, 2025

We have a fine-tuned model based on Phi-3-Small, and loading it fails due to this issue. It is an official model, created by a team at Microsoft, that handles financial reports.
Model information is available here: https://ai.azure.com/explore/models/financial-reports-analysis/version/2/registry/azureml

The error we get:

Loading safetensors checkpoint shards:  75% Completed | 3/4 [00:01<00:00,  2.00it/s]
...site-packages/vllm/model_executor/models/phi3_small.py", line 477, in load_weights
[rank0]:     param = params_dict[name]
[rank0]: KeyError: 'lm_head.weight'

The fix was verified by successfully loading the model.

The exact same fix already exists in other models: commandr.py, gemma.py, gpt_bigcode.py, and jais.py.
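For context, a minimal sketch of the fix pattern described above. This is illustrative code, not the actual phi3_small.py implementation: when a model ties its output projection to the input embeddings, a checkpoint may still ship an `lm_head.weight` tensor that has no corresponding entry in the model's parameter dictionary, so the weight loader must skip that name instead of indexing into `params_dict` and raising `KeyError`. The function and variable names here are hypothetical.

```python
def load_weights(params_dict, checkpoint_weights):
    """Copy checkpoint tensors into params_dict.

    Skips 'lm_head.weight', which is tied to the input embeddings and
    therefore absent from params_dict. Without the skip, the lookup
    params_dict[name] raises KeyError, as in the traceback above.
    """
    loaded = []
    for name, tensor in checkpoint_weights:
        if name == "lm_head.weight":
            # Tied to the embedding weights; no matching parameter exists.
            continue
        if name not in params_dict:
            raise KeyError(name)  # this is where the original error surfaced
        params_dict[name] = tensor
        loaded.append(name)
    return loaded
```

The same guard appears (with model-specific details) in the other loaders mentioned above.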

github-actions bot commented Feb 3, 2025

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small, essential subset of CI tests to catch errors quickly. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add ready label to the PR
  • Enable auto-merge.

🚀

@mgtk77 mgtk77 force-pushed the dev/mgreenbaum/fixPhi3FT branch from ab8ca5d to 0fcd244 on February 3, 2025 11:01
Signed-off-by: Michael Greenbaum <[email protected]>
DarkLight1337 (Member) left a comment

Thanks for fixing!

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) February 4, 2025 08:08
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 4, 2025
@youkaichao youkaichao disabled auto-merge February 4, 2025 12:58
@youkaichao youkaichao merged commit 6469038 into vllm-project:main Feb 4, 2025
22 of 36 checks passed
fxmarty-amd pushed a commit to fxmarty-amd/vllm that referenced this pull request Feb 7, 2025
Signed-off-by: Michael Greenbaum <[email protected]>
Co-authored-by: Michael Greenbaum <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
ShangmingCai pushed a commit to ShangmingCai/vllm that referenced this pull request Feb 10, 2025
panf2333 pushed a commit to yottalabsai/vllm that referenced this pull request Feb 18, 2025
Labels
ready ONLY add when PR is ready to merge/full CI is needed
Projects
None yet
Development


5 participants