[5/N] pass the whole config to model #9983
Conversation
```diff
-        quant_config: Optional[QuantizationConfig] = None,
-        lora_config: Optional[LoRAConfig] = None,
+        vllm_config: VllmConfig,
+        prefix: str = "",
```
I think you should not provide a default value for `position_embedding`, so people won't forget about it.

Is it intended that you add the `prefix` argument here? (Same question for the other models.)
We need to unify the function signature, so that `vllm_config` (required) and `prefix` (optional) are enough to construct any model (as long as the config is correct). This means we cannot have any other required parameters.

I think the `position_embedding` here should be fine, because it is only used for internal classes. Code outside of this file should not be aware of it.
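For illustration, here is a minimal sketch of that convention. The class `MyModel` is hypothetical, and the sketch assumes `VllmConfig` exposes `model_config`, `quant_config`, and `lora_config` attributes, as this PR series sets up:

```python
from torch import nn

from vllm.config import VllmConfig


class MyModel(nn.Module):
    """Hypothetical model following the unified constructor signature."""

    def __init__(self, *, vllm_config: VllmConfig, prefix: str = "") -> None:
        super().__init__()
        # Everything the model needs is derived from the single required
        # argument, so no other required constructor parameters remain.
        hf_config = vllm_config.model_config.hf_config
        quant_config = vllm_config.quant_config  # may be None
        lora_config = vllm_config.lora_config    # may be None
        # `prefix` names this module within the full parameter namespace
        # (useful e.g. for per-layer quantization); submodules would extend
        # it, e.g. prefix=f"{prefix}.layers.{i}".
        self.embed_tokens = nn.Embedding(hf_config.vocab_size,
                                         hf_config.hidden_size)
```

With this shape, a model builder can instantiate any architecture the same way, e.g. `model_cls(vllm_config=vllm_config, prefix="")`, without knowing which features the model supports.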
I think `prefix` should only be an argument if we have actually implemented quantization support for that model. Is this necessary for the model construction code?
Also, this particular model is a base class and will not be called directly by the model builder. As long as the subclass doesn't have `position_embedding` in its argument list, it should not affect model builder code.
> I think `prefix` should only be an argument if we have actually implemented quantization support for that model. Is this necessary for the model construction code?

We should keep this uniform signature, so that if people want to add quantization support later, it is easier. Functionality support like LoRA, quantization, etc. should be determined by checking the config, rather than by checking the function signature.
Can you add some helper functions to `VllmConfig`, like `raise_for_unsupported_quant`, so we have a standardized way of explicitly indicating that the model doesn't support quantization, and call it in the top-level models that don't originally support `prefix`?
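A rough sketch of what such a helper might look like, using the name suggested above; it is written as a free function for brevity, though the comment suggests a method on `VllmConfig`, and the exact signature and error message are assumptions, not something this PR implements:

```python
from vllm.config import VllmConfig


def raise_for_unsupported_quant(vllm_config: VllmConfig,
                                model_name: str) -> None:
    # Hypothetical helper: fail fast when a quantization config is set
    # for a model that would otherwise silently ignore it.
    if vllm_config.quant_config is not None:
        raise NotImplementedError(
            f"{model_name} does not support quantization yet.")
```

A top-level model that does not thread `prefix`/`quant_config` through its layers could call this at the top of its constructor to make the limitation explicit.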
Given this, I'm fine with the change.
I will ask the Neural Magic folks to add this. I'm not familiar with how to tell whether a model supports a quantization scheme.
cc @mgoin
As per offline discussion, let's address this in another PR.
This pull request has merge conflicts that must be resolved before it can be merged.
As per offline discussion, let's merge this first to unblock the next steps.
Contains code from #9978, which needs to be merged first.