[Core]Init vllm-ascend #2

wangxiyuan · 2025-01-29T13:04:49Z

No description provided.

Signed-off-by: MengqingCao <[email protected]>

* add requirements.txt for npu
* update setup.py

Signed-off-by: MengqingCao <[email protected]>

Add npu worker and model_runner

Signed-off-by: MengqingCao <[email protected]>

* add block_size arg to inference script
* add get_current_memory_usage to platform

Signed-off-by: MengqingCao <[email protected]>

Signed-off-by: MengqingCao <[email protected]>

[bugfix]: attention alibi bias shape is wrong

[Docs] Update docs and code format

[Format] Update docs and add format.sh

Signed-off-by: MengqingCao <[email protected]>

Signed-off-by: Yikun <[email protected]>

Signed-off-by: MengqingCao <[email protected]>

Signed-off-by: Shanshan Shen <[email protected]>

Signed-off-by: Yikun <[email protected]>

* enable npu profiling

  Signed-off-by: wangli <[email protected]>
* adjust import position

  Signed-off-by: wangli <[email protected]>

---------

Signed-off-by: wangli <[email protected]>

* [CI] use cached vLLM

  Signed-off-by: MengqingCao <[email protected]>
* fix path

  Signed-off-by: MengqingCao <[email protected]>

---------

Signed-off-by: MengqingCao <[email protected]>

Signed-off-by: Yikun <[email protected]>

* update contributing doc

  Signed-off-by: Shanshan Shen <[email protected]>
* update contributing doc

  Signed-off-by: Shanshan Shen <[email protected]>
* update contributing doc

  Signed-off-by: Shanshan Shen <[email protected]>
* update contributing doc

  Signed-off-by: Shanshan Shen <[email protected]>
* add ut pr label

  Signed-off-by: Shanshan Shen <[email protected]>
* update pr labels

  Signed-off-by: Shanshan Shen <[email protected]>
* update pr labels

  Signed-off-by: Shanshan Shen <[email protected]>

---------

Signed-off-by: Shanshan Shen <[email protected]>

Signed-off-by: Yikun <[email protected]>

* Init vLLM Ascend backend plugin README.md

  Signed-off-by: Yikun <[email protected]>
* Fix words and logos

  Signed-off-by: Yikun <[email protected]>

---------

Signed-off-by: Yikun <[email protected]>

### What this PR does / why we need it?

Add Python ignore and `.DS_Store`

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Ran `git status` and confirmed that the related files are ignored.

Signed-off-by: Yikun <[email protected]>

### What this PR does / why we need it?

This patch aligns the package metadata with that of the vLLM community and completes the necessary information:

- Add copyright
- Update version: 0.1.0a1, an alpha release that follows https://packaging.python.org/en/latest/specifications/version-specifiers
- Add author/license/description
- Remove extras_require

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

```
# Prepare
$ python3 -m venv ./.venv-test
$ source ./.venv-test/bin/activate

# Install without failure
$ pip install -e . --no-deps

# Check metadata as expected
$ ls ./.venv-test/lib/python3.12/site-packages/vllm_ascend-0.1.0a1.dist-info
INSTALLER LICENSE METADATA RECORD REQUESTED WHEEL direct_url.json entry_points.txt top_level.txt
```

Signed-off-by: Yikun <[email protected]>
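As a rough illustration of why `0.1.0a1` counts as an alpha pre-release, the sketch below parses a deliberately simplified subset of the version scheme and shows that a pre-release sorts before its final release. The names `parse_version` and `sort_key` are hypothetical helpers written for this note, not part of vllm-ascend or of any packaging library, and the pattern ignores epochs, post/dev releases, local versions, and the normalization rules of the full specification.

```python
import re

# Simplified pattern (an assumption for illustration): a dotted numeric release
# segment, optionally followed by a pre-release tag "a", "b", or "rc" and a number.
_VERSION_RE = re.compile(
    r"^(?P<release>\d+(?:\.\d+)*)(?:(?P<pre_tag>a|b|rc)(?P<pre_num>\d+))?$"
)


def parse_version(version):
    """Split a version string into (release_tuple, pre_release_or_None)."""
    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    release = tuple(int(part) for part in match.group("release").split("."))
    pre = None
    if match.group("pre_tag") is not None:
        pre = (match.group("pre_tag"), int(match.group("pre_num")))
    return release, pre


def sort_key(version):
    """Build a key in which a pre-release orders before the matching final release."""
    release, pre = parse_version(version)
    if pre is None:
        # Final releases get flag 1, so they sort after any pre-release.
        return (release, 1, "", 0)
    # "a" < "b" < "rc" already holds for plain string comparison.
    return (release, 0, pre[0], pre[1])


print(parse_version("0.1.0a1"))
print(sorted(["0.1.0", "0.1.0a1", "0.1.0rc1"], key=sort_key))
```

Release tuples of different lengths (for example `0.1` and `0.1.0`) are not padded here, so this sketch should not be used in place of a real version parser.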

init vLLM native UTs

Signed-off-by: MengqingCao <[email protected]>

* Refactor worker to decouple it from gpuworker
* add `get_model` func to `model_runner` and `worker`

Please merge this PR before #32, because it relies on the refactoring in this PR.

Signed-off-by: MengqingCao <[email protected]>

Main changes:

* exit if vLLM native ut failed
* rm redundant mappings in docker
* update TEST_FILES

`test_config.py` is disabled because some models require access permission: ![image](https://github.com/user-attachments/assets/79ceb1a5-f8e1-4228-8191-f0a83803ba6a)

Signed-off-by: MengqingCao <[email protected]>
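The "exit if vLLM native ut failed" behaviour can be pictured with the minimal sketch below. It is an assumption about the general approach, not the project's actual CI script: `run_until_failure` is a hypothetical helper, and the file names in the usage comment are placeholders rather than the real contents of `TEST_FILES`.

```python
import subprocess
import sys


def run_until_failure(commands):
    """Run each command in order and stop at the first failure.

    Returns the first non-zero exit code, or 0 if every command succeeded.
    """
    for cmd in commands:
        result = subprocess.run(cmd)
        if result.returncode != 0:
            # Stop early so a failing test file fails the whole job.
            return result.returncode
    return 0


# Usage sketch (placeholder paths, not the real TEST_FILES list):
#   test_files = ["tests/test_a.py", "tests/test_b.py"]
#   commands = [[sys.executable, "-m", "pytest", "-sv", path] for path in test_files]
#   sys.exit(run_until_failure(commands))
```

Returning the exit code instead of calling `sys.exit` inside the loop keeps the helper easy to test; the caller decides how to propagate the failure.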

### What this PR does / why we need it?

Adapts AttentionLayer and kvcache scaling interface to make CI happy with master vLLM.

- vllm-project/vllm@e97f802 Add the dynamic kv cache scaling factors
- vllm-project/vllm@86bfb6d Add AttentionLayer to forward

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

- run with master vllm

```
# Install vLLM main
git clone --depth 1 https://github.com/vllm-project/vllm.git /workspace/vllm
python3 -m pip install -r /workspace/vllm/requirements-build.txt
VLLM_TARGET_DEVICE="empty" python3 -m pip install /workspace/vllm/

# Install vllm-ascend main
python3 -m pip install /workspace/vllm_ascend/

# test
pytest tests -sv
```

- CI passed

Signed-off-by: Yikun <[email protected]>

### What this PR does / why we need it?

Test with vllm master code to fix CI failure

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

By CI test

Signed-off-by: wangxiyuan <[email protected]>

### What this PR does / why we need it?

Remove useless ci deps because we use `VLLM_TARGET_DEVICE=empty` to build

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

CI passed

Signed-off-by: Yikun <[email protected]>

### What this PR does / why we need it?

- Add `Building and testing` section to CONTRIBUTING.md
- Refresh Dockerfile to use empty VLLM_TARGET_DEVICE

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Preview and test

Signed-off-by: Yikun Jiang <[email protected]>

wangxiyuan · 2025-01-29T13:11:13Z

Please do not merge yet; the CI needs to work first.

Yikun

Just a note: some hidden files, such as .github and .gitignore, haven't been uploaded yet.

wangxiyuan and others added 30 commits December 19, 2024 10:22

Initial commit

c1eb29e

Init project

d80d00b

Update register way

48a9120

Init platform class

b3e620a

Add torch compile backend

52f8a46

init attn and communicator

b4d195b

Signed-off-by: MengqingCao <[email protected]>

Some updates

68e03e8

* add requirements.txt for npu
* update setup.py

Signed-off-by: MengqingCao <[email protected]>

add npu worker and model_runner

c93de5c

Merge pull request #2 from cosdt/sss

8e4a606

Add npu worker and model_runner

tiny fix

67aa293

Signed-off-by: MengqingCao <[email protected]>

update install doc

0260198

Signed-off-by: MengqingCao <[email protected]>

Update interface

b7a904a

update communicator and ray-related vars

f30bd3d

Signed-off-by: MengqingCao <[email protected]>

Some fixes

79ec53f

* add block_size arg to inference script
* add get_current_memory_usage to platform

Signed-off-by: MengqingCao <[email protected]>

init block size

9502c4f

Fix import error

568759e

Update README.md

87c702e

add dist infer scripts & update readme

77ae657

Signed-off-by: MengqingCao <[email protected]>

add audio infer example

e06051b

Signed-off-by: MengqingCao <[email protected]>

update extras_require in setup.py

aa32a0b

Signed-off-by: MengqingCao <[email protected]>

bugfix: attention alibi bias shape is wrong

80a6a8e

Merge pull request #6 from cosdt/sss

58059da

[bugfix]: attention alibi bias shape is wrong

Cleanup

c38b702

update docs and code format

f6a44f1

Merge pull request #7 from cosdt/sss

969b7e7

[Docs] Update docs and code format

update docs and format.sh

dbbe47c

Merge pull request #8 from cosdt/sss

21bb285

[Format] Update docs and add format.sh

add Dockerfile

6c6266f

Signed-off-by: MengqingCao <[email protected]>

Update README.md

8baa637

Rename to vllm-ascend

dc63477

Yikun and others added 23 commits January 16, 2025 09:04

Copy vLLM DCO and CONTRIBUTING.md (#18)

60b2bfa

Signed-off-by: Yikun <[email protected]>

Add CODE_OF_CONDUCT.md (#19)

317a14e

Signed-off-by: Yikun <[email protected]>

[CI] enable CI for all branches (#23)

efac75e

Signed-off-by: MengqingCao <[email protected]>

[CI] add npu CI & init ut (#14)

40c8eed

Signed-off-by: MengqingCao <[email protected]>

[CI] use secret var (#28)

51e3ee9

Signed-off-by: MengqingCao <[email protected]>

update format scripts (#31)

5a1a8f1

Signed-off-by: Shanshan Shen <[email protected]>

Add Apache 2.0 license (#22)

14d86c4

Signed-off-by: Yikun <[email protected]>

Add PR template (#36)

bd6df6f

Signed-off-by: Yikun <[email protected]>

[WORKER] Enable npu profiling (#35)

3202eb0

* enable npu profiling

  Signed-off-by: wangli <[email protected]>
* adjust import position

  Signed-off-by: wangli <[email protected]>

---------

Signed-off-by: wangli <[email protected]>

[CI] use cached vLLM (#39)

d3c73a9

* [CI] use cached vLLM

  Signed-off-by: MengqingCao <[email protected]>
* fix path

  Signed-off-by: MengqingCao <[email protected]>

---------

Signed-off-by: MengqingCao <[email protected]>

Add logo (#41)

99f615e

Signed-off-by: Yikun <[email protected]>

Add license header (#40)

4a044cd

Signed-off-by: Yikun <[email protected]>

[Doc] Init vLLM Ascend plugin README.md (#21)

188baa3

* Init vLLM Ascend backend plugin README.md

  Signed-off-by: Yikun <[email protected]>
* Fix words and logos

  Signed-off-by: Yikun <[email protected]>

---------

Signed-off-by: Yikun <[email protected]>

[Misc] Add Python ignore (#42)

22dd5ae

### What this PR does / why we need it?

Add Python ignore and `.DS_Store`

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Ran `git status` and confirmed that the related files are ignored.

Signed-off-by: Yikun <[email protected]>

[CI][UT] init vLLM native UTs (#43)

8b28736

init vLLM native UTs

Signed-off-by: MengqingCao <[email protected]>

[CI]Fix broken CI (#46)

c240cb4

### What this PR does / why we need it?

Test with vllm master code to fix CI failure

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

By CI test

Signed-off-by: wangxiyuan <[email protected]>

[CI] Remove useless ci deps (#48)

6889a52

### What this PR does / why we need it?

Remove useless ci deps because we use `VLLM_TARGET_DEVICE=empty` to build

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

CI passed

Signed-off-by: Yikun <[email protected]>

wangxiyuan mentioned this pull request Jan 29, 2025

Glad to see that vLLM has officially added support for the Ascend hardware backend! #1

Open

Yikun reviewed Jan 30, 2025


Yikun force-pushed the main branch 2 times, most recently from 83e76e3 to f782b51 on February 5, 2025 01:29

wangxiyuan closed this Feb 5, 2025

wangxiyuan force-pushed the main branch from f782b51 to 5caed00 on February 5, 2025 01:37
