Set vllm-hpu-extension to 6ac93fb #684

mfylcek · 2025-01-14T10:18:06Z

remove expert_max hard code (#47)
vLLM-Ext: Full enabling of ALiBi (#34)
Add version inference via setuptools-scm (#58)
Revert "vLLM-Ext: Full enabling of ALiBi (#34)" (#59)
Remove punica_hpu.py from vllm_hpu_extension (#66)
Removed previous (not-pipelined) pa implementation (#72)
Add flag to enable running softmax in fp32 (#71)
Update calibration readme link (#73)
allow lm_head quantization in calibration process (#65)
Pad to bmin if value is less (#67)
Update pyproject.toml (#75)

Co-authored-by: Michał Kuligowski <[email protected]>

Set vllm-hpu-extension to 6ac93fb

61252f3

mfylcek marked this pull request as ready for review January 14, 2025 10:18

mfylcek requested review from kzawora-intel, madamczykhabana, michalkuligowski, mgawarkiewicz and vivekgoe as code owners January 14, 2025 10:18

Merge branch 'habana_main' into dev/mfylcek/hpu-extension-update

37d8a6c

mfylcek requested a review from afierka-intel as a code owner January 14, 2025 11:30

michalkuligowski added 2 commits January 14, 2025 12:43

Update requirements-hpu.txt

4f2947f

Update requirements-hpu.txt

cf817b5

michalkuligowski approved these changes Jan 15, 2025

michalkuligowski merged commit 885c60d into habana_main Jan 15, 2025
53 checks passed

michalkuligowski deleted the dev/mfylcek/hpu-extension-update branch January 15, 2025 14:27
