Pull requests: vllm-project/vllm
[Core] Integrate Fastsafetensor loader for loading model weights
ci/build
documentation
Improvements or additions to documentation
#10647
opened Nov 26, 2024 by
manish-sethi
Draft
[V1] VLM - Support running the mm_mapper preprocessor in the frontend process
frontend
needs-rebase
#10640
opened Nov 25, 2024 by
alexm-neuralmagic
[Frontend] don't block event loop in tokenization (preprocess) in OpenAI compatible server
frontend
#10635
opened Nov 25, 2024 by
tomeras91
[Misc] Allow LoRA to adaptively increase rank and remove possible_max_ranks
#10623
opened Nov 25, 2024 by
JinhyunBang
[Core][Bugfix] Use correct device to initialize GPU data during CUDA-graph-capture
#10608
opened Nov 24, 2024 by
IdoAsraff
[fix] Correct num_accepted_tokens counting
ready
ONLY add when PR is ready to merge/full CI is needed
#10604
opened Nov 24, 2024 by
KexinFeng
[Kernel] Remove hard-dependencies of Speculative decode to CUDA workers
ready
#10587
opened Nov 23, 2024 by
xuechendi
[ Kernels ] [ AMD ] Add Fused MoE Configs
#10574
opened Nov 22, 2024 by
robertgshaw2-neuralmagic
Draft
[V1] Refactor model executable interface for multimodal models
#10570
opened Nov 22, 2024 by
ywang96
14 tasks done
[Hardware][Intel-Gaudi] Enable LoRA support for Intel Gaudi (HPU)
#10565
opened Nov 22, 2024 by
SanjuCSudhakaran
[Model] Added GLM-4 series hf format model support vllm==0.6.4
#10561
opened Nov 22, 2024 by
sixsixcoder
[Benchmark] Benchmark structured output with datasets
#10557
opened Nov 22, 2024 by
xuechendi
[Docs] Add dedicated tool calling page to docs
documentation
#10554
opened Nov 21, 2024 by
mgoin
[Misc] Enable vLLM to Dynamically Load LoRA from a Remote Server
frontend
#10546
opened Nov 21, 2024 by
angkywilliam
[core] overhaul memory profiling and fix backward compatibility
needs-rebase
#10511
opened Nov 21, 2024 by
youkaichao