Skip to content

Actions: zifeitong/vllm

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
42 workflow runs
42 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Bugfix] bitsandbytes models fail to run pipeline parallel (#10200)
clang-format #42: Commit ac49b59 pushed by zifeitong
November 13, 2024 23:48 15s main
November 13, 2024 23:48 15s
Disable spec-decode + chunked-prefill for draft models with tensor pa…
clang-format #41: Commit f677862 pushed by zifeitong
November 8, 2024 17:16 20s main
November 8, 2024 17:16 20s
[Misc] Add logging for CUDA memory (#10027)
clang-format #40: Commit 09d3550 pushed by zifeitong
November 5, 2024 18:15 17s main
November 5, 2024 18:15 17s
[Model][VLM] Add multi-video support for LLaVA-Onevision (#8905)
clang-format #39: Commit 5f8d807 pushed by zifeitong
October 28, 2024 19:28 23s main
October 28, 2024 19:28 23s
[Model][Bugfix] Add FATReLU activation and support for openbmb/MiniCP…
clang-format #38: Commit 5b8a1fd pushed by zifeitong
October 16, 2024 18:05 28s main
October 16, 2024 18:05 28s
[Doc] Compatibility matrix for mutual exclusive features (#8512)
clang-format #37: Commit 8baf85e pushed by zifeitong
October 11, 2024 18:58 22s main
October 11, 2024 18:58 22s
[Misc] Support quantization of MllamaForCausalLM (#8822)
clang-format #36: Commit 7193774 pushed by zifeitong
September 25, 2024 23:01 15s main
September 25, 2024 23:01 15s
Fix typical acceptance sampler with correct recovered token ids (#8562)
clang-format #35: Commit 5f7bb58 pushed by zifeitong
September 23, 2024 20:33 19s main
September 23, 2024 20:33 19s
[Bugfix] add dead_error property to engine client (#8574)
clang-format #34: Commit 0d47bf3 pushed by zifeitong
September 18, 2024 23:41 18s main
September 18, 2024 23:41 18s
[Model] Add mistral function calling format to all models loaded with…
clang-format #33: Commit a54ed80 pushed by zifeitong
September 17, 2024 18:04 19s main
September 17, 2024 18:04 19s
Bump version to v0.6.1 (#8379)
clang-format #32: Commit 3fd2b0d pushed by zifeitong
September 11, 2024 22:29 19s main
September 11, 2024 22:29 19s
[CI/Build] Increasing timeout for multiproc worker tests (#8203)
clang-format #31: Commit 1447c97 pushed by zifeitong
September 6, 2024 19:56 20s main
September 6, 2024 19:56 20s
[MISC] Replace input token throughput with total token throughput (#8…
clang-format #30: Commit 77d9e51 pushed by zifeitong
September 4, 2024 21:42 16s main
September 4, 2024 21:42 16s
[TPU] Async output processing for TPU (#8011)
clang-format #29: Commit 80c7b08 pushed by zifeitong
August 30, 2024 02:43 17s main
August 30, 2024 02:43 17s
[Misc] Update qqq to use vLLMParameters (#7805)
clang-format #28: Commit 6653040 pushed by zifeitong
August 26, 2024 19:44 16s main
August 26, 2024 19:44 16s
[CI/Build] Avoid downloading all HF files in RemoteOpenAIServer (#7…
clang-format #27: Commit 029c71d pushed by zifeitong
August 26, 2024 16:13 23s main
August 26, 2024 16:13 23s
[ci][test] fix RemoteOpenAIServer (#7838)
clang-format #26: Commit aab0fcd pushed by zifeitong
August 24, 2024 18:43 16s main
August 24, 2024 18:43 16s
[Bugfix] Use LoadFormat values for vllm serve --load-format (#7784)
clang-format #25: Commit 15310b5 pushed by zifeitong
August 22, 2024 19:39 15s main
August 22, 2024 19:39 15s
[BUG] fix crash on flashinfer backend with cudagraph disabled, when a…
clang-format #24: Commit 53328d7 pushed by zifeitong
August 21, 2024 15:58 18s main
August 21, 2024 15:58 18s
[Intel GPU] fix xpu not support punica kernel (which use torch.librar…
clang-format #23: Commit 6e4658c pushed by zifeitong
August 20, 2024 19:14 20s main
August 20, 2024 19:14 20s
[Misc] Remove Gemma RoPE (#7638)
clang-format #22: Commit df845b2 pushed by zifeitong
August 19, 2024 18:36 18s main
August 19, 2024 18:36 18s
[Misc] Update dockerfile for CPU to cover protobuf installation (#7182)
clang-format #21: Commit f4da5f7 pushed by zifeitong
August 15, 2024 17:03 18s main
August 15, 2024 17:03 18s
[Misc] Update gptq_marlin to use new vLLMParameters (#7281)
clang-format #20: Commit fb377d7 pushed by zifeitong
August 13, 2024 18:51 21s main
August 13, 2024 18:51 21s
[Core] Subclass ModelRunner to support cross-attention & encoder sequ…
clang-format #19: Commit fd95e02 pushed by zifeitong
August 6, 2024 21:57 18s main
August 6, 2024 21:57 18s
July 29, 2024 18:19 21s