Skip to content

Actions: NVIDIA/TensorRT-LLM

Blossom-CI

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
150 workflow runs
150 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

how to set "eagle_choices" or "medusa_choices" per request
Blossom-CI #25: Issue comment #2522 (comment) created by nekorobov
December 2, 2024 17:38 5s
December 2, 2024 17:38 5s
Docker image
Blossom-CI #24: Issue comment #52 (comment) created by Isaac4real
December 2, 2024 16:21 6s
December 2, 2024 16:21 6s
[Question] Int8 Gemm's perf degraded in real models.
Blossom-CI #23: Issue comment #2351 (comment) created by foreverlms
December 2, 2024 14:57 5s
December 2, 2024 14:57 5s
MPI Abort Error when using disaggServerBenchmark
Blossom-CI #22: Issue comment #2518 (comment) created by chuangz0
December 2, 2024 14:17 5s
December 2, 2024 14:17 5s
MPI Abort Error when using disaggServerBenchmark
Blossom-CI #21: Issue comment #2518 (comment) created by zhangts20
December 2, 2024 13:31 5s
December 2, 2024 13:31 5s
[Feature Request] optional tracing(onnx etc.) based solution inside trt-llm
Blossom-CI #20: Issue comment #2519 (comment) created by tp-nan
December 2, 2024 09:25 5s
December 2, 2024 09:25 5s
Can't build whisper engines with past two releases
Blossom-CI #19: Issue comment #2508 (comment) created by MahmoudAshraf97
December 2, 2024 08:55 5s
December 2, 2024 08:55 5s
MPI Abort Error when using disaggServerBenchmark
Blossom-CI #18: Issue comment #2518 (comment) created by chuangz0
December 2, 2024 08:52 4s
December 2, 2024 08:52 4s
Can't build whisper engines with past two releases
Blossom-CI #17: Issue comment #2508 (comment) created by hello-11
December 2, 2024 08:23 6s
December 2, 2024 08:23 6s
Qwen2-72B w4a8 empty output
Blossom-CI #15: Issue comment #2392 (comment) created by calico-niko
December 2, 2024 07:39 5s
December 2, 2024 07:39 5s
Qwen2-VL Batch Bug
Blossom-CI #14: Issue comment #2495 (comment) created by YSF-A
December 2, 2024 07:14 5s
December 2, 2024 07:14 5s
qserve is slower then awq int4 for llama2-7b on H100
Blossom-CI #13: Issue comment #2509 (comment) created by bobboli
December 2, 2024 04:43 4s
December 2, 2024 04:43 4s
Blossom-CI
Blossom-CI #12: created by sun2011yao
December 2, 2024 03:25 4s
December 2, 2024 03:25 4s
Qwen2-VL Batch Bug
Blossom-CI #11: Issue comment #2495 (comment) created by sun2011yao
December 2, 2024 03:21 4s
December 2, 2024 03:21 4s
Error happened when quantizate Qwen2.5-14B-Instruct by SmoothQuant
Blossom-CI #10: Issue comment #2319 (comment) created by Wonder-donbury
December 2, 2024 02:08 5s
December 2, 2024 02:08 5s
Does recurrentgemma support quantization?
Blossom-CI #8: Issue comment #2450 (comment) created by daiwk
November 30, 2024 03:02 5s
November 30, 2024 03:02 5s
Blossom-CI
Blossom-CI #7: created by niukuo
November 29, 2024 08:48 4m 30s
November 29, 2024 08:48 4m 30s
Blossom-CI
Blossom-CI #6: created by niukuo
November 29, 2024 08:24 17s
November 29, 2024 08:24 17s
Blossom-CI
Blossom-CI #5: created by niukuo
November 29, 2024 07:44 1d 11h 40m 18s
November 29, 2024 07:44 1d 11h 40m 18s
Qwen2-VL Batch Bug
Blossom-CI #4: Issue comment #2495 (comment) created by sun2011yao
November 29, 2024 07:34 5s
November 29, 2024 07:34 5s
Blossom-CI
Blossom-CI #3: created by niukuo
November 29, 2024 07:31 15s
November 29, 2024 07:31 15s
Blossom-CI
Blossom-CI #2: created by niukuo
November 29, 2024 07:29 15s
November 29, 2024 07:29 15s
Blossom-CI
Blossom-CI #1: created by niukuo
November 29, 2024 07:21 14s
November 29, 2024 07:21 14s