Actions: intel/xFasterTransformer

XFT PR Validation

779 workflow runs

[Kernel] Add GPU kernels and enable LLaMA model.
XFT PR Validation #813: Pull request #372 synchronize by changqi1
June 13, 2024 07:18 22m 42s changqi1:changqing/feature/gpu_rope
[Doc] Add vllm benchmark docs.
XFT PR Validation #812: Pull request #448 opened by marvin-Yu
June 13, 2024 06:02 22m 47s doc/add_BM_2_vllm-serving
[Kernel] Add GPU kernels and enable LLaMA model.
XFT PR Validation #811: Pull request #372 synchronize by changqi1
June 13, 2024 04:22 24m 27s changqi1:changqing/feature/gpu_rope
[Kernel] Add GPU kernels and enable LLaMA model.
XFT PR Validation #809: Pull request #372 synchronize by changqi1
June 13, 2024 03:25 23m 36s changqi1:changqing/feature/gpu_rope
[Version] v1.7.1.
XFT PR Validation #802: Pull request #445 opened by Duyi-Wang
June 12, 2024 05:23 22m 14s Duyi-Wang:v1.7.1
[Model] Fix array out of bounds when rank > 2.
XFT PR Validation #794: Pull request #441 synchronize by Duyi-Wang
June 6, 2024 06:51 22m 29s Duyi-Wang:fix_multi_rank_cb_issue
[Model] Add Qwen2 GPTQ model support
XFT PR Validation #792: Pull request #439 opened by xiangzez
June 6, 2024 03:36 22m 42s xiangzez:qwen2_gptq_convert
[Kernel] Expand rmsNorm op.
XFT PR Validation #791: Pull request #437 synchronize by changqi1
June 5, 2024 13:50 36m 30s changqi1:changqing/feature/rmsnorm
Add Continue Batching support for Chatglm2/3
XFT PR Validation #789: Pull request #438 opened by a3213105
June 5, 2024 13:41 36m 6s a3213105:chatglm2_3_cb_support