Actions: intel/xFasterTransformer

XFT PR Validation

750 workflow runs

[Python] Add get_env() to get LD_PRELOAD set.
XFT PR Validation #763: Pull request #427 opened by Duyi-Wang
May 30, 2024 07:48 22m 58s Duyi-Wang:get_env
[CI] Check gcc version.
XFT PR Validation #761: Pull request #426 opened by changqi1
May 28, 2024 06:02 15m 49s changqi1:changqing/feature/ci_fix
[Layers] Fixed the seg fault error when running with more than 4 ranks
XFT PR Validation #756: Pull request #424 synchronize by abenmao
May 28, 2024 02:18 45m 26s abenmao:dist/utils/split
[Kernel] Add GPU kernels and enable LLaMA model.
XFT PR Validation #755: Pull request #372 synchronize by changqi1
May 27, 2024 10:21 1h 29m 48s changqi1:changqing/feature/gpu_rope
[Distribute] Add distribute support for continuous batching api.
XFT PR Validation #753: Pull request #421 synchronize by Duyi-Wang
May 27, 2024 08:37 22m 45s Duyi-Wang:cb_distribute
[Kernel] Add FP16 MHA and MLP kernels.
XFT PR Validation #750: Pull request #415 synchronize by changqi1
May 27, 2024 03:05 22m 40s changqi1:changqing/feature/full_fp16_3
[Kernel] Add FP16 MHA and MLP kernels.
XFT PR Validation #749: Pull request #415 synchronize by changqi1
May 27, 2024 03:03 1h 2m 17s changqi1:changqing/feature/full_fp16_3
[Distribute] Add distribute support for continuous batching api.
XFT PR Validation #746: Pull request #421 synchronize by Duyi-Wang
May 24, 2024 08:53 1d 7h 40m 2s Duyi-Wang:cb_distribute
[Distribute] Add distribute support for continuous batching api.
XFT PR Validation #745: Pull request #421 opened by Duyi-Wang
May 24, 2024 08:34 1d 7h 58m 33s Duyi-Wang:cb_distribute
[Kernel] Less compute for Self-Attention (Q * K)
XFT PR Validation #744: Pull request #420 opened by pujiang2018
May 24, 2024 05:18 42m 2s pujiang2018:main
Add --padding and fix bug
XFT PR Validation #743: Pull request #418 synchronize by yangkunx
May 23, 2024 09:03 1h 7m 37s yangkunx:add-arg-padding
[Kernel] Add oneDNN AMX_FP16 compute kernels.
XFT PR Validation #740: Pull request #417 opened by changqi1
May 23, 2024 01:48 1d 14h 44m 17s wenhuanh:feat/fp16_amx
[Dependency] Update torch to 2.3.0.
XFT PR Validation #739: Pull request #416 opened by Duyi-Wang
May 22, 2024 01:12 1d 11h 25m 50s Duyi-Wang:update_torch_2.3
[Kernel] Add FP16 MHA and MLP kernels.
XFT PR Validation #738: Pull request #415 synchronize by changqi1
May 21, 2024 10:09 13m 42s changqi1:changqing/feature/full_fp16_3
[Kernel] Add FP16 MHA and MLP kernels.
XFT PR Validation #737: Pull request #415 synchronize by changqi1
May 21, 2024 09:14 14m 19s changqi1:changqing/feature/full_fp16_3