Skip to content

[Bugfix] Enable loading FP8 checkpoints for gpt_bigcode models #5460

Merged
robertgshaw2-redhat merged 3 commits intovllm-project:mainfrom tdoublep:gpt_bigcode_fp8Jun 14, 2024

Commits

Commits on Jun 12, 2024

Commits on Jun 14, 2024