Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Trying to put back the archlist (to fix the oom).
#2947 opened Jan 23, 2025 by Narsil Loading…
5 tasks
Improve qwen vl impl
#2943 opened Jan 22, 2025 by drbh Loading…
5 tasks done
llava next image encoder to allow un-aligned patch / image sizes
#2936 opened Jan 22, 2025 by jimexist Loading…
5 tasks
Add fp8 support moe models
#2928 opened Jan 20, 2025 by mht-sharma Loading…
5 tasks
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh Loading…
2 of 5 tasks
Enable qwen2vl video
#2756 opened Nov 18, 2024 by drbh Loading…
9 tasks done
Add llama.cpp backend
#2723 opened Nov 4, 2024 by mfuntowicz Loading…
[WIP] Add gfx1100 support to AMD pytorch build
#2642 opened Oct 13, 2024 by cazlo Draft
1 of 5 tasks
Add model_load_time metric
#2311 opened Jul 26, 2024 by Edwinhr716 Loading…
2 of 5 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.