Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

General fixes for tool calling
#2954 opened Jan 24, 2025 by Trofleb Loading…
2 of 4 tasks
Attempt to remove AWS S3 flaky cache for sccache
#2953 opened Jan 24, 2025 by mfuntowicz Loading…
Transformers TP monkey patch support
#2951 opened Jan 24, 2025 by Cyrilvallez Loading…
Update to attention-kernels 0.2.0
#2950 opened Jan 24, 2025 by danieldk Loading…
5 tasks
Fix tool call response to adhere to OpenAI spec
#2949 opened Jan 24, 2025 by Datta0 Loading…
Improve qwen vl impl
#2943 opened Jan 22, 2025 by drbh Loading…
5 tasks done
llava next image encoder to allow un-aligned patch / image sizes
#2936 opened Jan 22, 2025 by jimexist Loading…
5 tasks
Add fp8 support moe models
#2928 opened Jan 20, 2025 by mht-sharma Loading…
5 tasks
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh Loading…
2 of 5 tasks
Enable qwen2vl video
#2756 opened Nov 18, 2024 by drbh Loading…
9 tasks done
Add llama.cpp backend
#2723 opened Nov 4, 2024 by mfuntowicz Loading…
[WIP] Add gfx1100 support to AMD pytorch build
#2642 opened Oct 13, 2024 by cazlo Draft
1 of 5 tasks
Add model_load_time metric
#2311 opened Jul 26, 2024 by Edwinhr716 Loading…
2 of 5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.