huggingface / text-generation-inference Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 9.6k

Code
Issues 186
Pull requests 15
Discussions
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Pull requests: huggingface/text-generation-inference

Labels 13 Milestones 1

New pull request New

15 Open 1,435 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

General fixes for tool calling

#2954 opened Jan 24, 2025 by Trofleb

Loading…

2 of 4 tasks

Attempt to remove AWS S3 flaky cache for sccache

#2953 opened Jan 24, 2025 by mfuntowicz

Loading…

Transformers TP monkey patch support

#2951 opened Jan 24, 2025 by Cyrilvallez

Loading…

Update to attention-kernels 0.2.0

#2950 opened Jan 24, 2025 by danieldk

Loading…

5 tasks

Fix tool call response to adhere to OpenAI spec

#2949 opened Jan 24, 2025 by Datta0

Loading…

add local file read path for image which could work with dataset like…

#2948 opened Jan 24, 2025 by sywangyi

Loading…

5 tasks

Improve qwen vl impl

#2943 opened Jan 22, 2025 by drbh

Loading…

5 tasks done

llava next image encoder to allow un-aligned patch / image sizes

#2936 opened Jan 22, 2025 by jimexist

Loading…

5 tasks

Add fp8 support moe models

#2928 opened Jan 20, 2025 by mht-sharma

Loading…

5 tasks

Update Dockerfile to use devel image for compatibility

#2848 opened Dec 16, 2024 by YaserJaradeh

Loading…

2 of 5 tasks

Enable qwen2vl video

#2756 opened Nov 18, 2024 by drbh

Loading…

9 tasks done

Add llama.cpp backend

#2723 opened Nov 4, 2024 by mfuntowicz

Loading…

Get opentelemetry trace id from request headers instead of creating a new trace

#2648 opened Oct 15, 2024 by kozistr

Loading…

3 of 5 tasks

[WIP] Add gfx1100 support to AMD pytorch build

#2642 opened Oct 13, 2024 by cazlo • Draft

1 of 5 tasks

Add model_load_time metric

#2311 opened Jul 26, 2024 by Edwinhr716

Loading…

2 of 5 tasks

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly