[GenAIOrchestrator] add huggingface tgi - 1665 #1702

morgandiverrez · 2024-07-26T13:14:25Z

add LLM provider hugging face tgi
issue 1665

morgandiverrez · 2024-07-26T13:19:47Z

Do not merge until the PR on huggingface_hub is merged. The pyproject.toml points to a fork of this library in the meantime.

morgandiverrez · 2024-08-14T09:32:24Z

PR on huggingface_hub is merged, ready for review

Benvii · 2024-08-27T16:05:12Z

This PR installs a tons of dependencies for instance :

nvidia-cudnn-cu12 (9.1.0.70): Downloading... 10%
• Installing nvidia-cufft-cu12 (11.0.2.54): Downloading... 60%
• Installing nvidia-cusolver-cu12 (11.4.5.107): Downloading... 59%
• Installing nvidia-nccl-cu12
• Installing torch (2.4.0): Downloading... 44%

It seems quite huge, I need to take a deeper look at that, a simple "huggingface hub client" shouldn't require any nvidia (cuda) dependencies, maybe you can isolate a sub group of dependencies. Otherwise the docker image will become too usage

morgandiverrez · 2024-09-03T08:30:31Z

The langchain_huggingface library heavily relies on other libraries, such as CUDA and PyTorch. I found a discussion on GitHub about this: langchain-ai/langchain#24482

morgandiverrez · 2024-09-05T20:56:55Z

A discussion was create on langchain about this probleme

Benvii · 2024-11-20T10:19:08Z

We are discussing this integration internally at CM Arkéa, we are no longer sure that TGI will be used as our inference server meaning we wouldn't be able to maintain this integration.

Waiting for a clear position on this subject on our side (test in progress to use OpenAI integration for vLLM). If any tock community user uses TGI and want this integration feel free to leave a comment here.

morgandiverrez self-assigned this Jul 26, 2024

morgandiverrez requested a review from Benvii July 26, 2024 13:14

morgandiverrez linked an issue Jul 26, 2024 that may be closed by this pull request

[GenAIOrchestrator] Intégrer hugging face TGI #1665

Open

morgandiverrez marked this pull request as ready for review July 26, 2024 13:15

morgandiverrez requested a review from assouktim July 26, 2024 13:15

morgandiverrez marked this pull request as draft July 26, 2024 13:23

Benvii mentioned this pull request Aug 6, 2024

[GenaAI orchestrator] Add TGI connection with proxy #1705

Closed

morgandiverrez marked this pull request as ready for review August 14, 2024 09:32

Morgan Diverrez added 2 commits August 27, 2024 17:04

[DERCBOT-874] add TGI integration

d9b7253

new pyproject after huggingface_hub PR release

def2e65

morgandiverrez force-pushed the 1665-genaiorchestrator-integrer-hugging-face-tgi branch from 328fcdf to def2e65 Compare August 27, 2024 15:11

WIP

7727fd2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GenAIOrchestrator] add huggingface tgi - 1665 #1702

[GenAIOrchestrator] add huggingface tgi - 1665 #1702

morgandiverrez commented Jul 26, 2024

morgandiverrez commented Jul 26, 2024 •

edited

Loading

morgandiverrez commented Aug 14, 2024

Benvii commented Aug 27, 2024

morgandiverrez commented Sep 3, 2024

morgandiverrez commented Sep 5, 2024

Benvii commented Nov 20, 2024

[GenAIOrchestrator] add huggingface tgi - 1665 #1702

Are you sure you want to change the base?

[GenAIOrchestrator] add huggingface tgi - 1665 #1702

Conversation

morgandiverrez commented Jul 26, 2024

morgandiverrez commented Jul 26, 2024 • edited Loading

morgandiverrez commented Aug 14, 2024

Benvii commented Aug 27, 2024

morgandiverrez commented Sep 3, 2024

morgandiverrez commented Sep 5, 2024

Benvii commented Nov 20, 2024

morgandiverrez commented Jul 26, 2024 •

edited

Loading