Using vLLM or TGI #7688
Hi, I have a question: I'm unsure whether to use vLLM or TGI (https://github.com/huggingface/text-generation-inference). How can I tell whether a model will work with vLLM or with TGI before I actually try to bring the model up? Is there a way to know in advance whether the model will start successfully? Can I use any parameter in the model's config.json (one that is present for all models) to determine that a given model will work with vLLM and not TGI, or vice versa? FYI: the models I'm looking to host are in HuggingFace format.
Replies: 1 comment 1 reply
The `architectures` field in `config.json` should be what you want to cross-check: https://docs.vllm.ai/en/latest/models/supported_models.html