# FLAN-T5 XXL

FLAN-T5 combines two things: an instruction-finetuning method and a base model. FLAN stands for Finetuned Language Net, and T5 is a text-to-text language model developed and published by Google in 2020. FLAN-T5 improves on T5 by finetuning it on instruction data, which significantly improves its zero-shot performance. FLAN-T5 comes in several variants based on the number of parameters: FLAN-T5 small (80M), FLAN-T5 base (250M), FLAN-T5 large (780M), FLAN-T5 XL (3B), and FLAN-T5 XXL (11B). This deployment uses the XXL variant with 11B parameters.

Hugging Face repository: https://huggingface.co/google/flan-t5-xxl
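
For reference, the same checkpoint can also be queried directly with the Hugging Face transformers library, outside of this deployment. The following is a minimal sketch, assuming `transformers`, `accelerate`, and `sentencepiece` are installed and a GPU with enough memory is available; it is not the code this deployment runs.

```python
# Minimal sketch: running google/flan-t5-xxl locally with Hugging Face transformers.
# Assumes `pip install transformers accelerate sentencepiece` and a CUDA GPU.
import torch
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-xxl")

# device_map="auto" lets accelerate place the 11B-parameter checkpoint shards
# across the available devices; float16 roughly halves the memory footprint.
model = T5ForConditionalGeneration.from_pretrained(
    "google/flan-t5-xxl",
    device_map="auto",
    torch_dtype=torch.float16,
)

inputs = tokenizer(
    "Translate English to German: How old are you?",
    return_tensors="pt",
).to("cuda")

outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```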

## Usage

Minimum GPU: NVIDIA RTX 3090. On first start, model files totaling about 45 GB are downloaded. After that, the checkpoint shards are loaded, which also takes some time. To run, just deploy this SDL and wait for the model to finish loading, watching the progress in the logs.
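
Instead of watching the logs, you can also poll the deployment from outside until its web server comes up. The sketch below is a hypothetical readiness check: the URL is a placeholder for your own deployment's lease URI, not a value defined by this repository.

```python
# Hypothetical readiness probe for the deployed web application.
# DEPLOYMENT_URL is a placeholder: substitute the URI of your own lease.
import time

import requests

DEPLOYMENT_URL = "http://example-deployment-host:8080/"  # placeholder, not from this repo

while True:
    try:
        # Any HTTP response means the web server is up,
        # i.e. model loading has finished.
        response = requests.get(DEPLOYMENT_URL, timeout=5)
        print(f"Ready: HTTP {response.status_code}")
        break
    except requests.exceptions.RequestException:
        print("Still loading, retrying in 30 s...")
        time.sleep(30)
```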

## Logs

The logs in the screenshot below show that model loading has completed, the application web server has started, and the application is ready to work:

Screenshot_20230704_225210

## Demo Video

Peek.2023-07-05.02-16.mp4