Deploy HuggingFace's 🤗 text-embedding-inference Apolo app.
This example deploys nomic-ai/nomic-embed-text-v1
embeddings model.
apolo run --pass-config ghcr.io/neuro-inc/app-deployment -- install https://github.com/neuro-inc/app-text-embeddings-inference \
text-embeddings-inference tei charts/app-text-embedding-inference \
--set timeout=600 \
--set "model.modelHFName=nomic-ai/nomic-embed-text-v1" \
--set "model.modelRevision=720244025c1a7e15661a174c63cce63c8218e52b" \ # optional
--set "serverExtraArgs[0]=--max-client-batch-size=100" \ # optional
--set "image.tag=hopper-1.2.3" \ # optional
--set "env.HUGGING_FACE_HUB_TOKEN=YOUR_TOKEN" \ # optional
--set "preset_name=H100x1" \ # set needed preset
--set "ingress.enabled=True" \ # optional
--set "ingress.clusterName=scottdc" # optional