Skip to content

neuro-inc/app-text-embeddings-inference

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

Huggingface's text-embedding-inference app

Deploy HuggingFace's 🤗 text-embedding-inference Apolo app.

Platform deploymet example:

This example deploys nomic-ai/nomic-embed-text-v1 embeddings model.

apolo run --pass-config ghcr.io/neuro-inc/app-deployment -- install https://github.com/neuro-inc/app-text-embeddings-inference \
  text-embeddings-inference tei charts/app-text-embedding-inference \
  --set timeout=600 \
  --set "model.modelHFName=nomic-ai/nomic-embed-text-v1" \
  --set "model.modelRevision=720244025c1a7e15661a174c63cce63c8218e52b" \ # optional
  --set "serverExtraArgs[0]=--max-client-batch-size=100" \  # optional
  --set "image.tag=hopper-1.2.3" \ # optional
  --set "env.HUGGING_FACE_HUB_TOKEN=YOUR_TOKEN" \ # optional
  --set "preset_name=H100x1" \  # set needed preset
  --set "ingress.enabled=True" \ # optional
  --set "ingress.clusterName=scottdc" # optional

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages