This repo describes how to deploy Ollama on OpenShift, either on CPU only or on NVIDIA GPU nodes.
Prerequisites
- OpenShift >= 4.15
- A GPU worker node with at least 16 GB of GPU memory (needed only for the GPU deployment), for example on AWS:
  - g4dn.2xlarge
  - g5.2xlarge
Use CPU only
# set up ollama (the loop re-runs the apply until it succeeds)
until oc apply -k deploy; do : ; done
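
The `until ... do : ; done` loop above retries immediately and without limit, so an apply that can never succeed will spin forever. A bounded variant is sketched below. `retry_cmd` and its attempt and delay arguments are hypothetical helpers for illustration, not part of this repo.

```shell
# retry_cmd MAX_TRIES DELAY_SECONDS COMMAND [ARGS...]
# Hypothetical helper: runs COMMAND until it exits 0, giving up
# after MAX_TRIES attempts and sleeping DELAY_SECONDS in between.
retry_cmd() {
  max_tries="$1"
  delay="$2"
  shift 2
  attempt=1
  until "$@"; do
    if [ "$attempt" -ge "$max_tries" ]; then
      echo "giving up after $attempt attempts: $*" >&2
      return 1
    fi
    attempt=$((attempt + 1))
    sleep "$delay"
  done
}

# Example (values are arbitrary): try for up to 30 attempts, 10 seconds apart
# retry_cmd 30 10 oc apply -k deploy
```

The helper returns non-zero when it gives up, so a calling script can stop instead of continuing with a half-applied deployment.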
Use NVIDIA GPUs
# set up NVIDIA GPU nodes
until oc apply -k deploy/nvidia-gpu-autoscale; do : ; done
# set up ollama, then apply the GPU configuration on top
until oc apply -k deploy; do : ; done
until oc apply -k deploy/ollama-gpu; do : ; done
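
Before applying `deploy/ollama-gpu`, it can help to confirm that at least one GPU node has joined the cluster. The sketch below assumes the GPU setup installs the NVIDIA GPU Operator, whose feature discovery labels GPU nodes with `nvidia.com/gpu.present=true`. That label and the `gpu_node_count` helper are assumptions, not something this repo documents.

```shell
# gpu_node_count prints how many nodes carry the GPU label.
# Assumption: nodes are labelled nvidia.com/gpu.present=true
# (set by the NVIDIA GPU Operator's feature discovery).
gpu_node_count() {
  oc get nodes -l nvidia.com/gpu.present=true -o name | wc -l | tr -d ' '
}

# Example: stop early if no GPU node is available yet
# [ "$(gpu_node_count)" -gt 0 ] || echo "no GPU nodes found yet" >&2
```

If the count stays at zero, the GPU nodes may still be provisioning, so re-running the check after a few minutes is a reasonable first step.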