Deploy VisualQnA in a Kubernetes Cluster

This document outlines the deployment process for a Visual Question Answering (VisualQnA) application that utilizes the GenAIComps microservice components on Intel Xeon servers and Gaudi machines.

If GMC is not already installed in your Kubernetes cluster, install it by following the steps in the "Getting Started" section of GMC Install. Images will soon be published to Docker Hub, at which point no local builds will be required, which simplifies installation further.

If you have only Intel Xeon machines, use the visualqna_xeon.yaml file; if you have a Gaudi cluster, use visualqna_gaudi.yaml. The example below uses Xeon.

Deploy the VisualQnA application

Create the desired namespace if it does not already exist and deploy the application

```
# Kubernetes namespace names must be lowercase (RFC 1123 label).
export APP_NAMESPACE=ct
kubectl create ns $APP_NAMESPACE
sed -i "s|namespace: visualqna|namespace: $APP_NAMESPACE|g" ./visualqna_xeon.yaml
kubectl apply -f ./visualqna_xeon.yaml
```
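
Kubernetes rejects namespace names that are not valid RFC 1123 labels: at most 63 characters, lowercase alphanumerics or `-`, starting and ending with an alphanumeric. A minimal sketch of a pre-flight check follows; `is_valid_namespace` is a hypothetical helper and is not part of GMC or kubectl.

```shell
# Hypothetical helper (not part of GMC or kubectl): returns 0 when the
# argument is a valid RFC 1123 label, which Kubernetes requires for namespaces.
is_valid_namespace() {
  name="$1"
  # At most 63 characters.
  [ "${#name}" -le 63 ] || return 1
  # Lowercase alphanumerics or '-', starting and ending with an alphanumeric.
  printf '%s\n' "$name" | grep -Eq '^[a-z0-9]([-a-z0-9]*[a-z0-9])?$'
}
```

One way to use it is to guard the namespace creation, for example `is_valid_namespace "$APP_NAMESPACE" && kubectl create ns "$APP_NAMESPACE"`.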

Check if the application is up and ready
```
kubectl get pods -n $APP_NAMESPACE
```
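
Instead of polling `kubectl get pods` by hand, the readiness check can be scripted with `kubectl wait`. The sketch below assumes the application pods have already been created (with `--all`, `kubectl wait` reports an error if no pods match); the wrapper name `wait_for_pods` and the 300s default timeout are illustrative choices, not values from GMC.

```shell
# Hypothetical wrapper: block until every pod in the namespace reports Ready,
# or fail once the timeout (default 300s, an arbitrary choice) expires.
wait_for_pods() {
  ns="$1"
  timeout="${2:-300s}"
  kubectl wait --for=condition=Ready pod --all -n "$ns" --timeout="$timeout"
}
```

For example, `wait_for_pods "$APP_NAMESPACE" 600s` allows extra time for large model images to be pulled.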

Deploy a client pod for testing

```
kubectl create deployment client-test -n $APP_NAMESPACE --image=python:3.8.13 -- sleep infinity
```

Check that the client pod is ready
```
kubectl get pods -n $APP_NAMESPACE
```
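
Because the client runs as a Deployment named client-test, its readiness can also be checked with `kubectl rollout status`, which returns once the rollout completes. This is a sketch only; the wrapper name `wait_for_client` and the 120s default timeout are assumptions.

```shell
# Hypothetical wrapper: wait for the client-test Deployment to finish rolling
# out, giving up after the timeout (default 120s, an arbitrary choice).
wait_for_client() {
  ns="$1"
  timeout="${2:-120s}"
  kubectl rollout status deployment/client-test -n "$ns" --timeout="$timeout"
}
```

Calling `wait_for_client "$APP_NAMESPACE"` before sending the request avoids running `kubectl exec` against a pod that is still starting.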

Send a request to the application

```
export CLIENT_POD=$(kubectl get pod -n $APP_NAMESPACE -l app=client-test -o jsonpath='{.items..metadata.name}')
export accessUrl=$(kubectl get gmc -n $APP_NAMESPACE -o jsonpath="{.items[?(@.metadata.name=='visualqna')].status.accessUrl}")
# The response is written under $LOG_PATH, or the current directory if LOG_PATH is unset.
kubectl exec "$CLIENT_POD" -n $APP_NAMESPACE -- curl "$accessUrl" -X POST -d '{"messages": [
   {
     "role": "user",
     "content": [
       {
         "type": "text",
         "text": "What'\''s in this image?"
       },
       {
         "type": "image_url",
         "image_url": {
           "url": "https://www.ilankelman.org/stopsigns/australia.jpg"
         }
       }
     ]
   }
 ],
 "max_tokens": 128}' -H 'Content-Type: application/json' > "${LOG_PATH:-.}/gmc_visualqna.log"
```

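The inline JSON in the request above needs the `'\''` escape for the apostrophe. One alternative, sketched here under assumptions, is to print the body from a quoted heredoc and stream it to curl over stdin: `kubectl exec -i` forwards stdin to the container, and `curl -d @-` reads the request body from stdin. The function names `visualqna_payload` and `send_visualqna_request` are hypothetical.

```shell
# Print the request body. The quoted heredoc delimiter ('EOF') disables shell
# expansion, so the apostrophe needs no escaping.
visualqna_payload() {
  cat <<'EOF'
{"messages": [{"role": "user", "content": [
  {"type": "text", "text": "What's in this image?"},
  {"type": "image_url", "image_url": {"url": "https://www.ilankelman.org/stopsigns/australia.jpg"}}
]}], "max_tokens": 128}
EOF
}

# Hypothetical wrapper: $1 = client pod name, $2 = namespace, $3 = access URL.
# The payload is piped through kubectl exec into curl inside the pod.
send_visualqna_request() {
  visualqna_payload | kubectl exec -i "$1" -n "$2" -- \
    curl -s "$3" -X POST -H 'Content-Type: application/json' -d @-
}
```

A call such as `send_visualqna_request "$CLIENT_POD" "$APP_NAMESPACE" "$accessUrl"` prints the response to stdout, where it can be redirected to a log file as in the command above.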