Fix typo (#1117)
Signed-off-by: Cathy Zhang <[email protected]>
bjzhjing authored Nov 12, 2024
1 parent e3187be commit f7a7f8a
Showing 2 changed files with 3 additions and 3 deletions.
@@ -100,12 +100,12 @@ python deploy.py --uninstall
#### Case 2: Baseline Deployment without Rerank
```bash
-python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2
+python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2
```
#### Case 3: Tuned Deployment with Rerank
```bash
-python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2 --with-rerank --tuned
+python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2 --with-rerank --tuned
```
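The corrected flags above are hyphenated (`--hf-token`, `--model-dir`). As a minimal sketch, assuming `deploy.py` parses its options with `argparse` (the commit does not show the parser, so `build_parser` and its defaults here are hypothetical), hyphenated option names map to underscore attributes such as `hf_token` and `model_dir`:

```python
import argparse


def build_parser():
    # Hypothetical parser for illustration only; the real deploy.py
    # may define more options, different defaults, or different help text.
    parser = argparse.ArgumentParser(prog="deploy.py")
    parser.add_argument("--hf-token", help="Hugging Face token")
    parser.add_argument("--model-dir", help="Directory holding model files")
    parser.add_argument("--num-nodes", type=int, default=1)
    parser.add_argument("--with-rerank", action="store_true")
    parser.add_argument("--tuned", action="store_true")
    parser.add_argument("--uninstall", action="store_true")
    return parser


# Parse the Case 3 command line with placeholder values.
args = build_parser().parse_args(
    ["--hf-token", "token", "--model-dir", "/models",
     "--num-nodes", "2", "--with-rerank", "--tuned"]
)

# argparse converts hyphens in option names to underscores in attribute names.
print(args.hf_token, args.model_dir, args.num_nodes, args.with_rerank, args.tuned)
```

Under this assumption, the old spellings `--hftoken` and `--modeldir` would not match any defined option, which is consistent with the README needing the fix.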
## Benchmark
@@ -99,7 +99,7 @@ def generate_helm_values(with_rerank, num_nodes, hf_token, model_dir, node_selec
},
{"name": "teirerank", "resources": {"limits": {"habana.ai/gaudi": 1}}} if with_rerank else None,
{"name": "tgi", "resources": {"limits": {"habana.ai/gaudi": 1}}},
-{"name": "retriever", "resources": {"requests": {"cpu": "8", "memory": "8000Mi"}}},
+{"name": "retriever-usvc", "resources": {"requests": {"cpu": "8", "memory": "8000Mi"}}},
]

# Filter out any None values directly as part of initialization
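The hunk above builds a list in which the `teirerank` entry is included only when `with_rerank` is set, then drops the `None` placeholder. A minimal sketch of that pattern follows; `build_resource_entries` is a hypothetical helper name, and the real `generate_helm_values` contains more entries and surrounding logic that this commit does not show:

```python
def build_resource_entries(with_rerank):
    # Hypothetical helper: mirrors only the entries visible in the diff.
    entries = [
        # Conditional entry: evaluates to None when reranking is disabled.
        {"name": "teirerank", "resources": {"limits": {"habana.ai/gaudi": 1}}} if with_rerank else None,
        {"name": "tgi", "resources": {"limits": {"habana.ai/gaudi": 1}}},
        {"name": "retriever-usvc", "resources": {"requests": {"cpu": "8", "memory": "8000Mi"}}},
    ]
    # Keep only real entries so downstream code never sees None.
    return [entry for entry in entries if entry is not None]


names_with = [entry["name"] for entry in build_resource_entries(True)]
names_without = [entry["name"] for entry in build_resource_entries(False)]
print(names_with)
print(names_without)
```

Comparing against `None` explicitly, rather than relying on truthiness, keeps the filter from discarding an entry that happens to be an empty dict.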
