Skip to content

Commit f7a7f8a

Browse files
authored
Fix typo (#1117)
Signed-off-by: Cathy Zhang <[email protected]>
1 parent e3187be commit f7a7f8a

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

ChatQnA/benchmark/performance/kubernetes/intel/gaudi/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -100,12 +100,12 @@ python deploy.py --uninstall
100100
#### Case 2: Baseline Deployment without Rerank
101101

102102
```bash
103-
python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2
103+
python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2
104104
```
105105
#### Case 3: Tuned Deployment with Rerank
106106

107107
```bash
108-
python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2 --with-rerank --tuned
108+
python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2 --with-rerank --tuned
109109
```
110110

111111
## Benchmark

ChatQnA/benchmark/performance/kubernetes/intel/gaudi/generate_helm_values.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,7 @@ def generate_helm_values(with_rerank, num_nodes, hf_token, model_dir, node_selec
9999
},
100100
{"name": "teirerank", "resources": {"limits": {"habana.ai/gaudi": 1}}} if with_rerank else None,
101101
{"name": "tgi", "resources": {"limits": {"habana.ai/gaudi": 1}}},
102-
{"name": "retriever", "resources": {"requests": {"cpu": "8", "memory": "8000Mi"}}},
102+
{"name": "retriever-usvc", "resources": {"requests": {"cpu": "8", "memory": "8000Mi"}}},
103103
]
104104

105105
# Filter out any None values directly as part of initialization

0 commit comments

Comments
 (0)