Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' of https://github.com/microsoft/onnxruntime-genai into user/pavignol/add-dml-fp16-cast
Showing 8 changed files with 76 additions and 128 deletions.
@@ -1,8 +1,4 @@
-# Description: Example of generate end-to-end usage, including model building and running.
-pip install numpy
-pip install transformers
-pip install torch
-pip install onnx
-pip install onnxruntime-gpu
+# Description: Example of generate end-to-end usage, including model building and running
+pip install numpy transformers torch onnx onnxruntime
 python3 -m onnxruntime_genai.models.builder -m microsoft/phi-2 -o genai_models/phi2-int4-cpu -p int4 -e cpu -c hf_cache
-python3 model-generate.py -m genai_models/phi2-int4-cpu -pr "my favorite movie is" "write a function that always returns True" "I am very happy" -ep cpu -p 0.0 -k 1 -v
+python3 model-generate.py -m genai_models/phi2-int4-cpu -pr "my favorite movie is" "write a function that always returns True" "I am very happy" -p 0.0 -k 1 -v
2 changes: 1 addition & 1 deletion
examples/python/chat-e2e-example.sh → examples/python/qa-e2e-example.sh
@@ -1,3 +1,3 @@
 # Description: Example of chatbot end-to-end usage, including model building and running.
 python3 -m onnxruntime_genai.models.builder -m microsoft/phi-2 -o genai_models/phi2-int4-cpu -p int4 -e cpu -c hf_cache
-python3 model-chat.py -m genai_models/phi2-int4-cpu -ep cpu -p 0.0 -k 1
+python3 model-qa.py -m genai_models/phi2-int4-cpu -ep cpu -p 0.0 -k 1
@@ -0,0 +1,8 @@
+-f https://download.pytorch.org/whl/torch_stable.html
+torch==2.2.1+cu118
+numpy
+pytest
+onnx
+onnxruntime_gpu
+transformers
+huggingface_hub[cli]
21 changes: 0 additions & 21 deletions
tools/ci_build/github/linux/docker/inference/x64/default/gpu/Dockerfile
This file was deleted.
12 changes: 0 additions & 12 deletions
tools/ci_build/github/linux/docker/inference/x64/default/gpu/scripts/install_centos.sh
This file was deleted.
53 changes: 0 additions & 53 deletions
tools/ci_build/github/linux/docker/inference/x64/default/gpu/scripts/install_deps.sh
This file was deleted.