Generic "truncate_float" class for bf16 and fp16 quantization #3591

Open
wants to merge 59 commits into
base: develop
from


richagadgil
Contributor

No description provided.

@richagadgil richagadgil requested review from a team and causten as code owners November 5, 2024 22:31
@richagadgil richagadgil self-assigned this Nov 5, 2024
@migraphx-bot
Collaborator

| Test | Batch | Rate new (ffec08) | Rate old (4b96e1) | Diff |
| --- | --- | --- | --- | --- |
| torchvision-resnet50 | 64 | 3,260.10 | 3,260.40 | -0.01% |
| torchvision-resnet50_fp16 | 64 | 6,978.76 | 6,981.88 | -0.04% |
| torchvision-densenet121 | 32 | 2,435.50 | 2,436.50 | -0.04% |
| torchvision-densenet121_fp16 | 32 | 4,070.11 | 4,081.96 | -0.29% |
| torchvision-inceptionv3 | 32 | 1,638.92 | 1,638.04 | 0.05% |
| torchvision-inceptionv3_fp16 | 32 | 2,764.34 | 2,760.86 | 0.13% |
| cadene-inceptionv4 | 16 | 775.84 | 776.56 | -0.09% |
| cadene-resnext64x4 | 16 | 811.94 | 811.67 | 0.03% |
| slim-mobilenet | 64 | 7,535.93 | 7,540.50 | -0.06% |
| slim-nasnetalarge | 64 | 211.50 | 211.49 | 0.00% |
| slim-resnet50v2 | 64 | 3,504.60 | 3,506.73 | -0.06% |
| bert-mrpc-onnx | 8 | 1,150.82 | 1,147.08 | 0.33% |
| bert-mrpc-tf | 1 | 463.44 | 465.87 | -0.52% |
| pytorch-examples-wlang-gru | 1 | 416.07 | 423.73 | -1.81% |
| pytorch-examples-wlang-lstm | 1 | 381.39 | 389.07 | -1.97% |
| torchvision-resnet50_1 | 1 | 770.18 | 788.22 | -2.29% |
| cadene-dpn92_1 | 1 | 397.62 | 402.19 | -1.14% |
| cadene-resnext101_1 | 1 | 382.61 | 382.83 | -0.06% |
| onnx-taau-downsample | 1 | 343.21 | 343.07 | 0.04% |
| dlrm-criteoterabyte | 1 | 33.32 | 33.34 | -0.05% |
| dlrm-criteoterabyte_fp16 | 1 | 52.73 | 52.75 | -0.04% |
| agentmodel | 1 | 8,446.16 | 8,325.15 | 1.45% |
| unet_fp16 | 2 | 58.86 | 58.80 | 0.10% |
| resnet50v1_fp16 | 1 | 944.40 | 953.06 | -0.91% |
| resnet50v1_int8 | 1 | 1,007.47 | 1,005.99 | 0.15% |
| bert_base_cased_fp16 | 64 | 1,171.84 | 1,170.44 | 0.12% |
| bert_large_uncased_fp16 | 32 | 363.32 | 363.37 | -0.01% |
| bert_large_fp16 | 1 | 198.91 | 198.99 | -0.04% |
| distilgpt2_fp16 | 16 | 2,205.06 | 2,201.23 | 0.17% |
| yolov5s | 1 | 540.06 | 536.00 | 0.76% |
| tinyllama | 1 | 43.43 | 43.45 | -0.04% |
| vicuna-fastchat | 1 | 173.01 | 174.10 | -0.63% |
| whisper-tiny-encoder | 1 | 419.04 | 418.74 | 0.07% |
| whisper-tiny-decoder | 1 | 428.75 | 425.97 | 0.65% |

This build is OK for merge ✅
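
For readers checking the numbers: the Diff column appears to be the relative change of the new rate against the old rate, in percent. The sketch below is an assumption about how the bot derives it, not its actual code, and `percent_diff` is a hypothetical helper name. It reproduces the reported values for the rows tried.

```python
def percent_diff(rate_new: float, rate_old: float) -> float:
    """Relative change of rate_new versus rate_old, in percent.

    Assumed formula (not taken from the bot's source):
        (new - old) / old * 100, rounded to two decimals.
    """
    return round((rate_new - rate_old) / rate_old * 100, 2)


# Rows copied from the benchmark table in this comment.
print(percent_diff(770.18, 788.22))    # torchvision-resnet50_1
print(percent_diff(8446.16, 8325.15))  # agentmodel
```

A negative value means the new commit measured slower than the old one on that test.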

@migraphx-bot
Collaborator


❌bert-mrpc-onnx: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌bert-mrpc-tf: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌pytorch-examples-wlang-gru: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌pytorch-examples-wlang-lstm: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌torchvision-resnet50_1: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌cadene-dpn92_1: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌cadene-resnext101_1: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌dlrm-criteoterabyte: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌agentmodel: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌unet: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌resnet50v1: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌bert_base_cased_fp16: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌bert_large_uncased_fp16: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌bert_large: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌yolov5s: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌tinyllama: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌vicuna-fastchat: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌whisper-tiny-encoder: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌whisper-tiny-decoder: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE


❌distilgpt2_fp16: ERROR - check error output
Traceback (most recent call last):
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 340, in <module>
main()
File "/src/AMDMIGraphX/tools/accuracy/accuracy_checker.py", line 249, in main
migraphx_arg = migraphx.argument(test_input)
RuntimeError: NumPy type info missing for N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE
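
Every failure above reports the same mangled C++ type name. As a reading aid, here is a minimal sketch that decodes just enough of the Itanium C++ ABI mangling scheme to make that name legible. `demangle_nested` is a hypothetical helper written for this note and handles only nested names with unsigned-int template literals; a real demangler such as `c++filt` covers the full grammar.

```python
import re


def demangle_nested(mangled: str) -> str:
    """Decode a small subset of Itanium C++ ABI mangling.

    Handles a nested name (N ... E) made of length-prefixed identifiers,
    optionally followed by template arguments that are unsigned-int
    literals (I Lj<value>E ... E). Anything else is out of scope.
    """
    if not (mangled.startswith("N") and mangled.endswith("E")):
        raise ValueError("only nested names are handled in this sketch")

    pos, parts = 1, []
    # Each identifier is encoded as <decimal length><characters>.
    while mangled[pos].isdigit():
        length_text = re.match(r"\d+", mangled[pos:]).group()
        start = pos + len(length_text)
        end = start + int(length_text)
        parts.append(mangled[start:end])
        pos = end

    name = "::".join(parts)
    # "I" opens a template argument list; "Lj<n>E" is an unsigned int literal.
    if mangled[pos] == "I":
        args = re.findall(r"Lj(\d+)E", mangled[pos:])
        name += "<" + ", ".join(value + "u" for value in args) + ">"
    return name


print(demangle_nested("N8migraphx14version_2_11_013generic_floatILj7ELj8ELj0EEE"))
```

This decodes to `migraphx::version_2_11_0::generic_float<7u, 8u, 0u>`. If the first two template arguments are mantissa and exponent widths (an assumption, not confirmed in this thread), then 7 and 8 match the bfloat16 layout, a type for which stock NumPy has no built-in dtype. That would be consistent with the "NumPy type info missing" message raised at `migraphx.argument(test_input)`.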

@richagadgil richagadgil changed the base branch from develop to bf16 November 18, 2024 21:39
@richagadgil richagadgil changed the base branch from bf16 to develop November 18, 2024 21:39
4 participants