Bf16 gpu support #3630

richagadgil · 2024-11-19T00:25:45Z

No description provided.

codecov · 2024-11-19T02:42:51Z

Codecov Report

Attention: Patch coverage is 12.50000% with 7 lines in your changes missing coverage. Please review.

Project coverage is 92.14%. Comparing base (b39a938) to head (59eec66).
Report is 3 commits behind head on develop.

Files with missing lines	Patch %	Lines
src/quantization.cpp	0.00%	7 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #3630      +/-   ##
===========================================
- Coverage    92.18%   92.14%   -0.04%     
===========================================
  Files          513      513              
  Lines        21576    21584       +8     
===========================================
- Hits         19889    19888       -1     
- Misses        1687     1696       +9

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚨 Try these New Features:

Flaky Tests Detection - Detect and resolve failed and flaky tests

pfultz2 · 2024-11-19T14:33:27Z

src/targets/gpu/kernels/include/migraphx/kernels/types.hpp

 using half  = _Float16;
 using half2 = migraphx::vec<half, 2>;
+using bf16 =  ushort;


There is duplicate typedefs here and they are both wrong. It should be using bf16 = __hip_bfloat16.

migraphx-bot · 2024-11-20T05:23:07Z

Test	Batch	Rate new 59eec6	Rate old 0f36aa	Diff	Compare
torchvision-resnet50	64	3,247.51	3,261.99	-0.44%	✅
torchvision-resnet50_fp16	64	2,766.25	6,984.41	-60.39%	🔴
torchvision-densenet121	32	2,264.43	2,434.46	-6.98%	🔴
torchvision-densenet121_fp16	32	3,889.55	4,068.77	-4.40%	🔴
torchvision-inceptionv3	32	1,634.16	1,630.14	0.25%	✅
torchvision-inceptionv3_fp16	32	2,752.52	2,746.22	0.23%	✅
cadene-inceptionv4	16	761.60	765.59	-0.52%	✅
cadene-resnext64x4	16	806.14	809.78	-0.45%	✅
slim-mobilenet	64	7,356.64	7,474.57	-1.58%	✅
slim-nasnetalarge	64	208.59	208.58	0.00%	✅
slim-resnet50v2	64	3,436.22	3,441.49	-0.15%	✅
bert-mrpc-onnx	8	1,150.74	1,150.80	-0.01%	✅
bert-mrpc-tf	1	482.60	465.54	3.66%	🔆
pytorch-examples-wlang-gru	1	424.90	420.06	1.15%	✅
pytorch-examples-wlang-lstm	1	174.63	381.98	-54.28%	🔴
torchvision-resnet50_1	1	771.08	750.44	2.75%	✅
cadene-dpn92_1	1	405.44	398.35	1.78%	✅
cadene-resnext101_1	1	381.56	382.96	-0.37%	✅
onnx-taau-downsample	1	159.45	346.08	-53.93%	🔴
dlrm-criteoterabyte	1	33.34	33.35	-0.01%	✅
dlrm-criteoterabyte_fp16	1	52.74	52.68	0.10%	✅
agentmodel	1	8,250.55	8,091.53	1.97%	✅
unet_fp16	2	58.96	58.77	0.33%	✅
resnet50v1_fp16	1	1,004.59	943.16	6.51%	🔆
resnet50v1_int8	1	998.84	1,012.12	-1.31%	✅
bert_base_cased_fp16	64	1,170.38	1,169.97	0.04%	✅
bert_large_uncased_fp16	32	363.05	363.75	-0.19%	✅
bert_large_fp16	1	198.54	199.03	-0.25%	✅
distilgpt2_fp16	16	2,200.89	2,201.98	-0.05%	✅
yolov5s	1	531.48	539.79	-1.54%	✅
tinyllama	1	43.41	43.42	-0.01%	✅
vicuna-fastchat	1	168.79	175.75	-3.96%	🔴
whisper-tiny-encoder	1	417.55	418.02	-0.11%	✅
whisper-tiny-decoder	1	428.26	428.37	-0.03%	✅

This build is not recommended to merge 🔴

migraphx-bot · 2024-11-20T05:23:09Z

✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

✅ agentmodel: PASSED: MIGraphX meets tolerance

✅ unet: PASSED: MIGraphX meets tolerance

✅ resnet50v1: PASSED: MIGraphX meets tolerance

✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance

🔴bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

✅ bert_large: PASSED: MIGraphX meets tolerance

✅ yolov5s: PASSED: MIGraphX meets tolerance

✅ tinyllama: PASSED: MIGraphX meets tolerance

✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance

✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance

richagadgil added 30 commits October 10, 2024 17:53

first pass at integrating generic float

c51c1ce

fix namespaces

134b408

fix mantissa

d4fa6eb

refactor

0b60841

refactor

7a646f1

add fp

ebe819b

fixed generic float class

379a77a

add fp32 test

174384c

remove import

787b651

update tests

1d1fa1c

fp16 tests that work

1791092

update tests

a2eb005

updated fp16 and fp32 tests

ff8ffc7

half tests

e36fd65

underflow and overflow tests

9ac4e2a

generate map

f05fd31

add more tests

cb4d92d

fix names

0cc1946

update tests

85a761b

remove and

65cf9ae

disable warning

fbabf54

fix tidy warning

549f5e6

migraphx py fix

d302e5d

add increments

8d475e3

fix warnings

a0fd055

disable duplicate branch warning

41379fe

add countzero_std

0c29c7b

ci error

4b012a8

simplify countl

dbaa3a8

fix ci

b2bd2a0

richagadgil and others added 22 commits November 4, 2024 10:20

Update bf16.cpp

b9d204e

Update generic_float.hpp

fb6df2d

Merge branch 'develop' into bf16

bb78138

add extra common type

8e1f99e

tidy

6192970

Update bf16.hpp

c0d6bc4

Update generic_float.hpp

7bfc407

Merge branch 'develop' into bf16

4cb96ad

remove imports

ffd4ba2

Merge branch 'develop' into bf16

8a10da3

ref tests

1565a0e

migraphx_py fix

e6d1155

fix test cae by index

867e960

add rocblas type

9852da5

fix tgts err

bf50653

address changes

0ebd220

Merge branch 'develop' into bf16

043e322

bf16 gpu support

a3ca184

add vector types

490d326

rocblas

a63ac1e

bf16 gpu testing

94990bb

mlir bf16

8aaae90

pfultz2 reviewed Nov 19, 2024

View reviewed changes

richagadgil added 3 commits November 19, 2024 11:25

fix type

208232e

fix type

d4866d5

add type

59eec66

richagadgil self-assigned this Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bf16 gpu support #3630

Bf16 gpu support #3630

richagadgil commented Nov 19, 2024

codecov bot commented Nov 19, 2024 •

edited

Loading

pfultz2 Nov 19, 2024

migraphx-bot commented Nov 20, 2024

migraphx-bot commented Nov 20, 2024

Bf16 gpu support #3630

Are you sure you want to change the base?

Bf16 gpu support #3630

Conversation

richagadgil commented Nov 19, 2024

codecov bot commented Nov 19, 2024 • edited Loading

Codecov Report

pfultz2 Nov 19, 2024

Choose a reason for hiding this comment

migraphx-bot commented Nov 20, 2024

migraphx-bot commented Nov 20, 2024

codecov bot commented Nov 19, 2024 •

edited

Loading