Half precision support #1257

yhmtsai · 2023-01-12T05:30:02Z

This PR adds half precision (fp16) support to Ginkgo.
Some related notes on compiler and platform support:

GCC 12+ supports _Float16
Clang 15+ supports _Float16
CUDA 10+ supports 16-bit atomicAdd on compute capability >= 7.0 (atomicCAS has the same requirement, although the version is only mentioned under atomicAdd)
HIP does not support 16-bit atomics
SYCL (I am not sure)
C++23 supports float16 and bfloat16
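
For the atomics items above, one common workaround when no native 16-bit atomic is available is to run a compare-and-swap loop on the 32-bit word that contains the 16-bit value. The following is only a host-side C++ sketch of that idea using `std::atomic`; the name `atomic_update_16` is hypothetical, and this is not necessarily how this PR (or any vendor library) handles it.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of Ginkgo): atomically applies `op` to one
// 16-bit lane of a 32-bit word by looping on a 32-bit compare-and-swap.
// `upper` selects bits 16..31, otherwise bits 0..15.
// Returns the lane value observed before the update.
template <typename Op>
std::uint16_t atomic_update_16(std::atomic<std::uint32_t>& word, bool upper,
                               Op op)
{
    const unsigned shift = upper ? 16u : 0u;
    const std::uint32_t mask = 0xffffu << shift;
    std::uint32_t old_word = word.load(std::memory_order_relaxed);
    std::uint32_t new_word = 0;
    do {
        // Extract the lane, update it, and splice it back into the word.
        const auto old_lane =
            static_cast<std::uint16_t>((old_word & mask) >> shift);
        const std::uint16_t new_lane = op(old_lane);
        new_word = (old_word & ~mask) |
                   (static_cast<std::uint32_t>(new_lane) << shift);
        // On failure, compare_exchange_weak reloads old_word and we retry.
    } while (!word.compare_exchange_weak(old_word, new_word,
                                         std::memory_order_relaxed));
    return static_cast<std::uint16_t>((old_word & mask) >> shift);
}
```

In a device kernel, `op` would reinterpret the 16 bits as a half value, add to it, and convert back; the neighbouring lane in the same word is left untouched by the update.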

~~Cuda 9.2 __half does not contain +=. (I think it can be added by more operator overload outside)~~
ROCm 4.0 cannot convert __half to double. (I do not think this can be added from outside?)
I will disable half support for these two versions.

There are two additional commits: one fixes oneAPI6 (#1251) and the other contains multigrid experiments.
They will be cleaned up afterwards.

The following PR is related to half:

Discussion:
Do we still need to use the vendor's float2half in gko::half?
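
To make the question above concrete: a vendor-independent conversion can be written with plain bit manipulation. The following is a minimal sketch, assuming IEEE 754 binary16/binary32 layouts and round-to-nearest-even; the function names `float_to_half_bits` and `half_bits_to_float` are hypothetical and are not taken from `gko::half`.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Hypothetical helper: float -> binary16 bit pattern, round-to-nearest-even.
inline std::uint16_t float_to_half_bits(float value)
{
    std::uint32_t f;
    std::memcpy(&f, &value, sizeof f);
    const std::uint32_t sign = (f >> 16) & 0x8000u;
    const std::uint32_t abs = f & 0x7fffffffu;
    std::uint32_t half = 0;
    if (abs >= 0x7f800000u) {
        // Inf or NaN; force a mantissa bit so that NaN stays NaN.
        half = 0x7c00u | (abs > 0x7f800000u ? 0x0200u : 0u);
    } else if (abs >= 0x477ff000u) {
        // >= 65520 rounds past the largest finite half (65504): infinity.
        half = 0x7c00u;
    } else if (abs >= 0x38800000u) {
        // Normal half range: rebias the exponent (127 -> 15), drop 13 bits.
        const std::uint32_t rebased = abs - 0x38000000u;
        const std::uint32_t rem = rebased & 0x1fffu;
        half = rebased >> 13;
        if (rem > 0x1000u || (rem == 0x1000u && (half & 1u))) ++half;
    } else if (abs >= 0x33000000u) {
        // Subnormal half: result is round(value * 2^24).
        const std::uint32_t shift = 126u - (abs >> 23);  // 14..24
        const std::uint32_t mant = (abs & 0x007fffffu) | 0x00800000u;
        const std::uint32_t rem = mant & ((1u << shift) - 1u);
        const std::uint32_t halfway = 1u << (shift - 1u);
        half = mant >> shift;
        if (rem > halfway || (rem == halfway && (half & 1u))) ++half;
    }  // else: magnitude below 2^-25 rounds to (signed) zero
    return static_cast<std::uint16_t>(sign | half);
}

// Hypothetical helper: binary16 bit pattern -> float (always exact).
inline float half_bits_to_float(std::uint16_t h)
{
    const std::uint32_t sign = (h & 0x8000u) << 16;
    const std::uint32_t exp = (h >> 10) & 0x1fu;
    const std::uint32_t mant = h & 0x3ffu;
    std::uint32_t f;
    if (exp == 0x1fu) {
        f = sign | 0x7f800000u | (mant << 13);  // Inf / NaN
    } else if (exp != 0u) {
        f = sign | ((exp + 112u) << 23) | (mant << 13);  // normal
    } else {
        // Zero or subnormal: value is mant * 2^-24.
        const float mag = std::ldexp(static_cast<float>(mant), -24);
        std::memcpy(&f, &mag, sizeof f);
        f |= sign;
    }
    float result;
    std::memcpy(&result, &f, sizeof result);
    return result;
}
```

A software path like this does not depend on toolkit versions, while the vendor intrinsics can map to hardware conversion instructions; which one is preferable is the open question here.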

Some fixes have been extracted from this PR into #1253
TODO:

Closes #73

MarcelKoch · 2023-01-12T08:05:23Z

I would suggest putting all the TODOs into separate PRs. Having it compile and run in the first PR is IMO already enough. You could create a GitHub project to track the remaining TODOs.

sonarcloud · 2023-03-25T17:45:31Z

SonarCloud Quality Gate failed.

2 Bugs
0 Vulnerabilities
0 Security Hotspots
137 Code Smells

17.3% Coverage
1.6% Duplication


- use Csr in residual norm for half apply support
- use higher tolerance for mc64 due to half range
- some example can not finish in half precision for mc64
- skip some test in half due to half range
- fix the half limit value

Co-authored-by: Marcel Koch <[email protected]>


yhmtsai added the 1:ST:WIP This PR is a work in progress. Not ready for review. label Jan 12, 2023

thoasm mentioned this pull request Jan 12, 2023

Fix the wrong type and pass real-number value with device_type to devices #1253

Merged

upsj linked an issue Jan 26, 2023 that may be closed by this pull request

Full support for custom datatypes #53

Open

9 tasks

yhmtsai force-pushed the half branch from 8357af3 to a141380 Compare February 6, 2023 22:22

upsj mentioned this pull request Feb 8, 2023

Triangular solvers #1193

Open

yhmtsai force-pushed the half branch from e8bd1dc to b9bc95c Compare February 9, 2023 22:57

upsj requested review from a team and removed request for a team February 14, 2023 13:37

yhmtsai force-pushed the half branch from b9bc95c to bbb0fcb Compare March 23, 2023 15:25

yhmtsai force-pushed the half branch from f5113e2 to 7c321c2 Compare March 27, 2023 21:10

yhmtsai force-pushed the half branch from 7c321c2 to 79eebb0 Compare April 4, 2023 14:59

yhmtsai force-pushed the half branch 2 times, most recently from ade5e58 to 8a6d5fb Compare June 18, 2023 22:35

yhmtsai added 1:ST:ready-for-review This PR is ready for review and removed 1:ST:WIP This PR is a work in progress. Not ready for review. labels Jun 19, 2023

yhmtsai and others added 28 commits October 23, 2024 13:41

add half spmv benchmark (with cusparse for cuda)

710e037

fixes batched support for half

34845f3

generate PTX load/stores for half

48afbb5

fix mc64 for half

a51f136

Note: the issue is that numerical_limits<half>::infinite returns float instead of half. Maybe changing that would be a better solution
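
To illustrate the alternative raised in this note: a limits trait for a custom 16-bit type can return the type itself rather than `float`, so callers do not need an extra conversion. This is only a sketch, assuming the note refers to a `std::numeric_limits`-style trait; `example_half` is a hypothetical stand-in and not the type used in this PR.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <type_traits>

// Hypothetical stand-in for a half type that only stores the raw bits.
struct example_half {
    std::uint16_t bits;
};

namespace std {

// Sketch of a specialization whose members return example_half, not float.
// Only a few members are shown; a full specialization would define them all.
template <>
class numeric_limits<example_half> {
public:
    static constexpr bool is_specialized = true;
    static constexpr bool has_infinity = true;
    // binary16 +infinity: exponent all ones, mantissa zero.
    static constexpr example_half infinity() noexcept
    {
        return example_half{0x7c00};
    }
    // Largest finite binary16 value (65504).
    static constexpr example_half max() noexcept
    {
        return example_half{0x7bff};
    }
};

}  // namespace std
```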

fix hip memory.hip.hpp for half

60123dc

WIP: can compile but three tests are still failed

8f1e28f

fix config, ambiguous namespace, and batch

6dbd616

update format

cd270e1

fix windows and icpx

57fc170

hip does not support atomic on 16 bits

18e825f

fix batch

825f76f

add miss instantiation

81d63ac

update documentation, remove half.hpp

2a6d382

Co-authored-by: Marcel Koch <[email protected]> Co-authored-by: Thomas Grützmacher <[email protected]>

put function in gko not std

8731fc3

Co-authored-by: Thomas Grützmacher <[email protected]>

fix after rebase

64406f3

hip does not support 16bit shuffle

baa95f7

merge two #if block

4bb8093

do not use attributes in sqrt and abs

c539398

make half constexpr

d0e2446

isolate half out of device completely

0d777df

bits constexpr construct half and make numeric_limit in half

56e2af8

refine the code and fix error without half

6cc26d7

reduce abs/sqrt location

3d15350

move the math function to math

c4697a5

nohalf

377432a

cbgmres without half

e806a0a

direct without half

3e49252

yhmtsai force-pushed the half branch from 2a40a7d to 3e49252 Compare October 23, 2024 13:13

yhmtsai mentioned this pull request Oct 25, 2024

solver (mostly krylov solver) and the residual norm with half #1711

Open
