
Add GPU support based on (exchangeable) structs for solver state #81

Merged
71 commits merged into master on Jul 4, 2024

Conversation

@nHackel (Member) commented Apr 2, 2024

This PR adds GPU support by changing the solver structs to the following pattern:

mutable struct Solver{matT, ...}
  A::matT
  # Other "static" fields
  state::AbstractSolverState{<:Solver}
end

mutable struct SolverState{T, tempT} <: AbstractSolverState{Solver}
  x::tempT
  rho::T
  # ...
  iteration::Int64
end

The trick here is to not fully constrain the state in the solver. This allows us to adapt the state based on the measurements b given to the solve/init! functions.

In this PR we use this to detect whether the array type of our preallocated variables is the same as that of b. If not, we adapt our variables with similar, store the new state, and initialize as usual. This also doesn't require any dependency on CUDA.
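
A minimal sketch of that check, building on the structs above (the constructor call and field names are illustrative, not the actual package code):

function init!(solver::Solver, b::AbstractVector; kwargs...)
  state = solver.state
  if typeof(state.x) != typeof(b)
    # b lives on a different array type (e.g. a CuArray), so rebuild the
    # temporaries with similar; they follow b without any CUDA dependency.
    solver.state = SolverState(similar(b, size(state.x)), state.rho, 0)
  end
  # hand off to the state-specific initialization as before
  init!(solver, solver.state, b; kwargs...)
end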

First tests have shown that the actual iteration method is as type-stable as before. I am seeing small performance regressions; however, those seem to be in the init! and solve methods and appear to stem from my setup having to handle both the "old" and "new" variants.

ToDos:

  • CGNR
  • FISTA
  • POGM
  • OptISTA
  • ADMM
  • (Kaczmarz)
  • Direct Solvers
  • Update Callbacks
  • Update Proximal Maps
  • Update Tests for GPU Support

Future options for this setup:

We could also use different states per solver to encode variants of an algorithm. Randomized Kaczmarz or FISTA with restart might get different state types. A danger here is adding a lot of maintenance cost if the number of variants explodes.
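
For illustration only (these type names are made up and not part of the package), a variant could be selected purely by its state type while the solver struct and user-facing API stay unchanged:

abstract type AbstractSolverState{S} end
struct Kaczmarz end   # stand-in for the actual solver struct

mutable struct KaczmarzState{vecT} <: AbstractSolverState{Kaczmarz}
  x::vecT
  iteration::Int64
end

mutable struct RandomizedKaczmarzState{vecT} <: AbstractSolverState{Kaczmarz}
  x::vecT
  probabilities::vecT
  iteration::Int64
end

# The iteration kernel dispatches on the state type, e.g. for row selection:
next_row(state::KaczmarzState, nrows) = mod1(state.iteration + 1, nrows)
next_row(state::RandomizedKaczmarzState, nrows) = argmax(state.probabilities)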

We can use the states to implement "multiframe" measurements where b is a matrix. Then we need to allocate a state with a matrix of preallocated variables, if a solver supports this directly; otherwise we can just loop over each measurement as a fallback. For this I would also introduce a trait for single-/multiframe solvers.
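
A possible shape for that fallback (hypothetical helper, assuming the existing single-vector solve! entry point):

# Drive a solver without native multiframe support once per column of B and
# stack the per-frame solutions.
function solve_multiframe!(solver, B::AbstractMatrix)
  cols = map(b -> solve!(solver, b), eachcol(B))
  return reduce(hcat, cols)
end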

We can still use package extensions to write specialized versions of the iteration functions, which are then conditionally loaded. We could also wrap common array operations of our solvers into our own functions and then just specialize those for combinations of CUDA, sparse, and dense arrays, as we did for Kaczmarz.
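
A minimal sketch of that wrapping idea (the function name row_dot is made up for illustration):

using LinearAlgebra, SparseArrays

# Generic fallback, e.g. for dense CPU arrays.
row_dot(A::AbstractMatrix, x::AbstractVector, k::Integer) = dot(view(A, k, :), x)

# Sparse specialization: row k of the transpose wrapper is column k of the
# underlying CSC matrix, so only the stored entries are visited.
function row_dot(A::Transpose{<:Any, <:SparseMatrixCSC}, x::AbstractVector, k::Integer)
  At = parent(A)
  acc = zero(promote_type(eltype(A), eltype(x)))
  for idx in nzrange(At, k)
    acc += conj(nonzeros(At)[idx]) * x[rowvals(At)[idx]]
  end
  return acc
end

A GPU-specific method could then live in a package extension and only be loaded when CUDA is present.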


nHackel commented Apr 3, 2024

This will close #54 and #64

@nHackel nHackel marked this pull request as ready for review June 28, 2024 07:34
@nHackel nHackel merged commit 1741afd into master Jul 4, 2024
2 of 8 checks passed