You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Completing #1 would help this. We need to get a process in the EdgeAnalysis.selectobjects method in rms that can be parallelized using kernelabstractions.jl or some other julia gpu package.
The text was updated successfully, but these errors were encountered:
looking through the julia code, we have found 3 locations that would be good to try to parallelize with the gpu:
calcbranchingnumbers
calcradconsttermratios
processfluxes
these are taking a significant chunk of the overhead, and do have some fundamental array/matrix operations that we could put onto a gpu. To start, we will work with processfluxes, since this will run in a "vanilla" rmg execution (the other two are used only if you select certain options for your core enlargement).
Our concern is that there are a lot of memory write operations within these functions, so this might make any gpu speedup we attempt useless. we will try it first and post out results here.
Completing #1 would help this. We need to get a process in the EdgeAnalysis.selectobjects method in rms that can be parallelized using kernelabstractions.jl or some other julia gpu package.
The text was updated successfully, but these errors were encountered: