Remove managed memory from read_workerflow #82

atmyers · 2024-09-13T03:52:26Z

No description provided.

stevenhofmeyr · 2024-09-13T16:27:09Z

These modifications result in a significant increases in the infection rates seen in GPU runs, but there is no change for CPU runs. In versions before this, the CPU and GPU results are almost identical for the same seed, but now they differ significantly.

stevenhofmeyr · 2024-09-13T16:38:26Z

src/CensusData.cpp

@@ -736,16 +766,17 @@ void CensusData::assignTeachersAndWorkgroup (AgentContainer& pc /*!< Agent
 auto Ncommunity = demo.Ncommunity;

 auto np = soa.numParticles();
- for (int ip = 0; ip < np; ++ip) {
-
+ amrex::ParallelForRNG( np,


This change is what causes the difference between the GPU and CPU results. Maybe there are updates in the loop that require atomics for the GPU

Yes, you're right - I think that instead of running this the Gpu, for now we need to explicitly copy the data off the device, run this loop in serial, then copy the data back. Since this only happens at initialization I think this will be okay.

See update.

Note - with this change, AND with constructing the bins on the GPU, the code runs without crashing on development with amrex.the_arena_is_managed=0.

If I run unmanaged with random initial cases, I still get a segfault here, on line 961 in CensusData.cpp:
auto inds = bins.permutationPtr();

This appears to go away if we use the GPU binning policy rather than Serial.

Binning with BinPolicy::Serial still relies on managed memory, but if you change that to BinPolicy::GPU, it works for me - is that not what you're seeing?

Yes, that's what i'm seeing. With GPU policy, it works. So we should keep the default managed memory for reproducibility. But I can now check the other branch to see if there is anywhere managed memory is being implicitly used.

atmyers · 2024-09-27T19:56:15Z

Superseded by #84

Remove managed memory from read_workerflow

e8b5b2d

atmyers requested review from stevenhofmeyr, tannguyen153 and debog September 13, 2024 03:52

stevenhofmeyr reviewed Sep 13, 2024

View reviewed changes

run assignTeachersAndWorkgroup on the host for now

011d4d4

atmyers mentioned this pull request Sep 27, 2024

Fast interactions #84

Merged

atmyers closed this Sep 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove managed memory from read_workerflow #82

Remove managed memory from read_workerflow #82

atmyers commented Sep 13, 2024

stevenhofmeyr commented Sep 13, 2024

stevenhofmeyr Sep 13, 2024

atmyers Sep 13, 2024

atmyers Sep 13, 2024

atmyers Sep 13, 2024 •

edited

Loading

stevenhofmeyr Sep 13, 2024

stevenhofmeyr Sep 13, 2024 •

edited

Loading

atmyers Sep 13, 2024

stevenhofmeyr Sep 13, 2024

atmyers commented Sep 27, 2024

Remove managed memory from read_workerflow #82

Remove managed memory from read_workerflow #82

Conversation

atmyers commented Sep 13, 2024

stevenhofmeyr commented Sep 13, 2024

stevenhofmeyr Sep 13, 2024

Choose a reason for hiding this comment

atmyers Sep 13, 2024

Choose a reason for hiding this comment

atmyers Sep 13, 2024

Choose a reason for hiding this comment

atmyers Sep 13, 2024 • edited Loading

Choose a reason for hiding this comment

stevenhofmeyr Sep 13, 2024

Choose a reason for hiding this comment

stevenhofmeyr Sep 13, 2024 • edited Loading

Choose a reason for hiding this comment

atmyers Sep 13, 2024

Choose a reason for hiding this comment

stevenhofmeyr Sep 13, 2024

Choose a reason for hiding this comment

atmyers commented Sep 27, 2024

atmyers Sep 13, 2024 •

edited

Loading

stevenhofmeyr Sep 13, 2024 •

edited

Loading