Cut peak memory footprint in per_v_transform_reduce_dst_key_aggregated_outgoing_e #4484

seunghwak · 2024-06-12T20:01:01Z

Cut peak memory usage.

For an ultimate solution, we need to implement our own function to emulate ncclReduce with a custom reduction operator.

seunghwak · 2024-06-12T20:04:34Z

@jnke2016 After confirming that the input size of sort_by_key grows as we increase # GPUs (and graph size grows proportional to # GPUs), you can try this. With this approach, we might be able to avoid OOM for 2-4x more GPUs. But we may encounter OOM again later especially for graphs with small E / V. To solve, this we need a multi-GPU reduce function that supports a custom reduce function (@naimnv also needs this to improve the performance of matching).

jnke2016 · 2024-06-17T17:40:27Z

Can you also fix the out of memory allocation bug in Louvain in this PR please?

…to enh_mem_footprint_prim

ChuckHastings · 2024-06-24T17:05:24Z

/merge

…to enh_mem_footprint_prim

cut memory footprint

defcf36

seunghwak requested a review from a team as a code owner June 12, 2024 20:01

seunghwak self-assigned this Jun 12, 2024

github-actions bot added the cuGraph label Jun 12, 2024

seunghwak requested a review from jnke2016 June 12, 2024 20:01

seunghwak requested review from ChuckHastings and naimnv June 12, 2024 20:16

seunghwak added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Jun 12, 2024

Merge branch 'branch-24.08' into enh_mem_footprint_prim

dbaa3b6

naimnv approved these changes Jun 19, 2024

View reviewed changes

ChuckHastings approved these changes Jun 24, 2024

View reviewed changes

seunghwak added 3 commits June 24, 2024 08:32

memory overallocation bug fixes

646a57c

Merge branch 'branch-24.08' of https://github.com/rapidsai/cugraph in…

06ce2ca

…to enh_mem_footprint_prim

Merge branch 'upstream_pr4484' into enh_mem_footprint_prim

3effef0

jnke2016 approved these changes Jun 24, 2024

View reviewed changes

seunghwak added 2 commits June 24, 2024 11:17

Merge branch 'branch-24.08' of https://github.com/rapidsai/cugraph in…

d8828d7

…to enh_mem_footprint_prim

Merge branch 'branch-24.08' of https://github.com/rapidsai/cugraph in…

8abe6ec

…to enh_mem_footprint_prim

rapids-bot bot merged commit 3ca5d78 into rapidsai:branch-24.08 Jun 28, 2024
131 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cut peak memory footprint in per_v_transform_reduce_dst_key_aggregated_outgoing_e #4484

Cut peak memory footprint in per_v_transform_reduce_dst_key_aggregated_outgoing_e #4484

seunghwak commented Jun 12, 2024

seunghwak commented Jun 12, 2024

jnke2016 commented Jun 17, 2024

ChuckHastings commented Jun 24, 2024

Cut peak memory footprint in per_v_transform_reduce_dst_key_aggregated_outgoing_e #4484

Cut peak memory footprint in per_v_transform_reduce_dst_key_aggregated_outgoing_e #4484

Conversation

seunghwak commented Jun 12, 2024

seunghwak commented Jun 12, 2024

jnke2016 commented Jun 17, 2024

ChuckHastings commented Jun 24, 2024