Feature/reduce-decoder-mem-usage #84
base: develop
Conversation
Declare an empty accumulator tensor outside the for loop. The old approach of keeping both `out` and `out1` holds two copies of the array alive at once, which increases memory use; at 9km this added 6GB to peak memory usage.
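A minimal sketch of the pattern, with hypothetical names (`decode_chunked_before`, `decode_chunked_after`, and a stand-in for the per-chunk decoder work) rather than the actual mapper code:

```python
import torch


def decode_chunked_before(x: torch.Tensor, num_chunks: int) -> torch.Tensor:
    """Old pattern: two accumulation tensors (`out`, `out1`) alive at peak."""
    out = None
    for chunk in torch.tensor_split(x, num_chunks, dim=0):
        out1 = chunk * 2.0  # stand-in for the real per-chunk decoder work
        # torch.cat materialises a second full copy alongside `out1`, so
        # memory zig-zags as `out1` is created and freed on each chunk.
        out = out1 if out is None else torch.cat((out, out1), dim=0)
    return out


def decode_chunked_after(x: torch.Tensor, num_chunks: int) -> torch.Tensor:
    """New pattern: one accumulator declared before the loop, filled in place."""
    out = torch.empty_like(x)
    offset = 0
    for chunk in torch.tensor_split(x, num_chunks, dim=0):
        # Writing each chunk's result into its slice avoids ever holding
        # a second copy of the accumulated output.
        out[offset : offset + chunk.shape[0]] = chunk * 2.0
        offset += chunk.shape[0]
    return out
```

In the "after" version only the accumulator and the current chunk's result are live at any moment, which is where the peak-memory saving comes from.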
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@           Coverage Diff            @@
##           develop      #84   +/-  ##
========================================
  Coverage    99.85%   99.85%
========================================
  Files           23       23
  Lines         1350     1350
========================================
  Hits          1348     1348
  Misses           2        2
Great work. Is this from a training run or an inference run?
Inference. I haven't tried it in training because this only happens when num_chunks > 1.
Absolutely, it would just be interesting to check.
This change increases the memory saved by chunking in the mapper. At the moment we use two arrays to accumulate chunks; this replaces them with a single array. At 9km this reduces peak memory usage by 6GB.
Below are screenshots of memory usage during the chunking part of the decoder at 9km.
Before: notice the zig-zag pattern. It comes from the `out1` tensor being created and freed on every chunk.
After: the zig-zag pattern is gone and peak memory usage has decreased by 6GB.
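For anyone wanting to reproduce this kind of comparison, one way to observe the peak is PyTorch's CUDA memory counters. This is a hypothetical measurement harness, assuming a CUDA device and the sketch functions above; the 6GB figure itself came from the author's 9km run:

```python
import torch

# Reset the peak-memory counter, run the chunked decode, then read the peak.
torch.cuda.reset_peak_memory_stats()
x = torch.randn(100_000, 256, device="cuda")
y = decode_chunked_after(x, num_chunks=4)
print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**30:.3f} GiB")
```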