Add memray plugin #2875

fiedlerNr9 · 2024-10-29T17:47:41Z

Why are the changes needed?

Enables memray profiling at the Flyte task level
Renders the memray report in Flyte Deck

What changes were proposed in this pull request?

Adds a memray flytekit plugin

How was this patch tested?

Unit tests
Tested with local and remote runs

Setup process

from flytekit import workflow, task, ImageSpec
from flytekitplugins.memray import memray_profiling
import time


flytekit_hash = "82d5ac739f5f02998edb9538c58cf93c8f6e501b"
flytekitplugins_memray = f"git+https://github.com/flyteorg/flytekit.git@{flytekit_hash}#subdirectory=plugins/flytekit-memray"

image = ImageSpec(
    name="memray_demo",
    python_version="3.11.10",
    apt_packages=["git"],
    packages=[flytekitplugins_memray],
    registry="ghcr.io/fiedlernr9",
)


def generate_data(n: int):
    leak_list = []
    for _ in range(n):  # Arbitrary large number for demonstration
        large_data = " " * 10**6  # 1 MB string
        leak_list.append(large_data)  # Keeps appending without releasing
        time.sleep(0.1)  # Slow down the loop to observe memory changes


@task(container_image=image, enable_deck=True)
@memray_profiling(memray_html_reporter="table")
def memory_usage(n: int) -> str:
    generate_data(n=n)

    return "Well"


@task(container_image=image, enable_deck=True)
@memray_profiling(trace_python_allocators=True, memray_reporter_args=["--leaks"])
def memory_leakage(n: int) -> str:
    generate_data(n=n)

    return "Well"


@workflow
def wf(n: int = 500):
    memory_usage(n=n)
    memory_leakage(n=n)
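
For a quick local sanity check of the allocation pattern in generate_data, without Flyte or memray, the standard library's tracemalloc can serve as a rough stand-in. This is only a sketch and not part of the plugin: tracemalloc is a different tool from memray, peak_bytes and ONE_MB are hypothetical names introduced here, and the sleep from the demo is dropped so the check runs quickly.

```python
import tracemalloc

ONE_MB = 10**6  # size of each string appended by the demo loop


def generate_data(n: int) -> None:
    """Same allocation pattern as the demo above, minus the sleep."""
    leak_list = []
    for _ in range(n):
        # Each iteration allocates a new ~1 MB string that the list keeps alive.
        leak_list.append(" " * ONE_MB)


def peak_bytes(n: int) -> int:
    """Return the peak traced memory (in bytes) while generate_data(n) runs."""
    tracemalloc.start()
    try:
        generate_data(n)
        # get_traced_memory() returns (current, peak); the peak survives
        # even though leak_list is freed when generate_data returns.
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return peak
```

Calling peak_bytes(5) should report a peak of at least 5 MB, since all five strings are alive at the end of the loop.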

Screenshots

Flamegraph

Table

Check all the applicable boxes

I updated the documentation accordingly.
All new and existing tests passed.
All commits are signed-off.

Related PRs

Docs link

codecov · 2024-10-29T18:11:10Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 38.52%. Comparing base (3fc51af) to head (ed5cdaf).
Report is 23 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2875      +/-   ##
==========================================
- Coverage   45.53%   38.52%   -7.02%     
==========================================
  Files         196      199       +3     
  Lines       20418    20765     +347     
  Branches     2647     2665      +18     
==========================================
- Hits         9298     7999    -1299     
- Misses      10658    12552    +1894     
+ Partials      462      214     -248

Signed-off-by: Jan Fiedler <[email protected]>

eapolinario

This is wicked cool!

Can you add flytekit-memray to https://github.com/flyteorg/flytekit/blob/master/.github/workflows/pythonbuild.yml#L319-L364 ?

plugins/flytekit-memray/flytekitplugins/memray/profiling.py

fiedlerNr9 · 2024-11-04T19:48:02Z

This is wicked cool!

Can you add flytekit-memray to https://github.com/flyteorg/flytekit/blob/master/.github/workflows/pythonbuild.yml#L319-L364 ?

✅

thomasjpfan · 2024-11-04T19:48:16Z

plugins/flytekit-memray/README.md

+image = ImageSpec(
+    name="memray_demo",
+    packages=["flytekitplugins_memray"],
+    env={"PYTHONMALLOC": "malloc"},


Instead of hard-coding this into the environment, can we now use trace_python_allocators=True?

I tested that and it's not throwing any warnings, but the results look different.
This is the task I tested without the env variable set:

@task(container_image=image, enable_deck=True)
@memray_profiling(trace_python_allocators=True, memray_reporter_args=["--leaks"])
def memory_leakage(n: int) -> str:
    generate_data(n=n)

    return "Well"

This is the result:

Not sure if this is expected

For completeness, what do you see when you set trace_python_allocators=False?

This, which is expected, I guess.

It's weird that it gives two different flamegraphs. The new flamegraph makes more sense to me because the tracker is wrapping the user code and you can clearly see generate_data.

I cannot really see where generate_data is on your original flamegraph.

Oh man, I mixed up my demos. It looks exactly the same with trace_python_allocators=False and the env variable set. Sorry for the confusion, I will update the README shortly.

thomasjpfan · 2024-11-05T17:56:03Z

plugins/flytekit-memray/flytekitplugins/memray/profiling.py

+        self.trace_python_allocators = trace_python_allocators
+        self.follow_fork = follow_fork
+        self.memory_interval_ms = memory_interval_ms
+        self.dir_name = "memray"


To make it obvious that this is a directory for memray files:

Suggested change

-        self.dir_name = "memray"
+        self.dir_name = "memray_bin"

thomasjpfan · 2024-11-05T17:56:49Z

plugins/flytekit-memray/flytekitplugins/memray/profiling.py

+        if not os.path.exists(self.dir_name):
+            os.makedirs(self.dir_name)
+
+        bin_filepath = f"{self.dir_name}/{self.task_function.__name__}.{time.strftime('%Y%m%d%H%M%S')}.bin"


With os.path.join:

Suggested change

-        bin_filepath = f"{self.dir_name}/{self.task_function.__name__}.{time.strftime('%Y%m%d%H%M%S')}.bin"
+        bin_filepath = os.path.join(self.dir_name, f"{self.task_function.__name__}.{time.strftime('%Y%m%d%H%M%S')}.bin")
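
For comparison, a pathlib-based variant of the same path construction could look like the sketch below. It is not the code merged in this PR; build_bin_filepath is a hypothetical helper name, and dir_name and task_name stand in for self.dir_name and self.task_function.__name__. It also folds the os.path.exists / os.makedirs check into a single mkdir call.

```python
import time
from pathlib import Path


def build_bin_filepath(dir_name: str, task_name: str) -> Path:
    """Return a timestamped .bin path inside dir_name, creating the directory if needed."""
    out_dir = Path(dir_name)
    # exist_ok=True makes the explicit os.path.exists() check unnecessary.
    out_dir.mkdir(parents=True, exist_ok=True)
    timestamp = time.strftime("%Y%m%d%H%M%S")
    # The / operator joins path segments with the platform's separator.
    return out_dir / f"{task_name}.{timestamp}.bin"


# Example (creates ./memray_bin if it does not exist):
# bin_filepath = build_bin_filepath("memray_bin", "memory_usage")
```

The result is a Path object, so callers that need a plain string (for example when building a command line) would wrap it in str().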

thomasjpfan · 2024-11-05T18:09:37Z

plugins/flytekit-memray/flytekitplugins/memray/profiling.py

+
+        memray_reporter_args_str = " ".join(self.memray_reporter_args)
+
+        if os.system(f"memray {reporter} -o {html_filepath} {memray_reporter_args_str} {bin_filepath}") == 0:


To be completely sure we are using the memray that is installed in the current python environment:

Suggested change

-        if os.system(f"memray {reporter} -o {html_filepath} {memray_reporter_args_str} {bin_filepath}") == 0:
+        if os.system(f"{sys.executable} -m memray {reporter} -o {html_filepath} {memray_reporter_args_str} {bin_filepath}") == 0:

It's unfortunate that memray does not document its Python API for writing reports and only documents the CLI. So I'm okay with using the CLI from here.
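
As a possible follow-up to the suggestion above, the same CLI call can be made without going through a shell by passing an argument list to subprocess.run, which avoids quoting problems when paths contain spaces. This is only a sketch, not the plugin's implementation: build_memray_command and render_report are hypothetical helper names, and the parameters mirror the variables in the snippet under review.

```python
import subprocess
import sys
from typing import List, Sequence


def build_memray_command(
    reporter: str,
    html_filepath: str,
    bin_filepath: str,
    reporter_args: Sequence[str] = (),
) -> List[str]:
    """Build the argument list for `python -m memray <reporter> ...`."""
    # sys.executable pins the call to the memray installed in the current environment.
    return [
        sys.executable, "-m", "memray", reporter,
        "-o", html_filepath,
        *reporter_args,
        bin_filepath,
    ]


def render_report(
    reporter: str,
    html_filepath: str,
    bin_filepath: str,
    reporter_args: Sequence[str] = (),
) -> bool:
    """Run the memray reporter and return True if it exited with status 0."""
    cmd = build_memray_command(reporter, html_filepath, bin_filepath, reporter_args)
    # No shell is involved, so each argument reaches memray exactly as given.
    result = subprocess.run(cmd, check=False)
    return result.returncode == 0
```

With this shape, memray_reporter_args can stay a list (for example ["--leaks"]) instead of being joined into a single string first.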

thomasjpfan · 2024-11-05T18:13:01Z

plugins/flytekit-memray/tests/test_memray_profiling.py

+@task(enable_deck=True)
+@memray_profiling


Not actionable for this PR: I wish there was a way to ensure that enable_deck=True when using memray_profiling. Otherwise, we just add overhead without any reports.

@eapolinario @pingsutw What do you think of making deck_fields=None and setting enable_deck=True?

https://github.com/flyteorg/flytekit/blob/master/flytekit/core/task.py#L203-L210

fiedlerNr9 requested review from wild-endeavor, kumare3, eapolinario, pingsutw, cosmicBboy, samhita-alla, thomasjpfan and Future-Outlier as code owners October 29, 2024 17:47

fiedlerNr9 force-pushed the add-memray-plugin branch from 875779e to 5ed181e Compare October 29, 2024 18:00

fiedlerNr9 added 16 commits October 29, 2024 11:22 (each signed off by Jan Fiedler)

a9e6213 wip
08562b6 wip
d3a5b3c wip
2917fea wip
5e8f035 wip
c9b903c wip
509ee63 wip
77fcfdd wip
7744e45 wip
7ed2420 wip
45d9094 rename memray_profiling
fab1c57 finish readme
4b24228 adjust memray_reporter_args type
4b4b371 ruff check --fix
c9fa064 ruff format
8e67334 codespell

fiedlerNr9 force-pushed the add-memray-plugin branch from b2c7770 to 8e67334 Compare October 29, 2024 18:23

eapolinario reviewed Oct 31, 2024

thomasjpfan reviewed Nov 1, 2024

00f13ca add flytekit-memray to pythonbuild workflows (signed off by Jan Fiedler)
55350c6 allow memray.Tracker arguments in profiling (signed off by Jan Fiedler)

thomasjpfan reviewed Nov 4, 2024

fiedlerNr9 added 5 commits November 4, 2024 11:53 (each signed off by Jan Fiedler)

d08cc59 extend memray_profiling args description
14d5af0 spelling
53037b3 move tests
7b1c0bc move tests again 🤡
ed5cdaf adjust README.md to not use PYMALLOC env variable

thomasjpfan reviewed Nov 5, 2024
