[WIP] added PyTorch Profiler #2315
base: master
Conversation
@Ishan-Kumar2 Thank you! It looks great, I will play with the handler asap!
@sdesrozis @Ishan-Kumar2 can we move on with this PR?
Hi @vfdev-5, sorry for the delay. I'll have a look at it today :)
@Ishan-Kumar2 Sorry for the delay on my side. I will use this handler in some of my own code to see whether it works and give feedback asap.
(force-pushed from 9186ff1 to 333057e)
@sdesrozis I tested the code on my local example, with this additional code:

```python
# Define a PT Profiler
pt_profiler = PyTorchProfiler(on_trace_ready="tensorboard", output_path="./logs/train")
pt_profiler.attach(trainer)
```

It produces 3 non-empty JSON files, but I am unable to load them in TensorBoard: running `tensorboard --logdir=./logs` shows "No dashboards are active for the current data set." I think that as long as the JSON files are being produced, the output should be correct, since that part is done by the PyTorch Profiler and not changed by me. There must be some issue with how I am opening them.
It works for me, but don't forget to install the dedicated TensorBoard plugin (`torch-tb-profiler`).
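One way to sanity-check the generated files independently of the TensorBoard plugin (a sketch; the `./logs/train` path follows the example above) is to parse them as Chrome trace JSON:

```python
# Sketch: the profiler's trace files are Chrome trace JSON, so they should
# parse with the standard json module even if the TensorBoard plugin is
# missing. The ./logs/train path follows the example above.
import json
from pathlib import Path

for trace_file in Path("./logs/train").glob("*.json"):
    with open(trace_file) as f:
        trace = json.load(f)  # raises if the file is not valid JSON
    print(trace_file.name, "events:", len(trace.get("traceEvents", [])))
```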
I left a few comments.
@Ishan-Kumar2 it looks good. I think the next step now is about the tests.
@sdesrozis great, will start working on the tests.
@sdesrozis, Added some tests for the profiler. I have not added checks for the output of the profiler, since I believe that is already done by PyTorch.
    return dummy_trainer


def test_get_results(tmp_path):
I think you should first test the case where the profiler is not attached to an engine. Second, you should test for the presence and absence of the expected keys.
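A sketch of the first of those cases (the import path and the exact exception type are assumptions; the WIP handler may behave differently):

```python
import pytest

from ignite.handlers import PyTorchProfiler  # assumed import path for the WIP handler


def test_get_results_not_attached(tmp_path):
    pt_profiler = PyTorchProfiler(output_path=str(tmp_path))
    # The profiler was never attached to an engine, so asking for results
    # should fail clearly rather than return an empty table.
    with pytest.raises(RuntimeError):
        pt_profiler.get_results()
```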
    pt_profiler.get_results(sort_key="cpu_times")


def test_write_results(tmp_path):
You should test the files generated over more than one epoch.
I have added this
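A rough sketch of what such a multi-epoch test could look like (the import path, constructor arguments, and file-naming behavior are assumptions; the actual test in the PR may differ):

```python
import torch
from ignite.engine import Engine

from ignite.handlers import PyTorchProfiler  # assumed import path for the WIP handler


def test_write_results_multiple_epochs(tmp_path):
    trainer = Engine(lambda engine, batch: batch * 2)
    pt_profiler = PyTorchProfiler(output_path=str(tmp_path))
    pt_profiler.attach(trainer)

    trainer.run(data=[torch.randn(4)] * 8, max_epochs=3)
    pt_profiler.write_results()

    # After several epochs, at least one non-empty trace file should exist.
    trace_files = list(tmp_path.iterdir())
    assert len(trace_files) > 0
    assert all(f.stat().st_size > 0 for f in trace_files)
```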
}

def _profiler_create(self):
    self._profiler = torch.profiler.profile(
Maybe we should check the PyTorch version and provide a clear error message if the version is < 1.8? And this check would be associated with a specific test.
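One way such a guard could look (a sketch, not the PR's actual code): `torch.profiler` only exists as of PyTorch 1.8, so the handler could fail early with a clear message on older versions:

```python
from distutils.version import LooseVersion

import torch

if LooseVersion(torch.__version__) < LooseVersion("1.8.0"):
    # torch.profiler was introduced in PyTorch 1.8.
    raise RuntimeError(
        f"PyTorchProfiler requires PyTorch >= 1.8.0, but found {torch.__version__}."
    )
```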
I didn't get how I should do this. If the PyTorch version is < 1.8, then I want all the tests to be skipped, right? So should I add a @pytest.mark.skipif to all the tests?
You can get inspiration from the tests in tests/ignite/contrib/handlers/test_tensorboard_logger.py. The tests need to be improved to check what happens with the different backends (TPU, NCCL, etc.).
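The skipif pattern in question could look like this (a sketch; the exact condition or helper used in ignite's test suite may differ):

```python
from distutils.version import LooseVersion

import pytest
import torch


@pytest.mark.skipif(
    LooseVersion(torch.__version__) < LooseVersion("1.8.0"),
    reason="torch.profiler requires PyTorch >= 1.8.0",
)
def test_profiler_smoke():
    import torch.profiler  # only importable on PyTorch >= 1.8

    with torch.profiler.profile() as prof:
        torch.ones(2, 2) @ torch.ones(2, 2)
    assert len(prof.key_averages()) > 0
```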
@sdesrozis I have incorporated most of your suggestions. I am still working on the distributed tests and will add those soon too.
Fixes #1917
Description:
Added a minimal implementation of the PyTorch profiler as a handler attached to an engine. A lot of other features can be added; please let me know if you have any suggestions. Also, I haven't added tests yet; if the initial code looks good, I can get started on those :)
Checklist:
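For context, a minimal sketch of the kind of handler this PR describes: torch.profiler driven by an ignite Engine's events. The class and method names mirror those discussed above (PyTorchProfiler, attach, get_results), but this is an illustration of the idea, not the PR's actual implementation:

```python
import torch
from ignite.engine import Engine, Events


class PyTorchProfiler:
    """Sketch: profile each epoch and dump TensorBoard-readable traces."""

    def __init__(self, output_path="./logs", **profiler_kwargs):
        self._output_path = output_path
        self._profiler_kwargs = profiler_kwargs
        self._profiler = None

    def _epoch_started(self, engine):
        # Create a fresh profiler whose traces the TensorBoard plugin can read.
        self._profiler = torch.profiler.profile(
            on_trace_ready=torch.profiler.tensorboard_trace_handler(self._output_path),
            **self._profiler_kwargs,
        )
        self._profiler.__enter__()

    def _iteration_completed(self, engine):
        self._profiler.step()

    def _epoch_completed(self, engine):
        self._profiler.__exit__(None, None, None)

    def attach(self, engine: Engine):
        engine.add_event_handler(Events.EPOCH_STARTED, self._epoch_started)
        engine.add_event_handler(Events.ITERATION_COMPLETED, self._iteration_completed)
        engine.add_event_handler(Events.EPOCH_COMPLETED, self._epoch_completed)

    def get_results(self, sort_key="self_cpu_time_total"):
        # key_averages().table() is torch.profiler's standard text summary;
        # the sort keys accepted here are torch's, which may differ from
        # the names used elsewhere in this PR.
        return self._profiler.key_averages().table(sort_by=sort_key)
```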