🐛[BUG]: The training for test 'bloodflow_1d_mgn' needs performance improvements #110

Open
lucapegolotti opened this issue Sep 22, 2023 · 0 comments
Labels
? - Needs Triage (Need team to review and classify), bug (Something isn't working), external (Issues/PR filed by people outside the team)

Comments

@lucapegolotti
Contributor

Version

Modulus 0.3.0

On which installation method(s) does this occur?

No response

Describe the issue

We tested training for this example on A40, A100, and RTX3090 GPUs. Performance on the A40 is reasonable (150 s per epoch), but on the other GPUs it is too slow (600 s per epoch on the RTX3090 with a smaller version of the GNN). Further testing is needed to determine whether the custom DataLoader for this example should be improved or whether the bottleneck lies elsewhere.

Minimum reproducible example

No response

Relevant log output

No response

Environment details

+ A40, A100, RTX3090 GPUs
@lucapegolotti added the ? - Needs Triage and bug labels Sep 22, 2023
@NickGeneva added the external label Sep 25, 2023