Maximize multi GPU utilization #173

amirhp110 · 2022-08-24T17:33:41Z

amirhp110
Aug 24, 2022

What parameter needs to be changed to maximize the multi GPU utilization.
I did increase the batch size but didn't make any difference. On my 8 x A100 GPU (40 GB) system only 5 GB of each GPU is being used and only 36% gpu utilization. Also not seeing if we are using the mixed precision but that might be my way of checking and the tool report accuracy.

lucasjinreal · 2022-08-25T02:03:51Z

lucasjinreal
Aug 25, 2022
Maintainer

4 replies

amirhp110 Aug 29, 2022
Author

Thanks a lot for the reply,
Did you actually try this and you've seen all GPUs are being utilized at least above 80 percent?
I have to pay to try this, so please let me know your experience.

lucasjinreal Aug 30, 2022
Maintainer

how much do u want to pay

amirhp110 Aug 30, 2022
Author

My main point of my question was if you have already tried a multi GPU system and if you are seeing all GPU are utilizing above 80% or not?

amirhp110 Aug 30, 2022
Author

I just tried and got the following error; I'll try to fix my local copy of the code as suggested:

RuntimeError: torch.nn.functional.binary_cross_entropy and torch.nn.BCELoss are unsafe to autocast.
Many models use a sigmoid layer right before the binary cross entropy layer.
In this case, combine the two layers using torch.nn.functional.binary_cross_entropy_with_logits
or torch.nn.BCEWithLogitsLoss. binary_cross_entropy_with_logits and BCEWithLogits are
safe to autocast.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Maximize multi GPU utilization #173

{{title}}

Replies: 1 comment 4 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Maximize multi GPU utilization #173

amirhp110 Aug 24, 2022

Replies: 1 comment · 4 replies

lucasjinreal Aug 25, 2022 Maintainer

amirhp110 Aug 29, 2022 Author

lucasjinreal Aug 30, 2022 Maintainer

amirhp110 Aug 30, 2022 Author

amirhp110 Aug 30, 2022 Author

amirhp110
Aug 24, 2022

Replies: 1 comment 4 replies

lucasjinreal
Aug 25, 2022
Maintainer

amirhp110 Aug 29, 2022
Author

lucasjinreal Aug 30, 2022
Maintainer

amirhp110 Aug 30, 2022
Author

amirhp110 Aug 30, 2022
Author