Replies: 1 comment
The same answer as in #54: probably all GPUs see the whole dataset in each epoch, which is why the epoch time does not drop as you add GPUs.
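For contrast, here is a minimal PyTorch DDP sketch (not this project's actual trainer code; `ToyDataset`, the batch size, and the `torchrun` launch are assumptions) of the sharded alternative: with a `DistributedSampler`, each rank iterates only `len(dataset) / world_size` samples per epoch, so epoch time shrinks as GPUs are added.

```python
# Minimal sketch, assuming a launch with `torchrun --nproc_per_node=<N>`;
# ToyDataset is a stand-in, not this project's real dataset or trainer.
import torch
import torch.distributed as dist
from torch.utils.data import DataLoader, Dataset
from torch.utils.data.distributed import DistributedSampler

class ToyDataset(Dataset):
    def __init__(self, size=10_000):
        self.size = size
    def __len__(self):
        return self.size
    def __getitem__(self, idx):
        return torch.randn(16), torch.tensor(0)

dist.init_process_group(backend="nccl")
dataset = ToyDataset()

# DistributedSampler gives each rank a disjoint 1/world_size shard per epoch,
# instead of every rank iterating the full dataset.
sampler = DistributedSampler(dataset, shuffle=True)
loader = DataLoader(dataset, batch_size=32, sampler=sampler)

for epoch in range(3):
    sampler.set_epoch(epoch)  # reshuffle the shard assignment each epoch
    for inputs, labels in loader:
        pass  # forward/backward on this rank's shard only

dist.destroy_process_group()
```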
Something is odd with the trainer. When I train on a dataset D with 2 GPUs it takes X hours; with 4 GPUs it still takes X hours, whereas I expected it to take X / 2 hours. Does the trainer share the batches between the subprocesses, or does each subprocess take its own batch?
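A quick way to check which case applies is to print the per-rank batch count with and without a `DistributedSampler` (a sketch only, assuming a `torchrun` launch; the random `TensorDataset` stands in for whatever dataset the trainer actually builds):

```python
# Sketch: compare how many batches each rank sees with and without sharding.
# Assumes a `torchrun --nproc_per_node=<N>` launch; the random TensorDataset
# is a placeholder for the dataset the trainer actually loads.
import torch
import torch.distributed as dist
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler

dist.init_process_group(backend="nccl")
rank, world_size = dist.get_rank(), dist.get_world_size()

dataset = TensorDataset(torch.randn(8_000, 16))

plain_loader = DataLoader(dataset, batch_size=32)                 # no sharding
sharded_loader = DataLoader(dataset, batch_size=32,
                            sampler=DistributedSampler(dataset))  # 1/world_size per rank

# If every rank reports the full batch count, each subprocess consumes the
# whole dataset and adding GPUs will not shorten an epoch; if each rank
# reports roughly total / world_size, the work is split and the epoch
# should scale with the number of GPUs.
print(f"rank {rank}/{world_size}: plain={len(plain_loader)} batches, "
      f"sharded={len(sharded_loader)} batches")

dist.destroy_process_group()
```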