batch size #150
Original question:

Hi,
Thank you for the work. May I ask: for ResNeSt-50, did you use a batch size of 8192 (from the paper) or 2048 (from pytorch-encoding)? How much will the performance change?
I was also wondering about dropout: it is mentioned in the paper but set to 0 in the pytorch-encoding training script. Does that mean the trick won't have much of an impact on performance?
Thanks again for your time.

Reply:

The models in the original paper were trained using the MXNet implementation. Typically, the larger the batch size is, the worse the performance will be; see the "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour" paper for details (a sketch of its learning-rate scaling rule follows below). Dropout only helps larger models.
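For context, the paper referenced in the reply compensates for large batches with a linear learning-rate scaling rule plus gradual warmup. A minimal PyTorch sketch of that recipe, assuming the paper's reference setting of base LR 0.1 at batch size 256; the model is a placeholder and the warmup length is illustrative, not the exact ResNeSt configuration:

```python
import torch
import torch.nn as nn

# Linear scaling rule (Goyal et al., 2017): lr = base_lr * batch_size / 256,
# where base_lr = 0.1 is the reference learning rate at batch size 256.
base_lr = 0.1
batch_size = 2048                  # e.g. the pytorch-encoding setting asked about
lr = base_lr * batch_size / 256    # -> 0.8 at batch size 2048

model = nn.Linear(8, 8)            # placeholder for a real network such as ResNeSt-50
optimizer = torch.optim.SGD(model.parameters(), lr=lr,
                            momentum=0.9, weight_decay=1e-4)

# Gradual warmup, also from the paper: ramp the LR linearly over the first
# few epochs so the large scaled LR does not destabilize early training.
warmup_epochs = 5

def warmup_lr(epoch: int) -> float:
    """LR during the warmup phase; the post-warmup decay schedule is omitted."""
    if epoch < warmup_epochs:
        return lr * (epoch + 1) / warmup_epochs
    return lr

for epoch in range(warmup_epochs):
    for group in optimizer.param_groups:
        group["lr"] = warmup_lr(epoch)
```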
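On the dropout point: the setting discussed here is a dropout layer applied just before the final classifier, which the ResNeSt code exposes as a model argument (the name final_drop below follows that code as I recall it, but treat it and this whole head as an assumption, not the repository's actual implementation). Setting it to 0 disables the trick, which matches the reply that it only helps larger variants:

```python
import torch
import torch.nn as nn

class ClassifierHead(nn.Module):
    """Illustrative head: global pooling, optional final dropout, then FC.
    final_drop mirrors the knob discussed above; 0.0 disables dropout."""

    def __init__(self, in_channels: int, num_classes: int, final_drop: float = 0.0):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        # With final_drop = 0.0 this branch is an identity, matching the
        # training script where the trick is effectively turned off.
        self.drop = nn.Dropout(p=final_drop) if final_drop > 0.0 else nn.Identity()
        self.fc = nn.Linear(in_channels, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.pool(x).flatten(1)
        return self.fc(self.drop(x))

# Hypothetical usage: 0.0 for ResNeSt-50 (as in the script), a nonzero
# value only for larger variants, per the reply above.
head = ClassifierHead(in_channels=2048, num_classes=1000, final_drop=0.2)
out = head(torch.randn(2, 2048, 7, 7))
print(out.shape)  # torch.Size([2, 1000])
```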