I have two questions related to image size when training Co-DETR with ViT-L:
Could you explain why the backbone's img_size argument is set to 1536 (link)? The maximum img_scale size is (1536, 2400), and I’m not entirely clear on the connection between them. Since the image sizes can range from 480 to 1536 due to resizing, I would appreciate more details on how this works.
A few days ago, you updated the img_scale values in the train_pipeline (link). Should the img_scale values in the test_pipeline also be updated?
Thanks!
For ViT with LSJ augmentation, the backbone's img_size argument should be equal to the actual image size. For ViT with DETR augmentation, this argument can be ignored.
In my experiments, a test image size of 2048x1280 achieves the best single-scale performance.
Oh, so even though my actual image size is (height=1024, width=1920), using img_size=1536 doesn’t affect training a ViT with DETR, is that correct? I noticed that the img_size argument is used in the ViT backbone like this. How can the img_size be ignored?
I understand that training and testing with images of size 2048x1280 produced the best results for your model. Thank you for sharing your insights! What was your original image size? Was it larger than 2048x1280 and resized, or was it originally 2048x1280?
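To make the relationship between the raw image size and img_scale concrete, here is a minimal sketch of the keep-ratio resize arithmetic. It assumes the pipeline's Resize step uses keep_ratio=True with the usual rule of fitting the long edge and the short edge inside the two img_scale values. The function name rescale_keep_ratio is hypothetical, and the (480, 2400) scale is only an assumed lower bound based on the "480 to 1536" range mentioned above, so check both against the actual config.

```python
def rescale_keep_ratio(width, height, img_scale):
    """Return (new_width, new_height) that fit inside img_scale.

    Hypothetical helper: the larger img_scale value bounds the long edge,
    the smaller one bounds the short edge, and the aspect ratio is kept.
    """
    max_long_edge = max(img_scale)
    max_short_edge = min(img_scale)
    # Pick the largest factor that violates neither bound.
    scale_factor = min(
        max_long_edge / max(width, height),
        max_short_edge / min(width, height),
    )
    # Round to the nearest integer pixel size.
    return int(width * scale_factor + 0.5), int(height * scale_factor + 0.5)


# The 1920x1024 image from this thread, at the largest training scale.
print(rescale_keep_ratio(1920, 1024, (1536, 2400)))  # (2400, 1280)

# The same image at an assumed smallest scale of (480, 2400).
print(rescale_keep_ratio(1920, 1024, (480, 2400)))  # (900, 480)
```

Under these assumptions, a 1920x1024 image is limited by its long edge at the largest scale, so the short edge only reaches 1280 rather than 1536.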