For some experiments, you might wanna pass only the background
of imagenet images vs passing only the foreground. Here, I've included the code to extract the meta-data for the bounding box, cleaning up the the downloaded stuff, and then changing ImageNet Loader to support only the images that have box annotations.
from costum_imagenet import BackgroundForegroundImageNet
tr = trans.Compose([trans.Resize(224), trans.CenterCrop(224), trans.ToTensor(), ])
dataset = BackgroundForegroundImageNet(root='./data/imagenet/train', download=True, transform=tr)
x, b, f, y = dataset[0]
torchvision.utils.save_image(torch.stack([x, b, f]), 'test1.png')
If you set the value download=True
, the bounding boxes and the indices of imagenet
train split that have the bounding boxes will be downloaded. But if for some reason you want to create your own bounding boxes from the scratch, here's the steps for doing it:
Downloading: First download the data from here:
wget "https://image-net.org/data/bboxes_annotations.tar.gz"
Extract the File:
tar -xvf bboxes_annotations.tar.gz
Extract every subfolder:
cd bboxes_annotations
ls | grep .tar.gz | while read f ; do tar -xvf "${f}" ; done
Convert dataset to JS:
python read_xml.py
Clean the extra 50GB extracted files:
rm *.tar.gz
ls | grep "n.*" | while read f ; do rm -rf "${f}" ; done
Get Indices that have bounding boxes:
python get_indices.py
Then simply pass the path to the files boxes.pt
and indices.pt
to your BackgroundForegroundImageNet
constructor
dataset = BackgroundForegroundImageNet(root='.', download=False, boxes='boxes.pt', indices='indices.pt')