clevr_with_masks

This repository contains code for generating a dataset in the official CLEVR dataset format from the tfrecords file provided by DeepMind's multi_object_datasets repository.

Output sample

You can see more output samples in the sample folder. Object masks are generated and saved as PNG files.

Image
Masks

Added metadata

We added a bounding box for each object; the bbox info is stored in the JSON files. You can access the bbox data as below.

import json
with open(your_json_file) as f:  # your_json_file: path to the generated JSON file
    json_data = json.load(f)
bbox = json_data['scenes'][image_id]['objects'][object_id]['bbox']
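
As a minimal sketch of the access pattern above, the helper below collects the bbox of every object in one scene. The function name `bboxes_for_scene` and the toy dictionary are hypothetical and for illustration only; only the `scenes`, `objects`, and `bbox` keys come from the snippet above.

```python
def bboxes_for_scene(json_data, image_id):
    """Return the bbox of every object in scene `image_id`."""
    scene = json_data['scenes'][image_id]
    return [obj['bbox'] for obj in scene['objects']]


# Toy data for illustration only; real values come from the generated JSON file.
toy = {'scenes': [{'objects': [{'bbox': [10, 50, 20, 60]},
                               {'bbox': [0, 5, 0, 5]}]}]}
print(bboxes_for_scene(toy, 0))
```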

We followed the x, y index order of pixel_coords in the original data. Hence, to extract the image region inside a bbox, slice the image as below.

# image in numpy form, [H, W, C], and bbox from above
object_in_bbox = image[bbox[2]: bbox[3], bbox[0]: bbox[1]]
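
The slicing above can be wrapped in a small helper. This is only a sketch: the name `crop_to_bbox` is hypothetical, and the layout `[x_min, x_max, y_min, y_max]` is inferred from the slice indices above rather than stated explicitly. The slice treats the upper bounds as exclusive, as the original snippet does.

```python
import numpy as np


def crop_to_bbox(image, bbox):
    """Crop an [H, W, C] image to a bbox.

    Assumes bbox = [x_min, x_max, y_min, y_max], inferred from the README slice.
    """
    x_min, x_max, y_min, y_max = bbox
    # Rows are indexed by y, columns by x.
    return image[y_min:y_max, x_min:x_max]


# Dummy image for illustration; a real image would be loaded from the output folder.
image = np.zeros((240, 320, 3), dtype=np.uint8)
patch = crop_to_bbox(image, [10, 60, 20, 50])
print(patch.shape)  # (30, 50, 3)
```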

Preparation

Environment
- tensorflow (we used 2.4.1, but other versions should also work)
- numpy (we used 1.19.5, but other versions should also work)

Download tfrecords [here]

Run

python load_clevr_with_masks.py --tf_records_path {path_for_tf_records} --target_path {path_for_results}

Citation

@inproceedings{johnson2017clevr,
  title={CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning},
  author={Johnson, Justin and Hariharan, Bharath and van der Maaten, Laurens
          and Fei-Fei, Li and Zitnick, C Lawrence and Girshick, Ross},
  booktitle={CVPR},
  year={2017}
}
@misc{multiobjectdatasets19,
  title={Multi-Object Datasets},
  author={Kabra, Rishabh and Burgess, Chris and Matthey, Loic and
          Kaufman, Raphael Lopez and Greff, Klaus and Reynolds, Malcolm and
          Lerchner, Alexander},
  howpublished={https://github.com/deepmind/multi-object-datasets/},
  year={2019}
}
