Skip to content

Latest commit

 

History

History
87 lines (85 loc) · 6.15 KB

README.md

File metadata and controls

87 lines (85 loc) · 6.15 KB

HeadCam Objects

Repository for the development of methods for extracting object category information from infant egocentric videos.

Folder Structure:

  • analysis: contains scripts for various analysis pipelines.
    • basic_level_manual_labels
    • full_goldset
    • general_helper_scripts
    • goldset_annotations
    • mturk_pilot
    • panoptic_segmentation_training
    • vedi_pilot
  • data: contains various data files from points in the processing pipeline(annotated image information, segmented images in COCO-JSON format, .manifest files with annotations).
    • annotations: various annotated images from SAYCam set.
      • basic_level_manual_labels: BL, GK, and NB went through and labeled the prominent object in each image using this Colab Notebook.
      • broad_category_segmentations: Used 10 category dictionary and had people go through and label each image with the categories that were present.
      • mturk_detections: pilot bounding box detections with intermediate and final dataframes created using this Colab Notebook
      • faces_hands: annotation dataset from previous project; bounding boxes around faces and hands in dataset.
      • panoptic_segmentations: panoptic segmentations, jsons created using this Colab Notebook.
        • coco_json_format_files: output from reformatting raw segmentations into COCO JSON format.
          • pilot_segmentation.json: first pilot, 9 images with segmentations.
          • pilot_b_segmentations.json: second pilot, 90 images with segmentations.
          • pilot_b_good_segmentations.json: subset of second pilot with confidence thresholded, 60 images with segmentations.
          • pilot_big_segmentations.json: final pilot, 801 image subset of 984 images with segmentations.
          • combined_segmentations.json: final image set (combines final pilot with another set of final images), 3365 images with segmentations.
          • combined_good_segmentations.json: subset of final image set with confidence thresholded, 2215 images with segmentations.
          • rest of folders store the above data, but split into 80/20 training and testing sets.
        • raw_manifest_files: raw Sagemaker output.
    • category_lists: lists of categories we used to label images.
      • categories.txt: basic level category list used as dictionary in annotation tasks.
      • object_list.txt: initial full category list used for basic level pilot MTurk and manual annotations.
    • image_lists: various lists of video/image filenames and urls.
      • SAYCAM_allocentric_videos.csv: 1631 video filenames and whether or not they are allocentric. for filtering out associated images.
      • child_hands.csv: list of 3050 public urls to images with child hands from dataset.
      • goldset_to_annotate.csv: list of 16996 public urls to images, BL made this.
      • hands_sample_annotate.csv: random sample list of 500 public urls to images with hands, subset from hands_to_annotate.csv.
      • hands_to_annotate.csv: list of 11828 public urls to images with hands, subset from goldset_to_annotate.csv.
      • interesting_image_list.txt: list of 1542 image filenames that NB made by sifting through random subset from goldset_to_annotate.csv. FMI, see notes on choosing interesting images.
      • interesting_ims.csv: list of 1000 public urls to images subset from interesting_image_list.txt.
      • people_goldset.csv: list of 9616 public urls to images with people in frame, subset from goldset_to_annotate.csv.
      • person_sample_annotate.csv: random sample list of 500 public urls to images with people in frame, subset from people_goldset.csv.
      • pilotImageURLs.csv: list of 150 public urls to images chosen randomly from interesting_image_list.txt using this helper script.
      • top_category_frames.csv: list of 984 public urls to images.
      • top_frames.csv: list of 953 public urls to images.
    • preprocessed_data: output from processing data using R.
    • saycam_images: includes a zip file of "interesting images" from image_lists/interesting_image_list.txt.
    • vedi_pilot: TODO
  • experiments: task paradigms.
    • mturk_pilot: contains html code for MTurk pilot task collecting bounding box annotations.
  • writing: workspace for papers associated with this project.