How to Obtain the 'good_objs' List or Filtering Code for Objaverse Dataset #165

joeybchen · 2024-10-16T10:24:15Z

Hi,

First of all, thank you for your incredible work on this project! I am working on training the LRM model, but I am having some trouble with the dataset filtering process mentioned in the code.

In src.data.objaverse, you define the ObjaverseData class, which uses a JSON file (valid_paths.json) containing a list of 'good_objs':

class ObjaverseData(Dataset):
    def __init__(self,
        root_dir='objaverse/',
        meta_fname='valid_paths.json',
        input_image_dir='rendering_random_32views',
        target_image_dir='rendering_random_32views',
        input_view_num=6,
        target_view_num=4,
        total_view_n=32,
        fov=50,
        camera_rotation=True,
        validation=False,
    ):
        self.root_dir = Path(root_dir)
        self.input_image_dir = input_image_dir
        self.target_image_dir = target_image_dir

        self.input_view_num = input_view_num
        self.target_view_num = target_view_num
        self.total_view_n = total_view_n
        self.fov = fov
        self.camera_rotation = camera_rotation

        with open(os.path.join(root_dir, meta_fname)) as f:
            filtered_dict = json.load(f)
        paths = filtered_dict['good_objs']
        self.paths = paths
        
        self.depth_scale = 6.0
            
        total_objects = len(self.paths)
        print('============= length of dataset %d =============' % len(self.paths))

Would it be possible for you to either:

Share the valid_paths.json file (or just the 'good_objs' list)?
Or, provide the code you used to generate this list based on the filtering criteria mentioned in the paper?

I am aware of the filtering goals described in the paper (sec 3.2), which include:
"""
The filtering goal is to remove objects that satisfy any of the following criteria: (i) objects without texture maps, (ii) objects with rendered images occupying less than 10% of the view from any angle, (iii) including multiple separate objects, (iv) objects with no caption information provided by the Cap3D dataset, and (v) low-quality objects.
"""
However, I am unsure how to translate these criteria into the correct filtering logic for generating the 'good_objs' list. Any guidance or clarification would be greatly appreciated!

Thank you so much for your help!

The text was updated successfully, but these errors were encountered:

mohamed2020m · 2024-11-16T15:20:03Z

Hi there! did you find anything?
How about the rendering_random_32views too?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to Obtain the 'good_objs' List or Filtering Code for Objaverse Dataset #165

How to Obtain the 'good_objs' List or Filtering Code for Objaverse Dataset #165

joeybchen commented Oct 16, 2024

mohamed2020m commented Nov 16, 2024 •

edited

Loading

How to Obtain the 'good_objs' List or Filtering Code for Objaverse Dataset #165

How to Obtain the 'good_objs' List or Filtering Code for Objaverse Dataset #165

Comments

joeybchen commented Oct 16, 2024

mohamed2020m commented Nov 16, 2024 • edited Loading

mohamed2020m commented Nov 16, 2024 •

edited

Loading