Video StreamingDataset setup #6

kshitijkg · 2023-07-19T00:55:26Z

No description provided.

t46 · 2023-08-24T09:16:34Z

Option 1:
We store entire youtube video and captions using mds writer, we can store bytes. Then in the data loader, in the collate fn, we split the youtube data into multiple clips and return both text and vision in batched format where the batched vision data has dimensions: (B, C, T, H, W, Ch)

Option 2:
We store splitted youtube videos as arrays and dont do the splitting online

t46 · 2023-08-24T09:21:51Z

Uploaded a sample script for Option 2.
https://github.com/t46/video-dataset

t46 · 2023-08-28T00:48:51Z

Uploaded the script for option 1 as well, and stored scripts both for option 1 and 2 under the videorl repo.

https://github.com/TheDuckAI/videorl/tree/feature/streaming-dataset/code/data/streamingdataset/preprocess

t46 · 2023-08-28T14:39:27Z

Updated for 1fps
https://github.com/TheDuckAI/videorl/tree/feature/streaming-dataset/code/data/streamingdataset/preprocess

t46 · 2023-09-02T04:56:11Z

Updated

save all frames including those with no subtitles
split frames and subtitles equally
update speed comparison b/w option 1 and 2
https://github.com/TheDuckAI/videorl/tree/feature/streaming-dataset/code/data/streamingdataset/preprocess

kshitijkg changed the title ~~VideoStreaming dataset setup~~ Video StreamingDataset setup Jul 19, 2023

kshitijkg assigned kshitijkg and t46 Sep 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Video StreamingDataset setup #6

Video StreamingDataset setup #6

kshitijkg commented Jul 19, 2023

t46 commented Aug 24, 2023 •

edited

Loading

t46 commented Aug 24, 2023

t46 commented Aug 28, 2023 •

edited

Loading

t46 commented Aug 28, 2023

t46 commented Sep 2, 2023

Video StreamingDataset setup #6

Video StreamingDataset setup #6

Comments

kshitijkg commented Jul 19, 2023

t46 commented Aug 24, 2023 • edited Loading

t46 commented Aug 24, 2023

t46 commented Aug 28, 2023 • edited Loading

t46 commented Aug 28, 2023

t46 commented Sep 2, 2023

t46 commented Aug 24, 2023 •

edited

Loading

t46 commented Aug 28, 2023 •

edited

Loading