update readme
simon-ging committed Feb 24, 2024
1 parent f6e1123 commit 9c29ba3
Showing 1 changed file with 20 additions and 8 deletions.
28 changes: 20 additions & 8 deletions README.md
@@ -167,21 +167,33 @@ cd data/activitynet
wget http://ec2-52-25-205-214.us-west-2.compute.amazonaws.com/files/activity_net.v1-3.min.json
cd ../..

# install ffmpeg system-wide or via conda-forge
# install ffmpeg python helper and pytube:
pip install ffmpeg-python pytube


# download existing videos from youtube
pip install pytube
python -m ovqa.cli.download_activitynet
# afterwards request the missing videos, see author homepage
# TBD compute_missing_videos.py script
# now all videos should be setup in data/activitynet/videos
# they will be downloaded to
# data/activitynet/videos/
# some videos will be missing.

# install ffmpeg system-wide or via conda-forge
# install ffmpeg python helper
pip install ffmpeg-python
# next, run the frame extraction script. if there are corrupt videos, delete them.
python -m ovqa.cli.extract_activitynet_frames --num_workers 4

# compute the list of missing videos. this creates data/activitynet/missing_video_ids.txt
python -m ovqa.cli.check_missing_activitynet

# request the missing videos from the authors (see homepage)
# and copy them to data/activitynet/videos
# now the folder should contain 4926 videos.
# run the frame extraction again, to extract the frames for the missing videos.
python -m ovqa.cli.extract_activitynet_frames --num_workers 4

# extract the frames
python -m ovqa.cli.extract_activitynet_frames
# TBD copy required frames, and delete frames that are not needed

# # TBD frames as tar
# mkdir -p frames_uncropped
# cd frames_uncropped
# tar -xf middleframes_val.tar
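
The missing-video check described in the diff can be sketched roughly as follows. This is not the code of `ovqa.cli.check_missing_activitynet`, only a minimal illustration under stated assumptions: the annotation file has a top-level `database` dict keyed by video id, each entry has a `subset` field, the 4926 expected videos are the `validation` subset, and video files are named `<id>.<ext>` or `v_<id>.<ext>`. The function name `find_missing_video_ids` is hypothetical.

```python
import json
from pathlib import Path


def find_missing_video_ids(annotation_file, video_dir, subset="validation"):
    """Return the sorted ids of annotated videos that have no file in video_dir.

    Hypothetical helper, not part of the ovqa package.
    """
    with open(annotation_file, encoding="utf-8") as f:
        # Assumption: video ids are the keys of the top-level "database" dict.
        database = json.load(f)["database"]

    # Assumption: each entry carries a "subset" field such as "validation".
    expected = {vid for vid, meta in database.items() if meta.get("subset") == subset}

    # Assumption: files are named "<id>.<ext>" or "v_<id>.<ext>".
    present = set()
    for path in Path(video_dir).iterdir():
        if path.is_file():
            present.add(path.stem)
            if path.stem.startswith("v_"):
                present.add(path.stem[2:])

    return sorted(expected - present)
```

Calling `find_missing_video_ids("data/activitynet/activity_net.v1-3.min.json", "data/activitynet/videos")` would then list the ids that still need to be requested. An empty result would mean every expected video is on disk.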