The VSPW dataset with extracted frames and masks is available here. Please download the 480p version of the VSPW dataset.
- Python 3.7
- PyTorch 1.7
- NumPy
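To confirm that the current interpreter matches the versions listed above, a small check like the following may help. It is a minimal sketch, not part of this repository: the function names `installed_version`, `check_environment`, and `report` are hypothetical, and treating the listed versions as the tested setup rather than strict minimums is an assumption.

```python
import importlib
import sys

# Versions listed in the requirements above. Treating them as the tested
# setup (not strict minimums) is an assumption of this sketch.
EXPECTED = {"python": "3.7", "torch": "1.7", "numpy": None}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def check_environment():
    """Collect the versions found in the current interpreter."""
    found = {"python": "{}.{}".format(sys.version_info[0], sys.version_info[1])}
    for name in ("torch", "numpy"):
        found[name] = installed_version(name)
    return found


def report(found, expected=EXPECTED):
    """Build one human-readable line per dependency."""
    lines = []
    for name, want in expected.items():
        have = found.get(name)
        if have is None:
            status = "missing"
        elif want is None or have.split(".")[:2] == want.split("."):
            # Compare only major.minor, so "1.7.1+cu110" matches "1.7".
            status = "ok"
        else:
            status = "differs from listed version {}".format(want)
        lines.append("{}: {} ({})".format(name, have, status))
    return lines


for line in report(check_environment()):
    print(line)
```

A "differs" line is only a hint; other versions may still work, but they are not the ones listed here.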
Download the ImageNet-pretrained models at this link. Put the downloaded archive in the repository root folder and decompress it.
Edit the .sh files in scripts/ and set $DATAROOT to the path of your VSPW_480p directory.
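Before writing a path into the scripts, it can be worth checking that it points to a usable directory. The following is a minimal sketch under stated assumptions: `check_dataroot` is a hypothetical helper, not part of this repository, and it only checks that the directory exists and is non-empty, without assuming anything about the internal layout of VSPW_480p.

```python
import os


def check_dataroot(dataroot):
    """Return the sorted top-level entries of dataroot, or raise if it is unusable."""
    path = os.path.abspath(os.path.expanduser(dataroot))
    if not os.path.isdir(path):
        raise FileNotFoundError("DATAROOT is not a directory: {}".format(path))
    entries = sorted(os.listdir(path))
    if not entries:
        raise ValueError("DATAROOT is empty: {}".format(path))
    return entries


# Example call (placeholder path, replace with your own):
# print(check_dataroot("/your/path/to/VSPW_480p"))
```

The returned list shows what sits directly under the directory, which makes it easy to spot a path that points one level too high or too low.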
PSPNet
sh scripts/run_psp.sh
OCRNet
sh scripts/run_ocr.sh
Set the dataroot and the prediction root in TC_cal.py and VC_perclip.py before running them.
python TC_cal.py
python VC_perclip.py
This implementation builds on this code and RAFT.
@inproceedings{miao2021vspw,
title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},
author={Miao, Jiaxu and Wei, Yunchao and Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},
booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
year={2021}
}