Video Mask Transfiner for High-Quality Video Instance Segmentation [ECCV 2022]
[Project Page | Dataset Page | Paper]
Video Mask Transfiner for High-Quality Video Instance Segmentation,
Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu
ECCV 2022 (arXiv 2207.14012)
Mask annotation comparison between Youtube-VIS and HQ-YTVIS. HQ-YTVIS serves as a new benchmark to facilitate future development (training & evaluation) of VIS methods aiming at higher mask quality.
hq_ytvis_1.mp4
Mask annotations in Youtube-VIS (Left Video) vs. Mask annotations in HQ-YTVIS (Right Video). Please visit our Dataset Page for detailed descriptions of using HQ-YTVIS benchmark.
Dataset Download: HQ-YTVIS Annotation Link
Dataset Usage: replace our annotation json to original YTVIS annotation files.
Please refer to our Installation Guidance and Tube-Mask AP & Tube-Boundary AP Usage Example.
python eval_hqvis.py --save-path prediction_results.json
Please refer to INSTALL.md for installation instructions and dataset preparation.
Please refer to USAGE.md for dataset preparation and detailed running (including testing, visualization, etc.) instructions.
ytvis_result1.mp4
Train on HQ-YTVIS train set and COCO, evaluate on HQ-YTVIS test set.
APB: Tube-Boundary AP (proposed in Eq.1 of the paper)
APM: Tube-Mask AP (proposed in YTVIS paper)
Model | APB | APB75 | ARB1 | APM | ARM75 | download |
---|---|---|---|---|---|---|
VMT_r50 | 30.7 | 24.2 | 31.5 | 50.5 | 54.5 | weight |
VMT_r101 | 33.0 | 29.3 | 33.3 | 51.6 | 55.8 | weight |
VMT_swin_L | 44.8 | 43.4 | 43.0 | 64.8 | 70.1 | weight |
@inproceedings{vmt,
title = {Video Mask Transfiner for High-Quality Video Instance Segmentation},
author = {Ke, Lei and Ding, Henghui and Danelljan, Martin and Tai, Yu-Wing and Tang, Chi-Keung and Yu, Fisher},
booktitle = {European Conference on Computer Vision (ECCV)},
year = {2022}
}
@inproceedings{transfiner,
title={Mask Transfiner for High-Quality Instance Segmentation},
author={Ke, Lei and Danelljan, Martin and Li, Xia and Tai, Yu-Wing and Tang, Chi-Keung and Yu, Fisher},
booktitle = {CVPR},
year = {2022}
}
We thank Mask Transfiner and SeqFormer for their open source codes.