From db4cef5471431b21e362e694190b75a57af87b56 Mon Sep 17 00:00:00 2001
From: 1996scarlet <1996scarlet@gmail.com>
Date: Tue, 20 Oct 2020 22:06:31 +0800
Subject: [PATCH] update readme

---
 Readme.md | 70 +++++++++++++++++++++++++++++++------------------------
 1 file changed, 40 insertions(+), 30 deletions(-)

diff --git a/Readme.md b/Readme.md
index a6379f8..a439117 100644
--- a/Readme.md
+++ b/Readme.md
@@ -1,11 +1,12 @@
 # OpenVtuber-虚拟爱抖露共享计划
-## Kizuna-Ai MMD demo : face capture via single RGB camera
+Kizuna-Ai MMD demo : face capture via single RGB camera
 ## Installation
+
 ### Requirements
 * Python 3.5+
@@ -22,56 +23,65 @@ While not required, for optimal performance(especially for the detector) it is h
 * `python3.7 ./PythonClient/vtuber_usb_camera.py --gpu -1`
 ## 人脸检测 (Face Detection)
-* [RetinaFace: Single-stage Dense Face Localisation in the Wild](https://arxiv.org/abs/1905.00641)
-* [RetinaFace (mxnet version)](https://github.com/deepinsight/insightface/tree/master/RetinaFace)
-
-RetinaFace is a practical single-stage [SOTA](http://shuoyang1213.me/WIDERFACE/WiderFace_Results.html) face detector which is initially described in [arXiv technical report](https://arxiv.org/abs/1905.00641)
-![demoimg1](https://github.com/deepinsight/insightface/blob/master/resources/11513D05.jpg)
+[RetinaFace: Single-stage Dense Face Localisation in the Wild](https://openaccess.thecvf.com/content_CVPR_2020/html/Deng_RetinaFace_Single-Shot_Multi-Level_Face_Localisation_in_the_Wild_CVPR_2020_paper.html) of **CVPR 2020**, is a practical single-stage [SOTA](http://shuoyang1213.me/WIDERFACE/WiderFace_Results.html) face detector. It is highly recommended to read the official repo [RetinaFace (mxnet version)](https://github.com/deepinsight/insightface/tree/master/RetinaFace).
-![demoimg2](https://github.com/deepinsight/insightface/blob/master/resources/widerfacevaltest.png)
+However, since the detection target of the face capture system is in the middle-close range, there is no need for complex pyramid scaling. We designed and published [Faster RetinaFace](https://github.com/1996scarlet/faster-mobile-retinaface) to trade off between speed and accuracy, which can reach 500~1000 fps on normal laptops.
-## 头部姿态估计(Head Pose Estimation)
-* [head-pose-estimation](https://github.com/lincolnhard/head-pose-estimation)
+| Plan | Inference | Postprocess | Throughput Capacity (FPS)
+| --------|-----|--------|---------
+| 9750HQ+1660TI | 0.9ms | 1.5ms | 500~1000
+| Jetson-Nano | 4.6ms | 11.4ms | 80~200
 ## 特征点检测(Facial Landmarks Tracking)
+
 The 2D pre-trained model is from the [deep-face-alignment](https://github.com/deepinx/deep-face-alignment) repository.
+
 * Algorithm from [TPAMI 2019](https://arxiv.org/pdf/1808.04803.pdf)
 * Training set is based on i-bug 300-W datasets. It's annotation is shown below: