In this work, we propose a new method of facial reenactment which enables to transfer the source face movement by simply tracking sparse facial landmarks. We adopt OpenPose to extract the facial landmarks of each frame of the source video and optimize the latent space of pretrained StyleGAN3 of the target actor to learn our mapping network. By training the network which maps from the facial landmarks to the corresponding latent vector, we are able to transfer the facial expression from the source to the target actor by simply feeding its facial landmarks.
Please check the StyleGAN3 website here for setup.
To optimize latent vectors in StyleGAN3 space of the target actor:
python embedding.py --epochs 30
python optimize.py
To train the mapping network:
python basis.py --epochs 30
To train the SVM classifier for finding latent manipulation directions:
python edit_direction.py
To evaluate the trained model:
python test.py
To visualize the semantically edited latents:
python visualize.py
To calculate landmark difference for quantitative results for face reenactment:
python compare_lmk.py
To calculate LPIPS and FID for quantitative results for emotion manipulation:
python compare_LPIPS_FID.py