A collection of papers I am interested in.
- https://ait.ethz.ch/index.php
- https://liuyebin.com/student.html
- https://virtualhumans.mpi-inf.mpg.de/
- https://ps.is.mpg.de/publications
- https://www.mpi-inf.mpg.de/departments/visual-computing-and-artificial-intelligence/publications
- https://ait.ethz.ch/people/hilliges/
- https://vlg.inf.ethz.ch/publications.html
- https://github.com/eth-ait/aitviewer
- https://github.com/mitsuba-renderer/mitsuba3
- https://github.com/angeloskath/simple-3dviz
- https://github.com/BachiLi/redner
- mmgeneration
- inr-gan
- ADA
- awesome-image-translation
- awesome-gan-inversion
- naver-webtoon-faces
- GAN Experiments
- timm
- fun-with-computer-graphics
- bokeh
- face-parsing.PyTorch
- label-studio
- streamlit-drawable-canvas
- face-alignment
- remove images background
- https://github.com/justinpinkney/awesome-pretrained-stylegan2
- https://github.com/justinpinkney/awesome-pretrained-stylegan3
- generative-evaluation-prdc
- To be read
- Disentanglement
- Inversion
- Encoder
- Survey
- GANs
- Style transfer
- Metric
- Spectrum
- Weakly Supervised Object Localization
- NeRF
- 3D
Title | Venue | Code | Year |
---|---|---|---|
GANSpace: Discovering Interpretable GAN Controls | arXiv:2004.02546 [cs] | GANSpace | 2020 |
Interpreting the Latent Space of GANs for Semantic Face Editing | CVPR | InterFaceGAN | 2020 |
Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | sefa | 2020 |
StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation | arXiv:2011.12799 [cs] | StyleSpace | 2020 |
Unsupervised Image Transformation Learning via Generative Adversarial Networks | arXiv:2103.07751 [cs] | github | 2021 |
Resolution Dependent GAN Interpolation for Controllable Image Synthesis Between Domains | arXiv:2010.05334 [cs] | toonify | 2020 |
WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space | arXiv:2109.13357 [cs] | | 2021 |
Discovering Interpretable Latent Space Directions of GANs beyond Binary Attributes | CVPR | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis | arXiv:1911.09267 [cs] | | 2020 |
Title | Venue | Code | Year |
---|---|---|---|
Generative Visual Manipulation on the Natural Image Manifold | ECCV | | 2018 |
Semantic Photo Manipulation with a Generative Image Prior | ACM Transactions on Graphics | | 2019 |
Seeing What a GAN Cannot Generate | arXiv:1910.11626 [cs, eess] | | 2019 |
In-Domain GAN Inversion for Real Image Editing | ECCV | | 2020 |
Title | Venue | Code | Year |
---|---|---|---|
Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | | 2020 |
GAN “Steerability” without Optimization | arXiv:2012.05328 [cs] | | 2021 |
Low-Rank Subspaces in GANs | arXiv:2106.04488 [cs] | | 2021 |
LARGE: Latent-Based Regression through GAN Semantics | arXiv:2107.11186 [cs] | | 2021 |
Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation | ICCV | | 2021 |
Controllable and Compositional Generation with Latent-Space Energy-Based Models | NeurIPS | LACE | 2021 |
Do Generative Models Know Disentanglement? Contrastive Learning Is All You Need | arXiv:2102.10543 [cs] | DisCo | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
✔️ Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation | ECCV | DGP | 2020 |
✔️ PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models | CVPR | PULSE | 2020 |
✔️ GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution | arXiv:2012.00739 [cs] | | 2020 |
Unsupervised Portrait Shadow Removal via Generative Priors | arXiv:2108.03466 [cs] | | 2021 |
Towards Real-World Blind Face Restoration with Generative Facial Prior | CVPR | GFPGAN | 2021 |
Towards Vivid and Diverse Image Colorization with Generative Color Prior | ICCV | | 2021 |
Self-Validation: Early Stopping for Single-Instance Deep Generative Priors | arXiv:2110.12271 [cs.CV] | | 2021 |
One-Shot Generative Domain Adaptation | arXiv:2111.09876 [cs] | | 2021 |
❤️ Time-Travel Rephotography | ACM Transactions on Graphics | code | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Contrastive Model Inversion for Data-Free Knowledge Distillation | arXiv:2105.08584 [cs] | | 2021 |
Generative Models as a Data Source for Multiview Representation Learning | arXiv:2106.05258 [cs] | | 2021 |
Inverting and Understanding Object Detectors | arXiv:2106.13933 [cs] | | 2021 |
Deep Neural Networks Are Surprisingly Reversible: A Baseline for Zero-Shot Inversion | arXiv:2107.06304 [cs] | | 2021 |
Ensembling with Deep Generative Views | arXiv:2104.14551 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
On the “Steerability” of Generative Adversarial Networks | arXiv:1907.07171 [cs] | | 2020 |
Interpreting the Latent Space of GANs for Semantic Face Editing | CVPR | | 2020 |
GANSpace: Discovering Interpretable GAN Controls | arXiv:2004.02546 [cs] | GANSpace | 2020 |
Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | sefa | 2020 |
StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN | arXiv:2111.01619 [cs] | | 2021 |
Using Latent Space Regression to Analyze and Leverage Compositionality in GANs | ICLR | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
GAN Inversion: A Survey | arXiv:2101.05278 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training | NeurIPS | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
✅ Towards a Better Global Loss Landscape of GANs | NeurIPS | | 2020 |
On the Benefit of Width for Neural Networks: Disappearance of Bad Basins | arXiv:1812.11039 [cs, math, stat] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement | ECCV | | 2020 |
Title | Venue | Code | Year |
---|---|---|---|
Self-Supervised Object Detection via Generative Image Synthesis | arXiv:2110.09848 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Compositional Transformers for Scene Generation | NeurIPS | | 2021 |
❤️ GAN-Supervised Dense Visual Alignment | arXiv:2112.05143 [cs] | gangealing | 2021 |
Improved Transformer for High-Resolution GANs | arXiv:2106.07631 [cs] | | 2021 |
MaskGIT: Masked Generative Image Transformer | arXiv:2202.04200 [cs] | | 2022 |
StyleSwin: Transformer-Based GAN for High-Resolution Image Generation | CVPR | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
ExSinGAN: Learning an Explainable Generative Model from a Single Image | arXiv:2105.07350 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
❤️ Diverse Generation from a Single Video Made Possible | arXiv:2109.08591 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Unbiased Auxiliary Classifier GANs with MINE | arXiv:2006.07567 [cs] | | 2020 |
Twin Auxiliary Classifiers GAN | arXiv:1907.02690 [cs, stat] | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
FreezeG | | github | |
✅ Freeze the Discriminator: A Simple Baseline for Fine-Tuning GANs | arXiv:2002.10964 [cs, stat] | FreezeD | 2020 |
Fine-Tuning StyleGAN2 For Cartoon Face Generation | arXiv:2106.12445 [cs, eess] | Cartoon-StyleGAN | 2021 |
Transferring GANs: Generating Images from Limited Data | ECCV | | 2018 |
Image Generation From Small Datasets via Batch Statistics Adaptation | ICCV | | 2019 |
MineGAN: Effective Knowledge Transfer From GANs to Target Domains With Few Images | CVPR | | 2020 |
Title | Venue | Code | Year |
---|---|---|---|
GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR | | 2020 |
Online Multi-Granularity Distillation for GAN Compression | ICCV | | 2021 |
Revisiting Discriminator in GAN Compression: A Generator-Discriminator Cooperative Compression Scheme | arXiv:2110.14439 [cs] | GCC | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Robust Attentive Deep Neural Network for Exposing GAN-Generated Faces | arXiv:2109.02167 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Labels4Free: Unsupervised Segmentation Using StyleGAN | arXiv:2103.14968 [cs] | | 2021 |
BigDatasetGAN: Synthesizing ImageNet with Pixel-Wise Annotations | arXiv:2201.04684 [cs] | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Alias-Free Generative Adversarial Networks | arXiv:2106.12423 [cs, stat] | | 2021 |
On Buggy Resizing Libraries and Surprising Subtleties in FID Calculation | arXiv:2104.11222 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
TileGAN: Synthesis of Large-Scale Non-Homogeneous Textures | ACM Transactions on Graphics | | 2019 |
InsetGAN for Full-Body Image Generation | arXiv:2203.07293 [cs] | | 2022 |
Collaging Class-Specific GANs for Semantic Image Synthesis | ICCV | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
SC-FEGAN: Face Editing Generative Adversarial Network with User’s Sketch and Color | arXiv:1902.06838 [cs] | | 2019 |
Semantic Text-to-Face GAN - ST^2FG | arXiv:2107.10756 [cs] | | 2021 |
CRD-CGAN: Category-Consistent and Relativistic Constraints for Diverse Text-to-Image Generation | arXiv:2107.13516 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Reproducibility of "FDA: Fourier Domain Adaptation for Semantic Segmentation" | arXiv:2104.14749 [cs] | | 2021 |
A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection | CVPR | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization | arXiv:2103.14862 [cs] | | 2021 |
Finding an Unsupervised Image Segmenter in Each of Your Deep Generative Models | arXiv:2105.08127 [cs] | | 2021 |
Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP | arXiv:2107.12518 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | arXiv:1901.05103 [cs] | | 2019 |
Occupancy Networks: Learning 3D Reconstruction in Function Space | arXiv:1812.03828 [cs] | | 2019 |
❤️ Neural Image Representations for Multi-Image Fusion and Layer Separation | arXiv:2108.01199 [cs] | | 2021 |
Learning Continuous Image Representation with Local Implicit Image Function | CVPR | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
How to Train Your Energy-Based Models | arXiv:2101.03288 | | 2021 |
Your Classifier Is Secretly an Energy Based Model and You Should Treat It Like One | ICLR | JEM | 2020 |
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models | NeurIPS | Generative-Visual-Prompt | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Variational Inference with Normalizing Flows | ICML | | 2015 |
Density Estimation Using Real NVP | ICLR | | 2017 |
- https://github.com/heejkoo/Awesome-Diffusion-Models
- https://github.com/huggingface/diffusers
- https://github.com/Jack000/glid-3-xl
- https://github.com/SirWaffle/AIrtist-k-diffusion-wrap
- https://github.com/altryne/awesome-ai-art-image-synthesis
- https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
- https://github.com/Jack000/glid-3-xl-stable
- https://github.com/Stability-AI/stablediffusion
Title | Venue | Code | Year |
---|---|---|---|
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models | ICCV | ilvr_adm | 2021 |
Diffusion Models Beat GANs on Image Synthesis | arXiv:2105.05233 [cs, stat] | guided-diffusion | 2021 |
An Image Is Worth One Word: Personalizing Text-to-Image Generation Using Textual Inversion | arXiv:2208.01618 | textual_inversion | 2022 |
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | arXiv:2208.12242 | dreambooth, Dreambooth-Stable-Diffusion | 2022 |
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation | CVPR | DiffusionCLIP | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Palette: Image-to-Image Diffusion Models | arXiv:2111.05826 | Palette-Image-to-Image-Diffusion-Models | 2022 |
Image Super-Resolution via Iterative Refinement | arXiv:2104.07636 | Image-Super-Resolution-via-Iterative-Refinement | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | arXiv:2211.09869 | | 2022 |
Magic3D: High-Resolution Text-to-3D Content Creation | arXiv:2211.10440 | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
DiffusionInst: Diffusion Model for Instance Segmentation | arXiv:2212.02773 | DiffusionInst | 2022 |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains | arXiv:2006.10739 [cs] | | 2020 | |
✅ Implicit Neural Representations with Periodic Activation Functions | NeurIPS | | 2020 | |
✅ Modulated Periodic Activations for Generalizable Local Functional Representations | arXiv:2104.03960 [cs] | | 2021 | |
Learned Initializations for Optimizing Coordinate-Based Neural Representations | arXiv:2012.02189 [cs] | nerf-meta | 2021 | |
Seeing Implicit Neural Representations as Fourier Series | arXiv:2109.00249 [cs] | | 2021 | |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
Adversarial Generation of Continuous Images | arXiv:2011.12026 [cs] | inr-gan | 2020 | |
Image Generators with Conditionally-Independent Pixel Synthesis | arXiv:2011.13775 [cs] | CIPS | 2020 | |
A Structured Dictionary Perspective on Implicit Neural Representations | arXiv:2112.01917 [cs] | | 2021 | |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras | ECCV | DiffuStereo | 2022 | |
DiffRF: Rendering-Guided 3D Radiance Field Diffusion | arXiv:2212.01206 | DiffRF | 2022 |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs | CVPR | | 2022 | |
Block-NeRF: Scalable Large Scene Neural View Synthesis | CVPR | BlockNeRFPytorch | 2022 | |
IBRNet: Learning Multi-View Image-Based Rendering | arXiv:2102.13090 [cs] | IBRNet | 2021 |
- https://github.com/kakaobrain/NeRF-Factory/ ❤️
- https://github.com/openxrlab/xrnerf
- https://github.com/ActiveVisionLab/nerfmm
- https://github.com/ventusff/improved-nerfmm
- https://github.com/Kai-46/nerfplusplus
- https://github.com/kwea123/nerf_pl
- https://github.com/NVlabs/instant-ngp
- https://github.com/sxyu/nerfvis
- https://github.com/frozoul/4K-NeRF
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
Unsupervised 3D Shape Completion through GAN Inversion | CVPR | | 2021 | |
3D GAN Inversion for Controllable Portrait Image Animation | arXiv:2203.13441 [cs] | | 2022 | |
Pix2NeRF: Unsupervised Conditional $\pi$-GAN for Single Image to Neural Radiance Fields Translation | arXiv:2202.13162 [cs] | | 2022 | |
Monocular 3D Object Reconstruction with GAN Inversion | ECCV | | 2022 | |
INeRF: Inverting Neural Radiance Fields for Pose Estimation | IROS | inerf | 2021 | |
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | arXiv:2211.11674 | nerf-from-image | 2022 |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes | arXiv:2011.13084 [cs] | Neural-Scene-Flow-Fields | 2021 | |
D-NeRF: Neural Radiance Fields for Dynamic Scenes | arXiv:2011.13961 [cs] | D-NeRF | 2020 | |
Dynamic View Synthesis from Dynamic Monocular Video | arXiv:2105.06468 [cs] | DynamicNeRF | 2021 | |
❤️ HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields | arXiv:2106.13228 [cs] | hypernerf | 2021 | |
Neural Radiance Flow for 4D View Synthesis and Video Processing | | | 2020 | |
❤️ Animatable Neural Implicit Surfaces for Creating Avatars from Videos | arXiv:2203.08133 [cs] | | 2022 | |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
✔️ Predicting Loose-Fitting Garment Deformations Using Bone-Driven Motion Networks | SIGGRAPH | VirtualBones | 2022 | |
TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style | CVPR | TailorNet_dataset | 2020 | |
Learning Implicit Templates for Point-Based Clothed Human Modeling | ECCV | | 2022 | |
3D Clothed Human Reconstruction in the Wild | ECCV | ClothWild_RELEASE | 2022 | |
❤️ TightCap: 3D Human Shape Capture with Clothing Tightness Field | ACM Transactions on Graphics | TightCap | 2021 | |
ARCH: Animatable Reconstruction of Clothed Humans | CVPR | ARCH | 2020 |
Title | Venue | Code | Year | Cite |
---|---|---|---|---|
❤️ Learning Skeletal Articulations with Neural Blend Shapes | ACM Transactions on Graphics | neural-blend-shapes | 2021 | |
Title | Venue | Code | Year |
---|---|---|---|
Collaborative Neural Rendering Using Anime Character Sheets | arXiv:2207.05378 [cs] | CoNR | 2022 |
- https://github.com/3DFaceBody/awesome-3dbody-papers
- https://github.com/openMVG/awesome_3DReconstruction_list
- https://github.com/ytrock/THuman2.0-Dataset
- https://github.com/Danial-Kord/DigiHuman
- https://github.com/zhaofuq/Instant-NSR
Title | Venue | Code | Year |
---|---|---|---|
SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video | arXiv:2201.12792 [cs] | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Neural Head Reenactment with Latent Pose Descriptors | CVPR | latent-pose-reenactment | 2020 |
Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry | arXiv:2110.09772 [cs] | | 2021 |
REALY: Rethinking the Evaluation of 3D Face Reconstruction | ECCV | REALY | 2022 |
- https://github.com/TimoBolkart/BFM_to_FLAME
- https://github.com/HavenFeng/photometric_optimization
- https://github.com/soubhiksanyal/FLAME_PyTorch
- https://github.com/Azmarie/Face-Morphing
Title | Venue | Code | Year |
---|---|---|---|
Unified Implicit Neural Stylization | ECCV | | 2022 |
ARF: Artistic Radiance Fields | ECCV | ARF-svox2 | 2022 |
UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene | arXiv:2208.07059 | UPST-NeRF | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer | arXiv:2203.13248 [cs] | DualStyleGAN | 2022 |
Stitch It in Time: GAN-Based Facial Editing of Real Videos | arXiv | STIT | 2022 |
Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN | arXiv:2204.14079 [cs] | FixNoise | 2022 |
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | ECCV | AnimeCeleb | 2022 |
DCT-Net: Domain-Calibrated Translation for Portrait Stylization | ACM Transactions on Graphics | DCT-Net | 2022 |
VToonify: Controllable High-Resolution Portrait Video Style Transfer | ACM Transactions on Graphics | VToonify | 2022 |
BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation | NeurIPS | BlendGAN | 2021 |
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping | CVPR | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Thin-Plate Spline Motion Model for Image Animation | CVPR | | 2022 |
Depth-Aware Generative Adversarial Network for Talking Head Video Generation | CVPR | DaGAN | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
NeILF: Neural Incident Light Field for Physically-Based Material Estimation | ECCV | neilf | 2022 |
NeRF for Outdoor Scene Relighting | ECCV | NeRF-OSR | 2022 |
- https://github.com/xianfei/SysMocap
- https://github.com/zju3dv/EasyMocap
- https://github.com/EricGuo5513/HumanML3D
Title | Venue | Code | Year |
---|---|---|---|
Learning Implicit Fields for Generative Shape Modeling | arXiv:1812.02822 [cs] | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
End-to-End Recovery of Human Shape and Pose | CVPR | hmr | 2018 |
VIBE: Video Inference for Human Body Pose and Shape Estimation | CVPR | VIBE | 2020 |
TransPose: Real-Time 3D Human Translation and Pose Estimation with Six Inertial Sensors | ACM Transactions on Graphics | TransPose | 2021 |
Monocular Expressive Body Regression through Body-Driven Attention | ECCV | expose | 2020 |
Human Mesh Recovery from Multiple Shots | CVPR | multishot | 2022 |
❤️ Learned Vertex Descent: A New Direction for 3D Human Model Fitting | ECCV | LVD | 2022 |
DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation | ECCV | DeciWatch | 2022 |
PARE: Part Attention Regressor for 3D Human Body Estimation | ICCV | PARE | 2021 |
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers | ECCV | FastMETRO | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Real-Time High-Resolution Background Matting | arXiv:2012.07810 | BackgroundMattingV2 | 2020 |
Robust High-Resolution Video Matting with Temporal Guidance | arXiv:2108.11515 [cs] | RobustVideoMatting | 2021 |
- https://github.com/karfly/human36m-camera-parameters
- https://github.com/deepimagination/TalkingHead-1KH
Title | Venue | Code | Year |
---|---|---|---|
Structured Local Radiance Fields for Human Avatar Modeling | CVPR | THUman4.0-Dataset | 2022 |
Multiface: A Dataset for Neural Face Rendering | arXiv:2207.11243 [cs.CV] | multiface | 2022 |
ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations | CVPR | ImFace | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Towards Metrical Reconstruction of Human Faces | ECCV | MICA | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information | CVPR | barc_release | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation | arXiv:2203.15224 [cs] | PanopticNeRF | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
✅ DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | arXiv:1901.05103 [cs] | DeepSDF | 2019 |
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling | NeurIPS | | 2016 |
Occupancy Networks: Learning 3D Reconstruction in Function Space | arXiv:1812.03828 [cs] | | 2019 |
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization | arXiv:1905.05172 [cs] | | 2019 |
Deep Meta Functionals for Shape Representation | arXiv:1908.06277 [cs] | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
Escaping Plato’s Cave: 3D Shape From Adversarial Rendering | ICCV | | 2019 |
StyleRig: Rigging StyleGAN for 3D Control over Portrait Images | arXiv:2004.00121 [cs] | | 2020 |
Exemplar-Based 3D Portrait Stylization | arXiv:2104.14559 [cs] | github | 2021 |
❤️ Landmark Detection and 3D Face Reconstruction for Caricature Using a Nonlinear Parametric Model | arXiv:2004.09190 [cs] | CaricatureFace | 2021 |
SofGAN: A Portrait Image Generator with Dynamic Styling | arXiv:2007.03780 [cs] | sofgan | 2021 |
❤️ FreeStyleGAN: Free-View Editable Portrait Rendering with the Camera Manifold | arXiv:2109.09378 [cs] | | 2021 |
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering | ICCV | PIRender | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Point-Based Modeling of Human Clothing | ICCV | | 2021 |
ADOP: Approximate Differentiable One-Pixel Point Rendering | arXiv:2110.06635 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Learning to Stylize Novel Views | arXiv:2105.13509 [cs] | stylescene | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Common Objects in 3D: Large-Scale Learning and Evaluation of Real-Life 3D Category Reconstruction | ICCV | | 2021 |
A 3D Face Model for Pose and Illumination Invariant Face Recognition | IEEE International Conference on Advanced Video and Signal Based Surveillance | BFM | 2009 |
SfSNet: Learning Shape, Reflectance and Illuminance of Faces in the Wild | arXiv:1712.01261 [cs] | | 2018 |
Title | Venue | Code | Year |
---|---|---|---|
Visual Object Networks: Image Generation with Disentangled 3D Representation | arXiv:1812.02725 [cs, stat] | | 2018 |
Escaping Plato’s Cave: 3D Shape From Adversarial Rendering | ICCV | | 2019 |
HoloGAN: Unsupervised Learning of 3D Representations from Natural Images | ICCV | | 2019 |
- https://github.com/wuhuikai/FaceSwap
- https://github.com/hysts/anime-face-detector
- https://github.com/qq775193759/3D-CariGAN
- https://github.com/yeemachine/kalidokit
- https://github.com/sicxu/Deep3DFaceRecon_pytorch
- https://github.com/happy-jihye/face-vid2vid-demo
Title | Venue | Code | Year |
---|---|---|---|
FaceEraser: Removing Facial Parts for Augmented Reality | arXiv:2109.10760 [cs] | | 2021 |
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing | arXiv:2109.10737 [cs] | | 2021 |
❤️ StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators | arXiv:2108.00946 [cs] | | 2021 |
Beholder-GAN: Generation and Beautification of Facial Images with Conditioning on Their Beauty Level | arXiv:1902.02593 [cs] | | 2019 |
Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks | arXiv:2110.08398 [cs] | | 2021 |
Fine-Grained Control of Artistic Styles in Image Generation | arXiv:2110.10278 [cs] | | 2021 |
- https://github.com/Sxela/ArcaneGAN
- https://github.com/mchong6/GANsNRoses
- https://github.com/FilipAndersson245/cartoon-gan
- https://github.com/venture-anime/cartoongan-pytorch
Title | Venue | Code | Year |
---|---|---|---|
AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation | arXiv:2102.12593 [cs] | | 2021 |
AnimeGAN: A Novel Lightweight GAN for Photo Animation | | AnimeGANv2 | 2020 |
❤️ Learning to Cartoonize Using White-Box Cartoon Representations | CVPR | White-box-Cartoonization | 2020 |
Generative Adversarial Networks for Photo to Hayao Miyazaki Style Cartoons | arXiv:2005.07702 [cs, eess] | | 2020 |
Title | Venue | Code | Year |
---|---|---|---|
A Morphable Model for the Synthesis of 3D Faces | SIGGRAPH | 3DMM | 1999 |
Title | Venue | Code | Year |
---|---|---|---|
SketchHairSalon: Deep Sketch-Based Hair Image Synthesis | arXiv:2109.07874 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Face Alignment Across Large Poses: A 3D Solution | IEEE Transactions on Pattern Analysis and Machine Intelligence | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild | CVPR | | 2015 |
Title | Venue | Code | Year |
---|---|---|---|
Semi-Supervised Domain Adaptation via Adaptive and Progressive Feature Alignment | arXiv:2106.02845 [cs] | | 2021 |
Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation | arXiv:2101.10979 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Network Augmentation for Tiny Deep Learning | arXiv:2110.08890 [cs] | | 2021 |
Non-Deep Networks | arXiv:2110.07641 [cs] | | 2021 |
When to Prune? A Policy towards Early Structural Pruning | arXiv:2110.12007 [cs] | | 2021 |
❤️ ConformalLayers: A Non-Linear Sequential Neural Network with Associative Layers | arXiv:2110.12108 [cs] | | 2021 |
CHIP: CHannel Independence-Based Pruning for Compact Neural Networks | arXiv:2110.13981 [cs] | | 2021 |
Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training | arXiv:2102.02887 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Making Convolutional Networks Shift-Invariant Again | arXiv:1904.11486 [cs] | | 2019 |
Group Equivariant Convolutional Networks | ICML | | 2016 |
Harmonic Networks: Deep Translation and Rotation Equivariance | CVPR | | 2017 |
Learning Steerable Filters for Rotation Equivariant CNNs | CVPR | | 2018 |
Title | Venue | Code | Year |
---|---|---|---|
AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance | arXiv:2109.06397 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Anchor DETR: Query Design for Transformer-Based Detector | arXiv:2109.07107 [cs] | | 2021 |
❤️ Detecting Twenty-Thousand Classes Using Image-Level Supervision | arXiv:2201.02605 [cs] | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Robust High-Resolution Video Matting with Temporal Guidance | arXiv:2108.11515 [cs.CV] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
ResMLP: Feedforward Networks for Image Classification with Data-Efficient Training | arXiv:2105.03404 [cs] | | 2021 |
ConvMLP: Hierarchical Convolutional MLPs for Vision | arXiv:2109.04454 [cs] | | 2021 |
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP | arXiv:2108.13002 [cs.CV] | | 2021 |
Sparse-MLP: A Fully-MLP Architecture with Conditional Computation | arXiv:2109.02008 [cs] | | 2021 |
MLP-Mixer: An All-MLP Architecture for Vision | | | 2021 |
CycleMLP: A MLP-like Architecture for Dense Prediction | ICLR | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
❤️ How Transferable Are Features in Deep Neural Networks? | arXiv:1411.1792 [cs] | | 2014 |
Title | Venue | Code | Year |
---|---|---|---|
Positional Encoding as Spatial Inductive Bias in GANs | arXiv:2012.05217 [cs] | | 2020 |
Mind the Pad -- CNNs Can Develop Blind Spots | arXiv:2010.02178 [cs, stat] | | 2020 |
❤️ How Much Position Information Do Convolutional Neural Networks Encode? | ICLR | | 2020 |
On Translation Invariance in CNNs: Convolutional Layers Can Exploit Absolute Spatial Location | CVPR | | 2020 |
Rethinking and Improving Relative Position Encoding for Vision Transformer | ICCV | | 2021 |
A Structured Dictionary Perspective on Implicit Neural Representations | arXiv:2112.01917 [cs] | | 2021 |
Title | Venue | Code | Year |
---|---|---|---|
Neural Architecture Search with Reinforcement Learning | ICLR | | 2017 |
Learning Transferable Architectures for Scalable Image Recognition | CVPR | | 2018 |
Progressive Neural Architecture Search | ECCV | | 2018 |
Efficient Neural Architecture Search via Parameter Sharing | ICML | | 2018 |
MnasNet: Platform-Aware Neural Architecture Search for Mobile | CVPR | | 2019 |
DARTS: Differentiable Architecture Search | ICLR | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial Networks | IEEE Transactions on Pattern Analysis and Machine Intelligence | | 2021 |
GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR | | 2020 |
Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search | ECCV | | 2020 |
AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks | ICML | | 2020 |
A Multi-Objective Architecture Search for Generative Adversarial Networks | | | 2020 |
AutoGAN: Neural Architecture Search for Generative Adversarial Networks | ICCV | | 2019 |
Title | Venue | Code | Year |
---|---|---|---|
FILM: Frame Interpolation for Large Motion | arXiv:2202.04901 [cs] | | 2022 |
Title | Venue | Code | Year |
---|---|---|---|
Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering | IEEE Transactions on Image Processing | | 2007 |
Towards Flexible Blind JPEG Artifacts Removal | arXiv:2109.14573 [cs, eess] | FBCNN | 2021 |