Skip to content

Tensorflow implementation of Semantic Image Inpainting with Deep Generative Models

Notifications You must be signed in to change notification settings

ChengBinJin/semantic-image-inpainting

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 

Repository files navigation

Semantic Image Inpainting TensorFlow

This repository is a Tensorflow implementation of the Semantic Image Inpainting with Deep Generative Models, CVPR2017.

Requirements

  • tensorflow 1.9.0
  • python 3.5.3
  • numpy 1.14.2
  • pillow 5.0.0
  • matplotlib 2.0.2
  • scipy 0.19.0
  • opencv 3.2.0
  • pyamg 3.3.2

Semantic Image Inpainting

  1. celebA
  • Note: The following resutls are cherry-picked images

  1. SVHN
  • Note: The following resutls are cherry-picked images

  1. Failure Examples

Documentation

Download Dataset

  1. celebA Dataset Use the following command to download CelebA dataset and copy the CelebA dataset on the corresponding file as introduced in Directory Hierarchy information. Manually remove approximately 2,000 images from the dataset for testing, put them on the val folder and others in the `train' folder.
python download.py celebA
  1. SVHN Dataset
    Download SVHN data from The Street View House Numbers (SVHN) Dataset website. Two mat files you need to download are train_32x32.mat and test_32x32.mat in Cropped Digits Format 2.

Directory Hierarchy

.
│   semantic_image_inpainting
│   ├── src
│   │   ├── dataset.py
│   │   ├── dcgan.py
│   │   ├── download.py
│   │   ├── inpaint_main.py
│   │   ├── inpaint_model.py
│   │   ├── inpaint_solver.py
│   │   ├── main.py
│   │   ├── solver.py
│   │   ├── mask_generator.py
│   │   ├── poissonblending.py
│   │   ├── tensorflow_utils.py
│   │   └── utils.py
│   Data
│   ├── celebA
│   │   ├── train
│   │   └── val
│   ├── svhn
│   │   ├── test_32x32.mat
│   │   └── train_32x32.mat

src: source codes of the Semantic-image-inpainting

Implementation Details

We need two sperate stages to utilize semantic image inpainting model.

  • First, independently train DCGAN on your dataset as the original DCGAN process.
  • Second, use pretrained DCGAN and semantic-image-inpainting model to restore the corrupt images.

Same generator and discriminator networks of the DCGAN are used as described in Alec Radford's paper, except that batch normalization of training mode is used in training and test mode that we found to get more stalbe results. Semantic image inpainting model is implemented as moodoki's semantic_image_inpainting. Some bugs and different implementations of the original paper are fixed.

Stage 1: Training DCGAN

Use main.py to train a DCGAN network. Example usage:

python main.py --is_train=true
  • gpu_index: gpu index, default: 0
  • batch_size: batch size for one feed forward, default: 256
  • dataset: dataset name for choice [celebA|svhn], default: celebA
  • is_train: training or inference mode, default: False
  • learning_rate: initial learning rate, default: 0.0002
  • beta1: momentum term of Adam, default: 0.5
  • z_dim: dimension of z vector, default: 100
  • iters: number of interations, default: 200000
  • print_freq: print frequency for loss, default: 100
  • save_freq: save frequency for model, default: 10000
  • sample_freq: sample frequency for saving image, default: 500
  • sample_size: sample size for check generated image quality, default: 64
  • load_model: folder of save model that you wish to test, (e.g. 20180704-1736). default: None

Evaluate DCGAN

Use main.py to evaluate a DCGAN network. Example usage:

python main.py --is_train=false --load_model=folder/you/wish/to/test/e.g./20180704-1746

Please refer to the above arguments.

Stage 2: Utilize Semantic-image-inpainting Model

Use inpaint_main.py to utilize semantic-image-inpainting model. Example usage:

python inpaint_main.py --dataset=celebA \
    --load_model=DCGAN/model/you/want/to/use/e.g./20180704-1746 \
    --mask_type=center
  • gpu_index': gpu index, default: 0`
  • dataset: dataset name for choice [celebA|svhn], default: celebA
  • learning_rate: learning rate to update latent vector z, default: 0.01
  • momentum: momentum term of the NAG optimizer for latent vector, default: 0.9
  • z_dim: dimension of z vector, default: 100
  • lamb: hyper-parameter for prior loss, default: 3
  • is_blend: blend predicted image to original image, default: true
  • mask_type: mask type choice in [center|random|half|pattern], default: center
  • img_size: image height or width, default: 64
  • iters: number of iterations to optimize latent vector, default: 1500
  • num_try: number of random samples, default: 20
  • print_freq: print frequency for loss, default: 100
  • sample_batch: number of sampling images, default: 2
  • load_model: saved DCGAN model that you with to test, (e.g. 20180705-1736), default: None

Loss for Optimizing Latent Vector

  • Content Loss

  • Prior Loss

  • Total Loss

Citation

  @misc{chengbinjin2018semantic-image-inpainting,
    author = {Cheng-Bin Jin},
    title = {semantic-image-inpainting},
    year = {2018},
    howpublished = {\url{https://github.com/ChengBinJin/semantic-image-inpainting}},
    note = {commit xxxxxxx}
  }

Attributions/Thanks

License

Copyright (c) 2018 Cheng-Bin Jin. Contact me for commercial use (or rather any use that is not academic research) (email: [email protected]). Free for research use, as long as proper attribution is given and this copyright notice is retained.

Related Projects

About

Tensorflow implementation of Semantic Image Inpainting with Deep Generative Models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages