The UNET

Dataset

DRIVE 2004

The DRIVE database was established to enable comparative studies on segmentation of blood vessels in retinal images. The dataset is available on kaggle. Sample image and mask are shown below. The dataset contains 20 samples in the training set and 20 samples in the validation set.

UNET from scratch

With reference to the original UNET paper, I build the UNET architecture from scratch with PyTorch. The architecture is shown below.

I train and evaluate the model on the DRIVE 2004 dataset and visualize its predictions (masks). I experiment with different batch sizes and epochs.

Using a pretrained encoder

Using the segmentation_models.pytorch library, I train a UNET with a pretrained encoder (resnet) on the same DRIVE 2004 dataset.

Results

The 'ResUNET' was expected to perform much better than the base UNET however this was not the case. The loss curves show that the 'ResUNET' immediately overfits on the training data. This may be due to the size of the training data (only 20 samples) and/or the complexity of the the pretrained model.

Model without pretrained encoder

Model with pretrained encoder

Prediciton on test data

Ground truth

Both models learn and generalize well over the training data with ResUNET overfitting. The model's prediction is decent but can be significantly improved. There is a lot more than can be done to make this work much better.

TODO

Experiment pretrained encoder
Use command line arguments
Augment data and retrain model
Experiment with other UNET varients (pretrained encoders)
Publish medium article to explain work further

Additional

Code was deveoped locally whilst training runs were done in a colabnotebook and on aws notebook instance (ml.g4dn.xlarge, 1 T4 GPU)

File Structure

│ ├── DRIVE/
│ │ ├── training/ # Train data
│ │ └── test/ # Test data
├── checkpoints/ # Loss curves and weights from experiments
├── media/ # Images, papers
├── predictions/ # Output from inferencing
├── training.py # Custom script for training models
├── .gitignore # List of files and folders ignored by git
├── infer.py # Custom script for running inference
├── dataset.py # Custom dataset class
├── utils.py # Helper functions
└── unet.py # Model architectures

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The UNET

Dataset

DRIVE 2004

UNET from scratch

Using a pretrained encoder

Results

Model without pretrained encoder

Model with pretrained encoder

Prediciton on test data

Ground truth

TODO

Additional

File Structure

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
archive/DRIVE		archive/DRIVE
checkpoints		checkpoints
media		media
predictions		predictions
.gitignore		.gitignore
README.md		README.md
dataset.py		dataset.py
infer.py		infer.py
requirements.txt		requirements.txt
training.py		training.py
unet.py		unet.py
utils.py		utils.py

cyrilakafia/unet-from-scratch

Folders and files

Latest commit

History

Repository files navigation

The UNET

Dataset

DRIVE 2004

UNET from scratch

Using a pretrained encoder

Results

Model without pretrained encoder

Model with pretrained encoder

Prediciton on test data

Ground truth

TODO

Additional

File Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages