The official ResNet implementations (including the timm library) only provide 4-stage models designed for the ImageNet dataset. In the experiments section of the original ResNet paper, however, the CIFAR-10/100 models use only 3 stages (e.g., # blocks = [5, 5, 5] in ResNet-32). If we directly run a 4-stage ResNet-32 on CIFAR-10, we only get ~88% top-1 accuracy, far below the 92.49% reported in the original paper.

This project therefore reproduces the CIFAR-style ResNet models (20/32/44/56/110/1202 layers) on CIFAR-10 with the timm library. The reproduced ResNet-32 reaches 92.45%~92.61% top-1 accuracy, matching the 92.49% reported on CIFAR-10.
Note: the ResNet architectures and training configurations in this repo can be applied to CIFAR-100 as-is by simply replacing the dataset path and the number of classes (i.e., 100).
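For reference, below is a minimal sketch of the 3-stage CIFAR ResNet described in the original paper (16/32/64 channels, depth = 6n + 2 with n blocks per stage, so n = 5 gives ResNet-32). The class and helper names are illustrative, not the exact code in `src/resnet.py`:

```python
import torch.nn as nn

class BasicBlock(nn.Module):
    """3x3 + 3x3 residual block with an identity or 1x1 projection shortcut."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.conv1 = nn.Conv2d(in_ch, out_ch, 3, stride, 1, bias=False)
        self.bn1 = nn.BatchNorm2d(out_ch)
        self.conv2 = nn.Conv2d(out_ch, out_ch, 3, 1, 1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_ch)
        self.relu = nn.ReLU(inplace=True)
        self.shortcut = nn.Sequential()  # identity by default
        if stride != 1 or in_ch != out_ch:
            self.shortcut = nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 1, stride, bias=False),
                nn.BatchNorm2d(out_ch))

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + self.shortcut(x))

class CifarResNet(nn.Module):
    """3-stage ResNet for 32x32 inputs: depth = 6n + 2, e.g. n=5 -> ResNet-32."""
    def __init__(self, blocks=(5, 5, 5), num_classes=10):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(3, 16, 3, 1, 1, bias=False),
            nn.BatchNorm2d(16), nn.ReLU(inplace=True))
        self.stage1 = self._make_stage(16, 16, blocks[0], stride=1)  # 32x32
        self.stage2 = self._make_stage(16, 32, blocks[1], stride=2)  # 16x16
        self.stage3 = self._make_stage(32, 64, blocks[2], stride=2)  # 8x8
        self.head = nn.Linear(64, num_classes)

    @staticmethod
    def _make_stage(in_ch, out_ch, n_blocks, stride):
        layers = [BasicBlock(in_ch, out_ch, stride)]
        layers += [BasicBlock(out_ch, out_ch) for _ in range(n_blocks - 1)]
        return nn.Sequential(*layers)

    def forward(self, x):
        x = self.stage3(self.stage2(self.stage1(self.stem(x))))
        x = x.mean(dim=(2, 3))  # global average pooling
        return self.head(x)
```

In this scheme, `blocks=(5, 5, 5)` is ResNet-32, `(9, 9, 9)` is ResNet-56, and `(18, 18, 18)` is ResNet-110.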
timm is recommended for image classification training in general, and it is required by the training script provided in this repository:

```bash
./dist_classification.sh $NUM_GPUS -c $CONFIG_FILE /path/to/dataset
```
You can use the training configurations provided in `configs/`. For example, to train ResNet-32 on CIFAR-10 with 8 GPUs:

```bash
./dist_classification.sh 8 -c configs/cifar10_resnet32.yml --model resnet32 /path/to/cifar10
```

or, without the distributed launcher:

```bash
python train.py -c configs/datasets/cifar10_resnet32.yml --model resnet32 /path/to/cifar10
```
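If `src/resnet.py` registers these models with timm (this sketch assumes it uses timm's standard `register_model` mechanism, since `--model resnet32` must resolve through timm), they can also be instantiated directly from Python. `timm.create_model` is standard timm API; the `resnet32` name comes from this repo, not from timm itself:

```python
import timm
import src.resnet  # assumed: importing this module registers the CIFAR ResNets with timm

# num_classes is forwarded to the model constructor by timm.create_model
model = timm.create_model('resnet32', num_classes=10)
print(sum(p.numel() for p in model.parameters()))  # ~0.46M parameters for ResNet-32
```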
- Models are defined in `src/resnet.py`. You can choose ResNet-20/32/44/56/110/1202 on CIFAR-10 by simply changing the model flag in the training command, e.g. `--model resnet56`.
- Hyper-parameters are set in `configs/datasets/cifar10_resnet32.yml` and `train.py`; a sketch of such a config file follows the table below.
| Hyper-parameter | Value |
| --- | --- |
| optimizer | sgd |
| LR scheduler | multistep (decay milestones = [100, 150]) |
| warmup epochs | 10 (warmup LR = 1e-5) |
| label smoothing | 0.1 |
| batch size | 128 |
| weight decay | 1e-4 |
| momentum | 0.9 |
| initial learning rate | 0.1 |
| normalization mean | [0.485, 0.456, 0.406] |
| normalization std | [0.229, 0.224, 0.225] |
| epochs | 200 |
| dropout | none |
| data augmentation | RandomHorizontalFlip, RandomCrop, RandomErasing |
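For orientation, a config file encoding the table above might look like the sketch below. This is a hypothetical reconstruction: the key names mirror timm's `train.py` arguments and can differ across timm versions (e.g., `decay_milestones` vs. `decay_epochs`), so treat the actual `configs/datasets/cifar10_resnet32.yml` in this repo as the source of truth:

```yaml
# Hypothetical sketch of configs/datasets/cifar10_resnet32.yml;
# keys mirror timm train.py arguments and may vary by timm version.
num_classes: 10
epochs: 200
batch_size: 128
opt: sgd
momentum: 0.9
weight_decay: 1.0e-4
lr: 0.1
sched: multistep
decay_milestones: [100, 150]
warmup_epochs: 10
warmup_lr: 1.0e-5
smoothing: 0.1                # label smoothing
mean: [0.485, 0.456, 0.406]
std: [0.229, 0.224, 0.225]
hflip: 0.5                    # RandomHorizontalFlip
reprob: 0.25                  # RandomErasing probability (value assumed)
```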
If you find this repository useful, please consider citing the following works:

```bibtex
@article{hassani2021escaping,
  title         = {Escaping the Big Data Paradigm with Compact Transformers},
  author        = {Ali Hassani and Steven Walton and Nikhil Shah and Abulikemu Abuduweili and Jiachen Li and Humphrey Shi},
  year          = {2021},
  url           = {https://arxiv.org/abs/2104.05704},
  eprint        = {2104.05704},
  archiveprefix = {arXiv},
  primaryclass  = {cs.CV}
}

@misc{Idelbayev18a,
  author       = {Yerlan Idelbayev},
  title        = {Proper {ResNet} Implementation for {CIFAR10/CIFAR100} in {PyTorch}},
  howpublished = {\url{https://github.com/akamaster/pytorch_resnet_cifar10}},
  note         = {Accessed: 20xx-xx-xx}
}
```