BigVision-PyTorch

A PyTorch implementation of the Google Research "Big Vision" ViT. It is a lightweight implementation that replicates the results of "Better plain ViT baselines for ImageNet-1k" (https://arxiv.org/abs/2205.01580).

Start a training job using train.sh, or the following command:

python3 -m torch.distributed.launch --nproc_per_node=8 train.py --model_type ViT-S_16 \
   --mixup \
   --output_dir ./exp/EXPERIMENT_NAME
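
The launcher in the command above starts one process per GPU and tells each process its local rank. It does this either through a `--local_rank` / `--local-rank` command-line argument or through the `LOCAL_RANK` environment variable, depending on the PyTorch version and launcher options. How train.py actually reads this value is not shown here, so the following is only a minimal sketch of one way a script could accept both forms. The helper name `resolve_local_rank` is hypothetical and not part of this repository.

```python
import argparse
import os


def resolve_local_rank(argv=None, environ=None):
    """Return this process's GPU index on the current node.

    Hypothetical helper for illustration; not taken from train.py.
    """
    environ = os.environ if environ is None else environ

    parser = argparse.ArgumentParser(add_help=False)
    # The launcher may spell the flag with an underscore or a hyphen,
    # so both spellings are accepted.
    parser.add_argument("--local_rank", "--local-rank",
                        dest="local_rank", type=int, default=None)
    # parse_known_args ignores the script's other flags (e.g. --mixup).
    args, _unknown = parser.parse_known_args(argv)

    if args.local_rank is not None:
        return args.local_rank
    # Fall back to the environment variable; default to 0 for a
    # single-process run.
    return int(environ.get("LOCAL_RANK", 0))
```

A script would typically call this once at startup and use the result to pick its CUDA device before initializing the process group.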

Make sure to download ImageNet2012 and extract the non-TFDS version. Here's a reference script. Set the dataset directories in data_utils.py.

Note: There are known discrepancies in how weight decay behaves in PyTorch vs. JAX/TensorFlow. If you are unable to replicate results, feel free to open an issue.
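
One common source of such discrepancies is whether the weight-decay term is multiplied by the learning rate. `torch.optim.AdamW` shrinks each parameter by a factor of `1 - lr * weight_decay` per step. The sketch below assumes, for illustration only, that the reference setup applies the decay without the learning-rate factor. This README does not state which convention is the cause, and the function names and numeric values are hypothetical.

```python
import math


def adamw_shrink_factor(lr, weight_decay):
    """Per-step parameter shrinkage applied by torch.optim.AdamW."""
    return 1.0 - lr * weight_decay


def independent_shrink_factor(weight_decay):
    """Per-step shrinkage if decay is NOT scaled by the learning rate.

    Assumed convention for the reference setup; illustrative only.
    """
    return 1.0 - weight_decay


def to_torch_weight_decay(independent_wd, lr):
    """Value to pass to AdamW so both conventions shrink equally at this lr."""
    return independent_wd / lr


# Illustrative values, not taken from this repository's configuration.
lr, wd = 1e-3, 1e-4
torch_wd = to_torch_weight_decay(wd, lr)

# Both conventions now apply the same shrinkage at this learning rate.
assert math.isclose(adamw_shrink_factor(lr, torch_wd),
                    independent_shrink_factor(wd))
```

The conversion only matches exactly at the learning rate used in the division. With a learning-rate schedule, the two conventions can still diverge over the course of training.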

Repository contents:

models
utils
.gitignore
README.md
requirements.txt
train.py
train.sh

karan-dalal/BigVision-PyTorch

Folders and files

Latest commit

History

Repository files navigation

BigVision-PyTorch

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages