Skip to content

karan-dalal/BigVision-PyTorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BigVision-PyTorch

PyTorch Implementation of Google Research "Big Vision" ViT. Lightweight implementation that replicates results from "Better plain ViT baselines for ImageNet-1k" - https://arxiv.org/abs/2205.01580.

Start a training job using train.sh, or the following command:

python3 -m torch.distributed.launch --nproc_per_node=8 train.py --model_type ViT-S_16 \
   --mixup \
   --output_dir ./exp/EXPERIMENT_NAME

Make sure to download ImageNet2012 and extract the non-TFDS version. Here's a reference script. Set the dataset directories in data_utils.py.

Note: There have known to be some discrepencies with weight decay in PyTorch vs. JAX/TensorFlow. If you are unable to replicate results, feel free to open an issue.

About

PyTorch Implementation of Google Research "Big Vision" ViT

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published