This is the official implementation for the paper:
Optimized Supernet Formation: Transforming Pretrained Models for Efficient On-device Inference
- [11/07/2023] High-level API for edge
- [12/04/2023] APIs for Segment Anything (SAM)
- [02/01/2024] ViT-base supernet checkpoints pushed to huggingface model hub
- [02/01/2024] Hands-on tutorial for quickly converting a given pre-trained model to supernet
- [03/28/2024] Update examples for Mamba, SAM, Swin, and CLIP. Released checkpoints.
First, create a conda environment, then install PyTorch.
conda create -n ofm python=3.10
conda activate ofm
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
Next, install the OFM package
cd OFM/
pip install .
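To verify the installation, a quick check like the following should run without errors (a minimal sketch; it only assumes the package is importable as ofm, as used in the examples below):

import torch
import ofm  # the package installed above; used below as `from ofm import OFM`

# Optional sanity check: PyTorch imports and CUDA is visible (if you installed the GPU build).
print(torch.__version__, torch.cuda.is_available())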
OFM models a given pre-trained model as a supernet. With its efficient parallel fine-tuning process, OFM transforms the target pre-trained model into a supernet that can be quickly specialized to a wide range of resource constraints.
To validate the results reported in our paper, we provide several trained supernet checkpoints. They have been pushed to anonymous Hugging Face model hub repositories, which you can find at the following links:
- Super-Swinv2-base for CIFAR-10
- Super-Swinv2-base for CIFAR-100
- Super-CLIP-base for CIFAR-10
- Super-CLIP-base for CIFAR-100
- Super-Mamba-1.4B
- Super-ViT-Base for ImageNet
- Super-ViT-Base for CIFAR-100
- Super-ViT-Base for CIFAR-10
You do not need to download the checkpoint files manually; you can load them directly from the Hugging Face model hub by repository name, as shown in the following section.
We provide detailed instructions and a hands-on tutorial for validating our zero-shot downsized models.
In addition, we provide a high-level API that lets you quickly generate subnets from your supernet with just a few lines of code, as shown in the following example:
from transformers import AutoModelForImageClassification
from ofm import OFM

# Load a trained supernet checkpoint from the Hugging Face model hub
ckpt_path = "ckpts_repo_name"  # copy the Hugging Face model hub repo name from the links above
model = AutoModelForImageClassification.from_pretrained(
    ckpt_path,
    num_labels=10,
    ignore_mismatched_sizes=True,
)
supernet = OFM(model.to("cpu"))
print("Original FM number of parameters:", supernet.total_params)

# Randomly sample a downsized FM
ds_model, params, config = supernet.random_resource_aware_model()
print("Subnetwork params:", params)
OFM, with its mini-shard training strategy, can quickly and efficiently convert a pre-trained model into a supernet. For instance, you can train a super-ViT on CIFAR-100 with the following command:
python3 scripts/train_img_classification.py --model vit \
--save_dir ckpts/cifar100 \
--dataset cifar100 \
--num_shards 30 \
--lr 1e-5 \
--batch_size 224 \
--elastic_config scripts/elastic_space.json \
--spp \
--log_interval 100
To check the results, you can:
- Check the output information from the terminal console
- Use tensorboard:
tensorboard --logdir log/vit
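After training, you can load the supernet back from --save_dir and sample subnets with the same high-level API shown above. The sketch below assumes the script writes a Hugging Face-format checkpoint under the save directory; adjust the path to match your actual output layout:

from transformers import AutoModelForImageClassification
from ofm import OFM

# Sketch only: assumes a Hugging Face-format checkpoint was written under --save_dir.
model = AutoModelForImageClassification.from_pretrained("ckpts/cifar100")
supernet = OFM(model.to("cpu"))
ds_model, params, config = supernet.random_resource_aware_model()
print("Sampled subnetwork params:", params)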
Before you start, you have to be granted access to the ImageNet dataset. You can request and download the dataset from here.
Set the argument --huggingface_token to your Hugging Face token, which must have been granted access to the ImageNet dataset.
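Depending on how the script fetches imagenet-1k, a one-time login through huggingface_hub may also work instead of passing the token to every run (the --huggingface_token flag below remains the documented path):

from huggingface_hub import login

# Assumption: a cached login can be picked up when downloading gated datasets
# such as imagenet-1k; the documented path is still the --huggingface_token flag.
login(token="your-token-here")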
python3 scripts/train_img_classification.py --model vit \
--save_dir 'your_dir' \
--dataset imagenet-1k \
--num_shards 500 \
--lr 2e-5 \
--batch_size 152 \
--log_interval 500 \
--huggingface_token "your-token-here" \
--elastic_config scripts/elastic_space.json
If you have multiple GPUs, you can use the following command to train the super-FM with distributed training:
torchrun --nproc_per_node='your number of gpus' --nnodes=1 scripts/dist_train_img_classification.py --model vit \
--save_dir 'your_dir' \
--dataset imagenet-1k \
--num_shards 500 \
--lr 2e-5 \
--batch_size 152 \
--log_interval 500 \
--huggingface_token "your-token-here" \
--elastic_config scripts/elastic_space.json
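To pick the value for --nproc_per_node, you can first check how many GPUs are visible (a small helper snippet, not part of the repo's scripts):

import torch

# Prints the number of visible CUDA devices; pass this value to --nproc_per_node.
print(torch.cuda.device_count())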
[Note]: More APIs and scripts will be posted; please check the Updates section.
- ViT
- BERT
- RoBERTa
- DistilBERT
- Flan-T5
- SAM
- Mamba SSM
- LLaMA-7B (deprecated after commit ea6815b7162494667edb9dcd32f554346f07401b)
anonymous
If you find our work helpful, please support our efforts by citing our paper:
under review
The experiments in this work were sponsored by [anonymous institution] and [anonymous institution].