Training instruction for HQ-SAM

Segment Anything in High Quality
Lei Ke, Mingqiao Ye, Martin Danelljan, Yifan Liu, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu
ETH Zurich & HKUST

We organize the training folder as follows.

train
|____data
|____pretrained_checkpoint
|____train.py
|____utils
| |____dataloader.py
| |____misc.py
| |____loss_mask.py
|____segment_anything_training
|____work_dirs

1. Data Preparation

HQSeg-44K can be downloaded from hugging face link

Expected dataset structure for HQSeg-44K

data
|____DIS5K
|____cascade_psp
| |____DUTS-TE
| |____DUTS-TR
| |____ecssd
| |____fss_all
| |____MSRA_10K
|____thin_object_detection
| |____COIFT
| |____HRSOD
| |____ThinObject5K

2. Init Checkpoint

Init checkpoint can be downloaded from hugging face link

Expected checkpoint

pretrained_checkpoint
|____sam_vit_b_maskdecoder.pth
|____sam_vit_b_01ec64.pth
|____sam_vit_l_maskdecoder.pth
|____sam_vit_l_0b3195.pth
|____sam_vit_h_maskdecoder.pth
|____sam_vit_h_4b8939.pth

3. Training

To train HQ-SAM on HQSeg-44K dataset

python -m torch.distributed.launch --nproc_per_node=<num_gpus> train.py --checkpoint <path/to/checkpoint> --model-type <model_type> --output <path/to/output>

Example HQ-SAM-L training script

python -m torch.distributed.launch --nproc_per_node=8 train.py --checkpoint ./pretrained_checkpoint/sam_vit_l_0b3195.pth --model-type vit_l --output work_dirs/hq_sam_l

Example HQ-SAM-B training script

python -m torch.distributed.launch --nproc_per_node=8 train.py --checkpoint ./pretrained_checkpoint/sam_vit_b_01ec64.pth --model-type vit_b --output work_dirs/hq_sam_b

Example HQ-SAM-H training script

python -m torch.distributed.launch --nproc_per_node=8 train.py --checkpoint ./pretrained_checkpoint/sam_vit_h_4b8939.pth --model-type vit_h --output work_dirs/hq_sam_h

4. Evaluation

To evaluate on 4 HQ-datasets

python -m torch.distributed.launch --nproc_per_node=<num_gpus> train.py --checkpoint <path/to/checkpoint> --model-type <model_type> --output <path/to/output> --eval --restore-model <path/to/training_checkpoint>

Example HQ-SAM-L evaluation script

python -m torch.distributed.launch --nproc_per_node=1 train.py --checkpoint ./pretrained_checkpoint/sam_vit_l_0b3195.pth --model-type vit_l --output work_dirs/hq_sam_l --eval --restore-model work_dirs/hq_sam_l/epoch_11.pth

Example HQ-SAM-L visualization script

python -m torch.distributed.launch --nproc_per_node=1 train.py --checkpoint ./pretrained_checkpoint/sam_vit_l_0b3195.pth --model-type vit_l --output work_dirs/hq_sam_l --eval --restore-model work_dirs/hq_sam_l/epoch_11.pth --visualize

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Training instruction for HQ-SAM

1. Data Preparation

Expected dataset structure for HQSeg-44K

2. Init Checkpoint

Expected checkpoint

3. Training

Example HQ-SAM-L training script

Example HQ-SAM-B training script

Example HQ-SAM-H training script

4. Evaluation

Example HQ-SAM-L evaluation script

Example HQ-SAM-L visualization script

Files

README.md

Latest commit

History

README.md

File metadata and controls

Training instruction for HQ-SAM

1. Data Preparation

Expected dataset structure for HQSeg-44K

2. Init Checkpoint

Expected checkpoint

3. Training

Example HQ-SAM-L training script

Example HQ-SAM-B training script

Example HQ-SAM-H training script

4. Evaluation

Example HQ-SAM-L evaluation script

Example HQ-SAM-L visualization script