LoRA for SemanticSegmentation-domain-adaptation - CVC Internship

Institution: Computer Vision Center (CVC) Internship Period: January 24 - June 24

Project Overview

In this project, we focus on domain adaptation for semantic segmentation using the LoRA (Low-Rank Adaptation) technique. The primary objective is to improve the performance of semantic segmentation models when transferring knowledge from a synthetic dataset to a real-world dataset.

We use the SegFormer-B0 model, a state-of-the-art transformer-based architecture designed for efficient and accurate semantic segmentation. The project consists of two main phases:

Pretraining on GTA5 Dataset:
- Dataset: GTA5
- Labels: Cityscapes
- Description: We pretrain the SegFormer-B0 model on the GTA5 dataset, which provides synthetic images with labels compatible with the Cityscapes dataset. This phase aims to leverage the large-scale synthetic data to learn robust feature representations for semantic segmentation.
Domain Adaptation on Cityscapes Dataset:
- Dataset: Cityscapes
- Technique: Low-Rank Adaptation (LoRA)
- Description: After pretraining, we apply the LoRA technique to adapt the pretrained model to the Cityscapes dataset, which consists of real-world urban street scenes. The LoRA method enables efficient fine-tuning by introducing low-rank updates, reducing the number of trainable parameters and enhancing generalization capabilities.

This approach allows us to utilize the vast and diverse synthetic data from GTA5 to build a strong foundation for the model, followed by domain adaptation to bridge the gap between synthetic and real-world data. The expected outcome is a robust semantic segmentation model that performs well on real-world data with improved accuracy and efficiency.

By combining SegFormer-B0 with the LoRA technique, this project aims to advance the field of domain adaptation in semantic segmentation, providing valuable insights and methodologies for future research and applications.

Content

Main Scripts

gta_segformer_trainer: Trains the SegFormer-B0 on the GTA dataset and stores a checkpoint to the trained model.
lora_gta_to_cityscapes: Takes a previously pretrained SegFormer-B0 on GTA and performs LoRA domain adaptation to a Cityscapes dataset.
evaluation: Performs several model evaluations:
- Model: SegFormer-B0 trained on GTA - Evaluation Dataset: GTA (mIoU: 36.57%)
- Model: SegFormer-B0 trained on GTA - Evaluation Dataset: Cityscapes (mIoU: 8.24%)
- Model: SegFormer-B0 LORA trained on first GTA and then Cityscapes - Evaluation Dataset: Cityscapes (mIoU: 23.39%)

Secondary Scripts

cityscapes_dataset_creator: Given raw data, creates a Cityscapes dataset for further use.
gta_dataset_creator: Given raw data, creates a batched GTA dataset for further use.
cityscapes_dataset_creator: Given a batched GTA dataset, loads it for further use.

Models

The mentioned models can be found at:

Segformer pretrained on GTA
Segformer after LoRA to Cityscapes
Demo: Feel free to check the effect of LoRA in real time!

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
scripts		scripts
useful_scripts		useful_scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LoRA for SemanticSegmentation-domain-adaptation - CVC Internship

Project Overview

Content

Main Scripts

Secondary Scripts

Models

About

Releases

Packages

Languages

guimCC/LoRA-for-SemanticSegmentation-domain-adaptation

Folders and files

Latest commit

History

Repository files navigation

LoRA for SemanticSegmentation-domain-adaptation - CVC Internship

Project Overview

Content

Main Scripts

Secondary Scripts

Models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages