Source code for our paper GASAnet: Generative Adversarial and Self Attention Based Fine-Grained Cross-Modal Retrieval
Requirement
- PyTorch, tested on v1.0
- CUDA, tested on v9.0
- Language: Python 3.6

The code is currently tested only on GPU.
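As a quick sanity check before training (this snippet is not part of the repository), you can verify the installed PyTorch version and that a CUDA GPU is visible:

```python
# Environment sanity check (not part of the repository's code): confirms the
# PyTorch version and that a CUDA-capable GPU is visible, since the code is
# only tested on GPU.
import torch

print("PyTorch version:", torch.__version__)         # tested with v1.0
print("CUDA available:", torch.cuda.is_available())  # should print True
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```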
Download dataset
Please visit the dataset page to download it.

Prepare audio data
Run the script `python audio.py` to prepare the audio data.
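`audio.py` performs the audio preparation; its internals are not documented here. Purely as a hypothetical illustration of this kind of step, the sketch below converts raw `.wav` clips into log-mel spectrograms with librosa. The folder names, sampling rate, and feature choice are assumptions, not the actual behavior of `audio.py`.

```python
# Hypothetical audio-preparation sketch (NOT the actual audio.py): converts
# .wav clips into log-mel spectrograms and stores them as .npy files.
import os
import numpy as np
import librosa

AUDIO_DIR = "dataset/audio"          # assumed location of raw audio clips
FEATURE_DIR = "dataset/audio_feat"   # assumed output folder for features
os.makedirs(FEATURE_DIR, exist_ok=True)

for fname in os.listdir(AUDIO_DIR):
    if not fname.endswith(".wav"):
        continue
    y, sr = librosa.load(os.path.join(AUDIO_DIR, fname), sr=22050)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128)
    log_mel = librosa.power_to_db(mel, ref=np.max)
    np.save(os.path.join(FEATURE_DIR, fname.replace(".wav", ".npy")), log_mel)
```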
Training
- If you want to train the whole model from scratch using the source code, please follow the steps below.
  - Download the dataset to the `dataset` folder.
  - In `main_gan_lstm_resnet.py`:
    - modify `lr` in `params1` to `0.001`, and `lr` in `params2` and `lr` in `discriminator` to `1` (a parameter-group sketch is shown after this list);
    - modify `model_path` to the path where you want to save the trained network parameters.
  - Activate the virtual environment (e.g. conda) and then run the script `python main_gan_lstm_resnet.py`.
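The learning rates above are most naturally set through PyTorch optimizer parameter groups. The sketch below shows how such per-group rates can be wired up; the names `params1`, `params2`, and `discriminator` follow the instructions above, but the module split and optimizer choice are assumptions, not taken from `main_gan_lstm_resnet.py`.

```python
# Sketch of per-group learning rates in PyTorch (assumed structure, not copied
# from main_gan_lstm_resnet.py). The group names mirror the README instructions.
import torch
import torch.nn as nn

# Hypothetical stand-ins for the real sub-networks.
backbone = nn.Linear(512, 256)       # e.g. pretrained branch -> small lr
new_layers = nn.Linear(256, 200)     # e.g. newly added layers -> large lr
discriminator = nn.Linear(256, 2)    # modality discriminator

params1 = list(backbone.parameters())
params2 = list(new_layers.parameters())

optimizer = torch.optim.SGD(
    [
        {"params": params1, "lr": 0.001},                  # lr in params1
        {"params": params2, "lr": 1},                      # lr in params2
        {"params": discriminator.parameters(), "lr": 1},   # lr in discriminator
    ],
    momentum=0.9,
)
```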
Testing
- If you just want to run a quick test of the model and check the final retrieval performance, please follow the steps below.
  - The trained models of our work can be downloaded from Baidu Cloud; the extraction code is v99c.
  - Activate the virtual environment (e.g. conda) and then run the script `python test_gan.py` (a checkpoint-loading sketch is shown after this list).
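For reference, a minimal sketch of restoring saved parameters before evaluation is shown below; `model_path` points at the downloaded checkpoint, and the `GASANet` class here is only a placeholder. `test_gan.py` may organize loading and retrieval evaluation differently.

```python
# Hypothetical checkpoint-loading sketch (test_gan.py may differ).
import torch
import torch.nn as nn

class GASANet(nn.Module):            # placeholder for the actual network definition
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(512, 200)

    def forward(self, x):
        return self.fc(x)

model_path = "models/gasanet.pth"    # assumed path of the downloaded parameters
model = GASANet().cuda()
model.load_state_dict(torch.load(model_path))
model.eval()                          # evaluation mode before computing retrieval scores
```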