The Tacotron2 network is used as the main synthesis engine in the SOVA-TTS project. We took NVIDIA's implementation as a base, added various improvements described in the literature, and made the code more user-friendly.
Key differences:
- a GST (Global Style Tokens) module is added;
- Mutual Information Estimator is added (based on the following article and repo);
- an option to include attention loss in the training process is added (using diagonal or prealigned guidance; see the sketch after this list);
- Some work has been done to improve the usability of the code;
- Other minor changes and additions.
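As a rough illustration of the diagonal guidance mentioned above, here is a minimal sketch of a guided attention loss in the spirit of Tachibana et al.; the function name, tensor layout, and sigma default are assumptions, not the repository's actual code.

```python
import torch

def diagonal_guided_attention_loss(alignments, text_lengths, mel_lengths, sigma=0.2):
    """Penalize attention mass that strays from the diagonal.

    Hedged sketch, not the SOVA-TTS implementation.
    alignments:   (batch, mel_steps, text_steps) attention weights
    text_lengths: (batch,) valid encoder lengths
    mel_lengths:  (batch,) valid decoder lengths
    """
    batch = alignments.size(0)
    loss = alignments.new_zeros(())
    for b in range(batch):
        T, N = int(mel_lengths[b]), int(text_lengths[b])
        # Normalized decoder positions t/T (column) and encoder positions
        # n/N (row); broadcasting yields a (T, N) grid.
        t = torch.arange(T, device=alignments.device).float().unsqueeze(1) / T
        n = torch.arange(N, device=alignments.device).float().unsqueeze(0) / N
        # The penalty grows with distance from the diagonal t/T == n/N.
        w = 1.0 - torch.exp(-((n - t) ** 2) / (2 * sigma ** 2))
        loss = loss + (alignments[b, :T, :N] * w).mean()
    return loss / batch
```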
First of all, you need to install all dependencies (listed in requirements.txt) and convert the dataset to the LJ Speech format, where each line contains the relative path to an audio file and its text, separated by the "|" sign, e.g.:
wavs/000000.wav|С трев+ожным ч+увством бер+усь я з+а пер+о.
Then divide it into two files: the training list (90% of the data) and the validation list (10% of the data).
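If you do not already have a splitting script, a minimal sketch like the following will do (the file names metadata.txt, train.txt, and val.txt are placeholders):

```python
import random

# Placeholder file names; point them at your actual filelist.
with open("metadata.txt", encoding="utf-8") as f:
    lines = [line for line in f if line.strip()]

random.seed(42)  # fixed seed so the split is reproducible
random.shuffle(lines)
cut = int(0.9 * len(lines))

with open("train.txt", "w", encoding="utf-8") as f:
    f.writelines(lines[:cut])
with open("val.txt", "w", encoding="utf-8") as f:
    f.writelines(lines[cut:])
```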
After that, configure the config file as needed (here you can find an explanation of the main fields of the config file), or just use the default one, filling in the values of the parameters output_dir (where to save checkpoints), training_files (path to the training list), validation_files (path to the validation list), and audios_path (path to the audio folder, so that together with the relative path to the audio the full path is obtained).
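As an example, these four fields in hparams.yaml could look as follows (the paths are placeholders, and the rest of the real config is omitted):

```yaml
output_dir: /path/to/checkpoints      # where to save checkpoints
training_files: filelists/train.txt   # path to the training list
validation_files: filelists/val.txt   # path to the validation list
audios_path: /path/to/dataset         # joined with the relative wav paths
```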
When everything is ready, launch the training process:
- if you changed hparams.yaml inside the 'data' folder:
python train.py
- if you use some other config file:
python train.py -p path/to/hparams.yaml