Looking for help and hints to train Tortoise model #569
TheBlueSky asked this question in Q&A
How can we train a model for Portuguese? I have a source with more than 50,000 hours of clean, very high-quality audio.
Hi, I'm still learning and curious about this. Since fine-tuning Tortoise is the same as training in the implementations that are available, wouldn't it work just as well to fine-tune Tortoise on this data? They already use Whisper for transcription and dataset generation. I am referring to the mrq repo, by the way.
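To make the dataset-generation step concrete, here is a minimal sketch of turning transcripts into a training manifest. It assumes the transcripts have already been produced (for example by Whisper) and that the trainer expects one `path|text` line per clip; that layout, the function name `build_manifest`, and the file paths are assumptions for illustration, not something confirmed in this thread.

```python
def build_manifest(transcripts: dict[str, str]) -> list[str]:
    """Turn {audio_path: transcript} into 'path|text' lines.

    Assumption: the training tool reads a pipe-separated manifest.
    Check the format your repo actually expects before using this.
    """
    lines = []
    for path, text in sorted(transcripts.items()):
        # The pipe is the field separator, so it must not appear in the text.
        cleaned = text.replace("|", " ")
        # Collapse runs of whitespace and newlines into single spaces.
        cleaned = " ".join(cleaned.split())
        if not cleaned:
            continue  # skip clips with an empty transcript
        lines.append(f"{path}|{cleaned}")
    return lines


# Hypothetical transcripts keyed by clip path.
example = {
    "audio/0002.wav": "  Boa   tarde. ",
    "audio/0001.wav": "Olá, mundo!",
    "audio/0003.wav": "   ",
}
manifest = build_manifest(example)
print("\n".join(manifest))
```

Sorting the paths keeps the manifest stable between runs, and dropping empty transcripts avoids feeding silent or failed clips to the trainer.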
First of all, I would like to thank James Betker, who started this project, and everyone who has contributed and continues to contribute to it, especially Manmay Nakhashi.
I would like to train a model so that I can use it for TTS in a non-Latin language. Many of the online tutorials are about fine-tuning the built-in model for specific voices, and I was unable to find anything that helps me train a model.
The closest I got is this discussion, but it only said, at a very high level, that many hours of audio and their transcripts are needed, which are the two things I already know are needed :)
What I am looking for is information on things like the tools, the datasets, whether I need a language-specific tokenizer, and so on.
I appreciate any help or hints that point me in the right direction.
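On the tokenizer question, one rough way to check is to measure how much of the target-language text falls outside the character set an existing tokenizer covers. The sketch below is only an illustration: `LATIN_ONLY` is a hypothetical stand-in vocabulary, not Tortoise's actual one, and `uncovered_characters` is a made-up helper name.

```python
from collections import Counter


def uncovered_characters(texts, vocab_chars):
    """Return (fraction of non-space characters not in vocab_chars, {char: count})."""
    counts = Counter(
        ch for text in texts for ch in text.lower() if not ch.isspace()
    )
    missing = {ch: n for ch, n in counts.items() if ch not in vocab_chars}
    total = sum(counts.values())
    rate = sum(missing.values()) / total if total else 0.0
    return rate, missing


# Hypothetical stand-in for a Latin-script vocabulary (an assumption).
LATIN_ONLY = set("abcdefghijklmnopqrstuvwxyz'.,!?-")

# One Latin-script sample and one Greek sample.
rate, missing = uncovered_characters(["hello", "γεια"], LATIN_ONLY)
print(f"uncovered rate: {rate:.2f}, missing characters: {sorted(missing)}")
```

If most of the transcript characters come back as missing, that suggests the existing vocabulary cannot represent the language and a tokenizer trained on the target-language text would be needed.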