Audio format in dataset files #17

r666ay · 2024-08-01T13:00:44Z

Thanks for your great work on implementing FACodec!
I noticed that the data file at https://github.com/Plachtaa/FAcodec/blob/master/data/val.txt contains labels such as speaker ID and phonemes. How can I obtain these labels? Are they auto-generated during training?

Plachtaa · 2024-08-01T13:28:37Z

That file comes from the VCTK dataset and was used by the legacy implementation. The current version in this repo does not require annotations. Auto-generated labels are not saved during training.

r666ay · 2024-08-01T13:32:28Z

> It was from VCTK dataset for legacy implementation. For the current version in this repo, annotation is not required. Auto-generated labels will not be saved during training process

Thanks for your reply. What models are used to generate these annotations? I want to export the auto-generated labels.
