Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audio format in dataset files #17

Open
r666ay opened this issue Aug 1, 2024 · 2 comments
Open

Audio format in dataset files #17

r666ay opened this issue Aug 1, 2024 · 2 comments

Comments

@r666ay
Copy link

r666ay commented Aug 1, 2024

Thanks for you great work on implementing FACodec!
I found the data file in https://github.com/Plachtaa/FAcodec/blob/master/data/val.txt has some labels, like speaker id, phonemes. How can I get these labels? Will these labels be auto-generated in the training process?

@Plachtaa
Copy link
Owner

Plachtaa commented Aug 1, 2024

It was from VCTK dataset for legacy implementation. For the current version in this repo, annotation is not required. Auto-generated labels will not be saved during training process

@r666ay
Copy link
Author

r666ay commented Aug 1, 2024

It was from VCTK dataset for legacy implementation. For the current version in this repo, annotation is not required. Auto-generated labels will not be saved during training process

Thanks for your reply. What models are used to generate these annotations? I want to export the auto-generated labels.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants