CLVP checkpoint? #1
Comments
Hi, any update on this? |
Hey there, CLVP is the same one that is used in github.com/tortoise-tts. I uploaded a copy of it here: https://huggingface.co/jbetker/tts-scores-clvp/tree/main |
Thanks a lot @neonbjb for such a quick answer. I got the CLVP model from your Hugging Face link, but it does not appear to be the one this code expects. I am getting the error below.
|
Hey there, |
Thanks @neonbjb , it now works perfectly.
|
Hey,
The CLVP model was trained at 22kHz, which is why I'm doing that
conversion. Is there a bug here I'm missing?
Absolutely submit a PR!
On Thu, Feb 2, 2023 at 11:55 AM Xavier Anguera wrote:
Thanks @neonbjb, it now works perfectly.
A couple of questions/comments:
- Question: For the CLVP and Frechet distances you resample the audio
to 22 kHz before computing the mel spectrogram from it, but for wav2vec
the audio needs to be at 16 kHz, since that is how the model was trained.
Is there any reason for the conversion to 22 kHz?
- Comment: fd and clvp/wav2vec have different parameter requirements.
If you're keen on it, I can send you a PR to standardize them.
James Betker
|
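For anyone following the sample-rate exchange above, here is a minimal sketch of the idea: each metric gets audio resampled to the rate its model was trained at. This is not the repository's code. `TARGET_RATES`, `resample_linear` and `prepare_for` are hypothetical names made up for illustration, and the exact value 22050 is an assumption, since the thread only says "22kHz". The resampler is plain linear interpolation with no anti-aliasing filter, chosen so the example has no dependencies; a real pipeline would more likely use a proper resampler such as `torchaudio.functional.resample`.

```python
# Target sample rates as described in the thread: CLVP and the Frechet
# distance use 22 kHz audio, wav2vec uses 16 kHz.
# ASSUMPTION: 22050 Hz stands in for "22kHz"; the thread gives no exact value.
TARGET_RATES = {"clvp": 22050, "frechet": 22050, "wav2vec": 16000}


def resample_linear(samples, src_rate, dst_rate):
    """Resample a mono signal by linear interpolation.

    Illustration only: there is no low-pass filter, so downsampling
    real audio this way can introduce aliasing.
    """
    if src_rate == dst_rate or not samples:
        return list(samples)
    # Integer arithmetic keeps the output length exact.
    n_out = len(samples) * dst_rate // src_rate
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate  # position in the source signal
        lo = int(pos)                  # sample at or before pos
        hi = min(lo + 1, last)         # next sample, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def prepare_for(metric, samples, src_rate):
    """Return `samples` resampled to the rate the given metric expects."""
    return resample_linear(samples, src_rate, TARGET_RATES[metric])


if __name__ == "__main__":
    one_second = [0.0] * 44100  # one second of silence at 44.1 kHz
    print(len(prepare_for("wav2vec", one_second, 44100)))  # 16000
    print(len(prepare_for("clvp", one_second, 44100)))     # 22050
```

Keeping the rates in one table also touches on the second point raised above: the metrics differ in what input they need, so a single lookup makes that difference explicit instead of spreading it across call sites.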
Hi, |
I recommend just removing the CLVP scores altogether. wav2vec Intelligibility has much better signal. |
Thanks for the great project! I think it can be super useful, and if some papers pick it up and show it works well, it could become the new FID for audio :)
Would it be possible to upload the CLVP checkpoint?
Thanks!