I ran the original-audio vs. generated-audio metric on songs generated with SVC. I am aware that this is not the intended use, but I was curious, since I am working toward a metric for singing voices that share the same lyrics. The metric returned an imaginary value. Can you give me some insight into what an imaginary value means in this case? Is something wrong with my audio files, or does the FD only work for TTS?