Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DiffSinger] Add support for tension and voicing #1045

Merged
merged 6 commits into from
Feb 25, 2024

Conversation

yqzhishen
Copy link
Contributor

@yqzhishen yqzhishen commented Feb 19, 2024

This PR adds support for DiffSinger tension as TENC and voicing as VOIC.

Usage

If the voicebank supports the two parameters, write in dsvariance/dsconfig.yaml:

predict_voicing: true
predict_tension: true

Also, write in dsconfig.yaml of the acoustic model:

use_voicing_embed: true
use_tension_embed: true

Related Changes

DiffSinger variance predictor now requires explicit declaration in dsvariance/dsconfig.yaml if it does NOT predict energy or breathiness:

predict_energy: false  # default is true
predict_breathiness: false  # default is true

Old voicebanks will continue to work as OpenUtau regards these two parameters as enabled without the declaration.

Notice

VOIC of OpenUtau has a default range of [0, 100] and a default value of 100, which allows decreasing but not increasing. However, DiffSinger voicing allows increasing technically. If you want to do so, please edit the maximum (or minimum too, if you would like) value of VOIC.

@yqzhishen yqzhishen marked this pull request as ready for review February 23, 2024 17:26
@stakira stakira merged commit 8ed13f8 into stakira:master Feb 25, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants