Add Japanese monophone G2P (tailored to AI voicebanks/phonemizers) + add support to Diffsinger Japanese Phonemizer #1147
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There has been demand for some sort of Japanese G2P implementation, so I decided to add one, specifically made for machine-learning phonemizers (it'll be basically useless for UTAU voicebanks/phonemizers, though I guess I won't stop anyone from messing around with that). This PR also adds G2P support to the DiffSinger Japanese Phonemizer (note that the old dicts will still work just fine; they'll just overwrite the G2P). This will make writing new dictionaries much easier. I was also partially inspired to do this by the recently added Korean G2P.
Note that the dictionaries were made for the standard Japanese dialect (e.g. no distinction between じ/ぢ and ず/づ), so you'll still need to override that in your custom dict.
Functionality
N
which is always voiced)) but with phoneme replacements this can be easily changed if a voicebank does not support it;SP
, "息" and "吸" areAP
). Exhales are currently not supported though I'm willing to add it if there's demand;kw
,gw
,ng
,ngy
).What it does not do
g2p.onnx
file). I did not consider this necessary due to the relative simplicity of the Japanese language;