Skip to content
RedBlackAka edited this page Dec 22, 2024 · 5 revisions

OpenUtau supports DiffSinger (maintained by OpenVPI), a machine learning based singing synthesizer.

Setup

Before using DiffSinger on OpenUtau, please download nsf_hifigan vocoder . Drag and drop to import it into OpenUtau.

Turn on "beta" in preferences and upgrade your OpenUtau, because some new voicebanks might use features not yet supported by the current stable build.

macOS 11 Big Sur or higher is required to use Diffsinger on macOS.

Preferences

  • Diffsinger Rendering speedup is 50 times by default. A smaller speedup may improve the quality of audio, but slow down the rendering.
  • OpenUtau uses CPU to render by default. If you use Windows and have a discrete graphics card, you can use DirectML to make rendering faster. Please set "Machine Learning Runner" to "DirectML" (or "CoreML" on macOS), choose your discrete graphics card in "GPU" menu, and restart your OpenUtau.

Expressions

Here are the expressions supported by DiffSinger:

  • PITD (pitch curve)
  • DYN (volume curve)
  • GENC (gender curve, need voicebank's support. See here for details. The default range -100~+100 equals to shifting the formant by +12~-12 semitones.)
  • VELC (velocity curve, need voicebank's support. See here for details. This expression will affect the speed of the head and tail of the vowels. Every increase of 100 in this expression will multiply the speed by 2.)

VELC is a custom expression defined by DiffSinger and isn't included in new projects. To add this expression into your project, click “Expreeions → Add all expressions suggested by renderers”

You can also adjust the range of each expression in this menu.

image