0.2
Changes
- The UI takes the whole width of the screen now and consists of two parts - the settings on the left and the generated sentences player/editor on the right.
- Preprocessing of long files has been significantly sped up through parallelisation. This led to a 3-4x time reduction.
- Introduced metadata: the ability to set the album title, the artist, the genre and upload a cover image.
- Added support for
.m4b
. - Added support for chapter detection (at the moment only for epub files) and chapter markers in m4b files (if you want to have the smallest file possible, use opus - it performs very well for speech even at 16k!)
- Small improvements in the training workflow (a folder with reference samples is automatically created in the tts_voices folder when training finishes) and fixes for the RVC workflow.
Pre-Installed Packages
You may download self-contained packages that only require unpacking from here. You don't have to install anything, all components are included in portable conda environments. You may install additional components at any time using the launcher. But please remember to update Pandrator from the launcher.
Installer
You may use the installer/launcher below, which was created from the pandrator_installer_launcher.py
file in the repository, or use the source file directly. Please remember to run the executable as an administrator. It's possible that Windows or your antivirus software will flag it as a threat. You may whitelist it, or, if you're not comfortable doing that, review the code in the repository and install Pandrator manually.