
keskival/wavenet_synth


An implementation and demonstration of Google WaveNet.

This application learns from a given sound file, and the resulting model can be used to synthesize sounds.

It is a work in progress and will be used at the Cybercom #HackingGreat event in November 2016.

At the time of writing, cuDNN does not offer GPU-accelerated functions for the strided (dilated) convolutions WaveNet needs, so we use a trick: the time dimension is folded into the batch dimension and back, so that an ordinary stride-1 convolution can run with GPU acceleration.
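A minimal NumPy sketch of that time-to-batch trick (the function names and the NumPy formulation here are illustrative assumptions, not code from this repository): every dilation-th time step is folded into its own batch entry, a plain stride-1 convolution is applied, and the result is folded back.

```python
import numpy as np

def time_to_batch(x, dilation):
    # x: [batch, time, channels]; assumes time is already padded to a
    # multiple of `dilation`.
    batch, time, channels = x.shape
    y = x.reshape(batch, time // dilation, dilation, channels)
    y = y.transpose(0, 2, 1, 3)  # [batch, dilation, time/dilation, channels]
    return y.reshape(batch * dilation, time // dilation, channels)

def batch_to_time(y, dilation):
    # Inverse of time_to_batch: interleave the folded subsequences back
    # into a single time axis.
    bd, time, channels = y.shape
    batch = bd // dilation
    x = y.reshape(batch, dilation, time, channels)
    x = x.transpose(0, 2, 1, 3)  # [batch, time, dilation, channels]
    return x.reshape(batch, time * dilation, channels)
```

A stride-1 convolution applied between these two calls sees samples that are `dilation` steps apart in the original signal, which is exactly a convolution dilated by that factor.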

The application takes corpus.wav as the training input and test.wav as the test-set file, and trains the WaveNet on that corpus, augmenting it with varying amplitude and noise distortions.
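To illustrate the kind of distortion meant here, a hedged sketch (the function name, gain range, and noise level are assumptions, not values from this repository):

```python
import numpy as np

def distort(waveform, rng=np.random):
    # Hypothetical augmentation: random amplitude scaling plus additive
    # Gaussian noise, clipped back to the valid sample range.
    gain = rng.uniform(0.5, 1.0)                    # assumed gain range
    noise = rng.normal(0.0, 0.005, waveform.shape)  # assumed noise level
    return np.clip(gain * waveform + noise, -1.0, 1.0)
```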

Running process.py trains a model and saves the best model found so far into the files sound-model-best and sound-model-best.meta.
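The .meta file suggests a TensorFlow 1.x-era Saver checkpoint; a minimal sketch of what such best-model saving typically looks like (the stand-in graph and the improvement check are assumptions, not this repository's code):

```python
import tensorflow as tf

# Stand-in variable; process.py defines the real WaveNet graph.
w = tf.Variable(tf.zeros([1]), name='w')
saver = tf.train.Saver()

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # Hypothetically, whenever the test loss improves:
    saver.save(sess, 'sound-model-best')  # also writes sound-model-best.meta
```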

Once you have a trained model, you can use it to generate random sounds by running generate.py. It saves the generated sound to the Octave file sound.mat and the probability distributions used during generation to image.mat. For generating very long samples, you may want to disable the image.mat output by commenting out the relevant parts of generate.py.
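If you want to produce or inspect such files yourself, scipy.io.savemat writes Octave-readable .mat files; a sketch with placeholder data (the variable names and shapes are assumptions; generate.py may write its output differently):

```python
import numpy as np
from scipy.io import savemat

sound = np.random.uniform(-1.0, 1.0, 16000)     # placeholder waveform
probs = np.random.dirichlet(np.ones(256), 100)  # placeholder per-step distributions
savemat('sound.mat', {'sound': sound})
savemat('image.mat', {'image': probs})
```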

You can plot the probability distributions and play intermediate training results in Octave using the script plot_training.m.

You can view the generated results and play the generated sound in Octave using the script plot_generation.m.
