Merge pull request #9 from Music-and-Culture-Technology-Lab/vocal-mix
Add note-level vocal transcription, with integration of vocal-contour module.
Showing 45 changed files with 2,288 additions and 245 deletions.
@@ -0,0 +1,54 @@
Vocal Transcription
===================


.. automodule:: omnizart.vocal


App
###
.. autoclass:: omnizart.vocal.app.VocalTranscription
    :members:
    :show-inheritance:


Dataset
#######
.. autoclass:: omnizart.vocal.app.VocalDatasetLoader
    :members:
    :show-inheritance:


Inference
#########
.. automodule:: omnizart.vocal.inference
    :members:


Labels
######
.. automodule:: omnizart.vocal.labels
    :members:
    :undoc-members:


Prediction
##########
.. automodule:: omnizart.vocal.prediction
    :members:
    :undoc-members:


Settings
########
Below are the default settings for building the vocal model. They are loaded
by the class :class:`omnizart.setting_loaders.VocalSettings`. Attribute names
are converted to snake case (e.g. ``HopSize`` -> ``hop_size``). The YAML paths
are also transformed when the settings are applied to the ``VocalSettings``
instance. For example, the attribute ``BatchSize`` defined at the YAML path
*General/Training/Settings/BatchSize* is accessed as
*VocalSettings.training.batch_size*. The */Settings* level is removed from
all fields.

.. literalinclude:: ../../../omnizart/defaults/vocal.yaml
    :language: yaml
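
The naming and path rules described in this section can be illustrated with a minimal Python sketch. It is not the code used by ``VocalSettings``; the helper names ``to_snake_case`` and ``yaml_path_to_attribute`` are hypothetical, and dropping the leading ``General`` level is an assumption inferred from the single example given above.

```python
import re


def to_snake_case(name):
    """Convert a CamelCase YAML key (e.g. ``HopSize``) to snake case."""
    # Insert "_" before every capital letter except the first, then lowercase.
    return re.sub(r"(?<!^)(?=[A-Z])", "_", name).lower()


def yaml_path_to_attribute(yaml_path, root="VocalSettings"):
    """Map a slash-separated YAML path to a dotted attribute path."""
    levels = yaml_path.strip("/").split("/")
    # Assumption: the leading "General" level does not appear in the
    # attribute path (inferred from the documented example only).
    if levels and levels[0] == "General":
        levels = levels[1:]
    # Every "Settings" level is removed, as stated in the text.
    kept = [level for level in levels if level != "Settings"]
    return ".".join([root] + [to_snake_case(level) for level in kept])


print(to_snake_case("HopSize"))
# -> hop_size
print(yaml_path_to_attribute("General/Training/Settings/BatchSize"))
# -> VocalSettings.training.batch_size
```

The sketch only derives attribute names from paths; it does not load the YAML file or build a settings object.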
@@ -0,0 +1,25 @@
omnizart vocal
==============

Lists the available options of each sub-command.


transcribe
##########

.. click:: omnizart.cli.vocal.transcribe:transcribe
    :prog: omnizart vocal transcribe


generate-feature
################

.. click:: omnizart.cli.vocal.generate_feature:generate_feature
    :prog: omnizart vocal generate-feature


train-model
###########

.. click:: omnizart.cli.vocal.train_model:train_model
    :prog: omnizart vocal train-model