-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
0 parents
commit 56f4c68
Showing
118 changed files
with
11,292 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
MAPS* | ||
conda* | ||
|
||
.vscode/* | ||
.ipynb_checkpoints/* | ||
*/__pycache__ | ||
__pycache__/* | ||
|
||
server.bat | ||
copy*.sh | ||
|
||
# linux | ||
.Xdefaults* | ||
.bash_history* | ||
.bash_logout* | ||
.bashrc* | ||
.cache/* | ||
.conda/* | ||
.config/* | ||
.containers/* | ||
.dockbarx/* | ||
.kde/* | ||
.keras/* | ||
.profile* | ||
.python_history* | ||
.ssh/* |
Binary file added
BIN
+1.14 MB
Avtomatska transkripcija klavirske glasbe s konvolucijskimi nevronskimi mrežami.pdf
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
# Bachelor thesis: Automatic music transcription of piano music with convolutional neural networks | ||
|
||
## Abstract | ||
|
||
In this thesis we explore the problem of automatic music transcription using deep neural networks, more specific convolutional neural networks. Automatic music transcription is a task of writing the sheet music from musical recordings. We analysed previous studies and found that there was a lack of research about the size and the shape of architecture of deep models. We explored the performance of four different architectures of convolutional neural networks on the piano recordings dataset MAPS, which is a common benchmark for learning automatic music transcription. We also compared two different normalization techniques for spectrograms: standardization and the logarithmic compression. We found out that the performance of transcription is highly correlated with the higher number of convolutional layers. Transcription is also 10% more successful with logarithmic compression instead of standardization. | ||
|
||
The thesis is in file [Avtomatska transkripcija klavirske glasbe s konvolucijskimi nevronskimi mrežami](Avtomatska transkripcija klavirske glasbe s konvolucijskimi nevronskimi mrežami.pdf). The presentation for the thesis defense is in file [zagovor](zagovor.pptx). | ||
|
||
## About this repository | ||
|
||
The bachelor thesis was written in LaTeX and is in folder [bachelorsThesis](bachelorsThesis). | ||
The main experiments were done in file [diploma.ipynb](diploma.ipynb) with [graphics.ipynb](graphics.ipynb) and [times.ipynb](times.ipynb) as side files. | ||
The code for running longer experiments and training is in folder [src](src). It can be run with `python main.py --help` for further instructions. | ||
Informational results of training are in folder [results](results). Folder [tt_files](tt_files) includes TrainTest split files. | ||
The trained models are in folder [cnnModels](cnnModels), which include earlier models, [expModels](expModels), which include trained models, and [hteModels](hteModels), which are models trained on transposed music. |
Oops, something went wrong.