Releases: Music-and-Culture-Technology-Lab/omnizart
Releases · Music-and-Culture-Technology-Lab/omnizart
Release version 0.5.0
Changelog
0.5.0 - 2021-12-09
Official Open JOSS reviewed version.
Bugs
- Fix bug of name conflict while loading chord model.
Release v0.4.2
Changelog
0.4.2 - 2021-11-16
Accumulated release. Various improvements and bug fix. See details below.
Feature
- Migrate checkpoints from private Google Drive to Github release.
See here - Replace opencv
Dependency
- Upgrade Tensorflow version to 2.5.0 for Nvidia 30 series GPU compatibility.
- Upgrade Spleeter version to 2.3.0 for new TF version compatibility.
- Replace Opencv with PIL for drum feature resizing and remove opencv from the dependency.
Enhancement
- Simplify model loading mechanism by unifying the all checkpoint format to use TF format.
- Lazy import extraction functions to boost loading time.
- Change the order of Dockerfile commands for better utilizing cache.
Documentation
- Add notice about compatibility issue of running on certain CPU architecture.
- Add explaination about enabling auto completion.
- Rephrase sentences in paper according to JOSS review feedback.
- Add explaination about installing development dependencies.
- Use pepy as the alternative source for 'download' state badge.
Bugs
- Fix bug of unable to find vocal contour checkpoint.
- Fix bug of fail to custom layers of chord module.
- Fix various unit tests bugs.
- Fix minor linter errors.
Checkpoints Storage Migration
Migrate the place for storing the checkpoints from Google Drive to Github. Also upload ONNX models when available.
UPDATES
- 2022/11/27 - Upload the soundfont file.
- 2022/03/07 - Upload the pre-trained weight file of chord_v2.
- 2022/03/06 - Upload model architecture file (.pb) of chord_v2.
- 2022/03/04 - Upload the data file (resource.zip) for unit tests.
Release v0.4.1
Changelog
0.4.1 - 2021-06-04
Hotfix version according to issue #23
Feature
- Add a new piano transcription model and set it as the default model while using
music
module.
Bugs
- Fix bug while parsing weight files in the checkpoint folder.
Release v0.4.0
Changelog
0.4.0 - 2021-05-31
Various improvements on music module and some critical bug fixes.
Enhancement
- Improve the peak finding and thresholding strategy for more stable and better performance.
- Modify the feeding strategy of feature slices with adjustable overlapping rate while making predictions.
- Apply learning rate scheduler for music module.
- Replace the usage of custom training loop of music module with the built-in TF
.fit()
function.
Bugs
- Fix a critical bug of inference of music module that would lead to missing onsets.
- Fix generation of pertubation of vocal module while training.
Documentation
- Merge the demo page into master from
build_doc
branch.
Release v0.3.4
Changelog
0.3.4 - 2021-05-10
Hotifx version according to issue #19.
Bugs
- Fix bug of treating numpy array as list while appending elements.
Release v0.3.3
Changelog
0.3.3 - 2021-05-07
Hotfix version according to issue #19.
Bugs
- Fix column inconsistency of
aggregate_f0_info
andwrite_agg_f0_results
. - Update version of dependencies according to the security alert.
Release v0.3.2
Changelog
0.3.2 - 2021-02-13
Enhancement
- Move
load_label
functions of different datasets into dataset structure classes. - Add custom exception on fail downloading GD file due to access limit.
- Add unit tests on parsing label files into shared intermediate format.
Bugs
- Fix wrong access name of the dict in vocal midi inference function.
- Fix bug of generating beat module training labels.
Release v0.3.0
Changelog
0.3.0 - 2021-01-17
Release the beat
module for symbolic domain beat transcription.
Features
- Release
beat
module. - Add an example
patch-cnn
module for demonstrating the implementation progress.
Enhancement
- Refactor the flow of chord module for parsing the feature and label files.
- Modularize F0 information aggragation functions to utils.py and io.py.
- Improve verbosity on fail to open hdf files.
Documentation
- Re-arrange the side bar with an additional group of CLI.
- Add custom CSS style for adjusting the width of audio and video elements.
Bugs
- Fix Spleeter import errors after upgrading to v2.1.2.
Release v0.2.0
Changelog
0.2.0 - 2020-12-13
Vocal melody transcription in both frame- and note-level are live!
We release the modules for vocal melody transcription after a decent amount of effort.
Now you can transcribe your favorite singing voice.
Features
- Release
vocal
andvocal-contour
submodules.
Enhancement
- Improve chord transcription results by filtering out chord predictions with short duration.
- Resolve the path for transcription output in a consistent way.
Documentation
- Re-organize Quick Start and Tutorial pages to improve accessibility.
- Move the section for development from README.md to CONTRIBUTING.md.
Bug Fix
- Fix bugs of passing the wrong parameter to vamp for chroma feature extraction.