Update torchaudio requirement from <=0.7.2 to <0.13.1 #514

dependabot · 2022-11-08T13:11:00Z

Updates the requirements on torchaudio to permit the latest version.

Release notes

torchaudio 0.13.0 Release Note

Highlights

TorchAudio 0.13.0 release includes:

Source separation models and pre-trained bundles (Hybrid Demucs, ConvTasNet)

New datasets and metadata mode for the SUPERB benchmark

Custom language model support for CTC beam search decoding

StreamWriter for audio and video encoding

[Beta] Source Separation Models and Bundles

Hybrid Demucs is a music source separation model that uses both spectrogram and time domain features. It has demonstrated state-of-the-art performance in the Sony Music DeMixing Challenge. (citation: https://arxiv.org/abs/2111.03600)

The TorchAudio v0.13 release includes the following features

MUSDB_HQ Dataset, which is used in Hybrid Demucs training (docs)

Hybrid Demucs model architecture (docs)

Three factory functions suitable for different sample rate ranges

Pre-trained pipelines (docs) and tutorial

SDR Results of pre-trained pipelines on MUSDB-HQ test set

Pipeline All Drums Bass Other Vocals

HDEMUCS_HIGH_MUSDB* 6.42 7.76 6.51 4.47 6.93

HDEMUCS_HIGH_MUSDB_PLUS** 9.37 11.38 10.53 7.24 8.32

* Trained on the training data of MUSDB-HQ dataset. ** Trained on both training and test sets of MUSDB-HQ and 150 extra songs from an internal database that were specifically produced for Meta.

Special thanks to @adefossez for the guidance.

ConvTasNet model architecture was added in TorchAudio 0.7.0. It is the first source separation model that outperforms the oracle ideal ratio mask. In this release, TorchAudio adds the pre-trained pipeline that is trained within TorchAudio on the Libri2Mix dataset. The pipeline achieves 15.6dB SDR improvement and 15.3dB Si-SNR improvement on the Libri2Mix test set.

[Beta] Datasets and Metadata Mode for SUPERB Benchmarks

With the addition of four new audio-related datasets, there is now support for all downstream tasks in version 1 of the SUPERB benchmark. Furthermore, these datasets support metadata mode through a get_metadata function, which enables faster dataset iteration or preprocessing without the need to load or store waveforms.

Datasets with metadata functionality:

LIBRISPEECH (docs)

LibriMix (docs)

QUESST14 (docs)

SPEECHCOMMANDS (docs)

(new) FluentSpeechCommands (docs)

(new) Snips (docs)

(new) IEMOCAP (docs)

(new) VoxCeleb1 (Identification, Verification)

[Beta] Custom Language Model support in CTC Beam Search Decoding

In release 0.12, TorchAudio released a CTC beam search decoder with KenLM language model support. This release, there is added functionality for creating custom Python language models that are compatible with the decoder, using the torchaudio.models.decoder.CTCDecoderLM wrapper.

[Beta] StreamWriter

torchaudio.io.StreamWriter is a class for encoding media including audio and video. This can handle a wide variety of codecs, chunk-by-chunk encoding and GPU encoding.

Backward-incompatible changes

[BC-breaking] Fix momentum in transforms.GriffinLim (#2568)

... (truncated)

Commits

bc8640b Fix doc in torchaudio.backend (#2781)
1b444d8 Add iemocap variants (#2778)
ee68a98 Update download path for speechcommands (#2777)
3b1d85d Add notes on file structure in Voxceleb1 based datasets (#2776)
9a013fd Add file_name to the returned item in Snips dataset (#2775)
88a8dd4 Update description of HDemucs pipelines (#2774)
b703a63 Update resampling tutorial (#2773)
fc6090e Fix leaking matplotlib figure (#2771)
55c695b Fix fading in hybrid demucs tutorial (#2769)
bd37611 Fix CTCDecoder doc (#2766)
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot merge will merge this PR after your CI passes on it
@dependabot squash and merge will squash and merge this PR after your CI passes on it
@dependabot cancel merge will cancel a previously requested merge and block automerging
@dependabot reopen will reopen this PR if it is closed
@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Updates the requirements on [torchaudio](https://github.com/pytorch/audio) to permit the latest version. - [Release notes](https://github.com/pytorch/audio/releases) - [Commits](pytorch/audio@v0.2.0...v0.13.0) --- updated-dependencies: - dependency-name: torchaudio dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]>

dependabot · 2022-12-16T13:08:13Z

Superseded by #526.

dependabot bot added the dependencies Pull requests that update a dependency file label Nov 8, 2022

dependabot bot assigned dciborow and mattchansky Nov 8, 2022

dependabot bot closed this Dec 16, 2022

dependabot bot deleted the dependabot/pip/torchaudio-lt-0.13.1 branch December 16, 2022 13:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update torchaudio requirement from <=0.7.2 to <0.13.1 #514

Update torchaudio requirement from <=0.7.2 to <0.13.1 #514

dependabot bot commented on behalf of github Nov 8, 2022

dependabot bot commented on behalf of github Dec 16, 2022

Pipeline	All	Drums	Bass	Other	Vocals
HDEMUCS_HIGH_MUSDB*	6.42	7.76	6.51	4.47	6.93
HDEMUCS_HIGH_MUSDB_PLUS**	9.37	11.38	10.53	7.24	8.32

Update torchaudio requirement from <=0.7.2 to <0.13.1 #514

Update torchaudio requirement from <=0.7.2 to <0.13.1 #514

Conversation

dependabot bot commented on behalf of github Nov 8, 2022

torchaudio 0.13.0 Release Note

Highlights

[Beta] Source Separation Models and Bundles

[Beta] Datasets and Metadata Mode for SUPERB Benchmarks

[Beta] Custom Language Model support in CTC Beam Search Decoding

[Beta] StreamWriter

Backward-incompatible changes

dependabot bot commented on behalf of github Dec 16, 2022