Real time noise suppression #12

Open
sberryman opened this issue Apr 9, 2020 · 1 comment
Comments

@sberryman

Excellent article on VentureBeat today:
https://venturebeat.com/2020/04/09/microsoft-teams-ai-machine-learning-real-time-noise-suppression-typing/

Funnily enough, I've used this dataset (which I'm assuming is the one you refer to in the article) to train noise suppression as well. I didn't have a requirement for real-time/streaming, so I used a bidirectional LSTM recurrent layer. I also trained against LibriSpeech (technically LibriTTS, as I wanted 24 kHz audio).

Examples

These are sourced from national news broadcasts to show performance on data the network was NOT trained on. The audio files are compressed because GitHub doesn't allow raw waveform uploads. The source files from the broadcast have the _noisy.wav suffix and the predicted output from the network has the _clean.wav suffix.

Example 1

sequence 1585584_clean
sequence.1585584_.zip

Example 2

sequence 1597540_clean
sequence.1597540_.zip

Example 3

sequence 1046182_clean
sequence.1046182_.zip

Example 4

sequence 1597377_clean
sequence.1597377_.zip

Example 5

sequence 231_clean
sequence.231_.zip

Example 6

Not the best but still did a decent job suppressing a noise sample it was never trained against.
00049 unknown and_despite_that_and_despite_40_million_18_trump_haters_including_people_that_worked_for_hillary_clinton_and_some_of_the_worst_human_beings_on_earth_they_got_nothing_clean
trump_helicopter.zip

@sberryman
Author

I should clarify that I started with masking and later decided to go a different route. In all the provided examples, the network's output is recombined with the phase of the source (noisy) audio before being passed to istft. Masking left much more of the background noise in the result.
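For anyone curious what recombining with the source phase can look like, here is a minimal sketch. It is not my training or inference code. It assumes the network predicts a magnitude spectrogram, and it uses small NumPy-only `stft`/`istft` helpers written for this example with an arbitrary 512-sample Hann window and 128-sample hop. The "network output" is replaced by an oracle magnitude computed from a synthetic clean signal, and the function names `recombine_with_source_phase` and `apply_mask` are made up for illustration.

```python
import numpy as np

N_FFT = 512  # assumed frame length, not taken from the real model
HOP = 128    # assumed hop size (75% overlap)


def stft(x, n_fft=N_FFT, hop=HOP):
    """Hann-windowed STFT, shape (frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def istft(spec, n_fft=N_FFT, hop=HOP):
    """Inverse STFT via overlap-add, normalised by the summed squared window."""
    window = np.hanning(n_fft + 1)[:-1]
    n_frames = spec.shape[0]
    length = n_fft + hop * (n_frames - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(spec, n=n_fft, axis=1)
    for i in range(n_frames):
        sl = slice(i * hop, i * hop + n_fft)
        out[sl] += frames[i] * window
        norm[sl] += window ** 2
    # The very first and last samples have almost no window energy,
    # so the clamp below leaves them inexact.
    return out / np.maximum(norm, 1e-8)


def recombine_with_source_phase(pred_mag, noisy_spec):
    """Pair a predicted magnitude with the phase of the noisy input."""
    return pred_mag * np.exp(1j * np.angle(noisy_spec))


def apply_mask(mask, noisy_spec):
    """Masking variant for comparison: scale the noisy spectrogram."""
    return mask * noisy_spec


# Synthetic demo: a 440 Hz tone plus white noise, one second at 16 kHz.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2 * np.pi * 440.0 * t)
noisy = clean + 0.3 * rng.standard_normal(t.shape)

noisy_spec = stft(noisy)
pred_mag = np.abs(stft(clean))  # oracle stand-in for the network's output
enhanced = istft(recombine_with_source_phase(pred_mag, noisy_spec))
```

The difference between the two helpers is that a mask can only scale what is already in the noisy spectrogram, while a directly predicted magnitude replaces it outright and borrows only the phase from the source.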
