16000Hz? #176

tim-gromeyer · 2023-08-13T07:35:26Z

Hello,
is it possible to use 16000 Hz? The README says only 48000 Hz is supported, but I can't use 48000 Hz. I am trying to add noise cancellation to my voice assistant, but Vosk (the library I use for speech recognition) unfortunately does not work with 48000 Hz.

zamadatix · 2023-08-25T03:44:27Z

The RNN was trained on 48 kHz signals. One option for your voice assistant would be to capture audio at 48 kHz, feed it through the noise cancellation algorithm, then downsample the resulting audio to 16 kHz for Vosk to process.
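The downsampling step suggested above can be sketched in plain Python. This is only an illustration, not code from this project or from Vosk: the function names, the 63-tap filter length, and the 7 kHz cutoff are assumptions chosen for the example. It low-pass filters the 48 kHz signal to remove content above the 16 kHz Nyquist limit (8 kHz), then keeps every third sample.

```python
import math
from typing import List


def lowpass_taps(num_taps: int, cutoff_hz: float, rate_hz: float) -> List[float]:
    """Hamming-windowed sinc low-pass filter, normalised to unity gain at DC."""
    fc = cutoff_hz / rate_hz              # cutoff as a fraction of the sample rate
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        sinc = 2 * fc if x == 0 else math.sin(2 * math.pi * fc * x) / (math.pi * x)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(sinc * window)
    total = sum(taps)
    return [t / total for t in taps]


def downsample_48k_to_16k(samples: List[float], num_taps: int = 63) -> List[float]:
    """Filter below 8 kHz, then keep one output sample per three input samples."""
    # Assumed cutoff: a little under 8 kHz so the transition band ends near Nyquist.
    taps = lowpass_taps(num_taps, cutoff_hz=7000, rate_hz=48000)
    half = num_taps // 2
    out = []
    for center in range(0, len(samples), 3):   # only compute the samples we keep
        acc = 0.0
        for k, tap in enumerate(taps):
            i = center + k - half              # filter is centred, so no added delay
            if 0 <= i < len(samples):          # treat samples outside the buffer as zero
                acc += tap * samples[i]
        out.append(acc)
    return out
```

In a real application a tested resampler from an audio or DSP library would likely be a better choice; the point here is only that the 48 kHz output of the noise suppressor needs an anti-aliasing filter before it is reduced to 16 kHz.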

xJanise · 2023-08-30T14:32:48Z

The microphone I'm using doesn't support anything above 16 kHz. Is there any workaround, or will I have to use it at 16 kHz and hope for the best?

zamadatix · 2023-08-30T14:42:52Z

It doesn't matter so much that the original data is 16 kHz; what matters is that the audio fed into the noise suppression algorithm is sampled at 48 kHz. From that perspective, you can still capture from the mic and feed Vosk at 16 kHz. You just need to do sample rate conversion (i.e. convert from 16 kHz to 48 kHz, or vice versa) for audio going into or coming out of the noise suppression step.
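A rough sketch of that round trip, in plain Python, might look like the following. Everything named here is hypothetical: `denoise_48k` stands in for whatever call actually runs the noise suppression on 48 kHz samples, and the filter length and 7 kHz cutoff are assumptions for illustration. Upsampling inserts two zeros after each sample and low-pass filters the result; downsampling filters again and keeps every third sample.

```python
import math
from typing import Callable, List


def lowpass_taps(num_taps: int, cutoff_hz: float, rate_hz: float) -> List[float]:
    """Hamming-windowed sinc low-pass filter, normalised to unity gain at DC."""
    fc = cutoff_hz / rate_hz
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        sinc = 2 * fc if x == 0 else math.sin(2 * math.pi * fc * x) / (math.pi * x)
        taps.append(sinc * (0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))))
    total = sum(taps)
    return [t / total for t in taps]


def filter_centered(samples: List[float], taps: List[float]) -> List[float]:
    """Zero-delay FIR filtering; samples outside the buffer count as zero."""
    half = len(taps) // 2
    out = []
    for center in range(len(samples)):
        acc = 0.0
        for k, tap in enumerate(taps):
            i = center + k - half
            if 0 <= i < len(samples):
                acc += tap * samples[i]
        out.append(acc)
    return out


# Assumed design: 63 taps, cutoff a little under the 8 kHz Nyquist limit of 16 kHz audio.
TAPS = lowpass_taps(63, cutoff_hz=7000, rate_hz=48000)


def upsample_16k_to_48k(samples: List[float]) -> List[float]:
    stuffed: List[float] = []
    for s in samples:
        stuffed.extend((3.0 * s, 0.0, 0.0))   # gain of 3 makes up for the inserted zeros
    return filter_centered(stuffed, TAPS)      # removes the spectral images above 8 kHz


def downsample_48k_to_16k(samples: List[float]) -> List[float]:
    return filter_centered(samples, TAPS)[::3]  # anti-alias, then keep every third sample


def denoise_16k(samples: List[float],
                denoise_48k: Callable[[List[float]], List[float]]) -> List[float]:
    """16 kHz in, 16 kHz out; the (hypothetical) denoiser only ever sees 48 kHz audio."""
    return downsample_48k_to_16k(denoise_48k(upsample_16k_to_48k(samples)))
```

With this shape, the microphone capture and the Vosk input both stay at 16 kHz, for example `clean = denoise_16k(mic_chunk, my_denoiser)`, where `mic_chunk` and `my_denoiser` are placeholders. Processing whole buffers like this introduces edge effects at chunk boundaries, so a streaming resampler that keeps filter state between chunks would be preferable in practice.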

bobobo199733 mentioned this issue Mar 5, 2024

noise-suppression-for-voice bobobo199733/issues-metrics-tool-repo#22

Closed
