Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mic input for live transcription #86

Open
joshoreefe opened this issue Apr 3, 2024 · 4 comments
Open

Mic input for live transcription #86

joshoreefe opened this issue Apr 3, 2024 · 4 comments

Comments

@joshoreefe
Copy link

joshoreefe commented Apr 3, 2024

After some upgrades and configuration changes the live transcription stopped working. My setup was working okay, but for unknown reason stopped capturing the mic input. Hence upgraded Jetson Orin Nano developer kit 4b to JetPack 5.1.3.

The live input device doesn't seem to capture audio same way as arecord. If I do a test recording so:

arecord -D usbmic test.wav
Recording WAVE 'test.wav' : Signed 16 bit Little Endian, Rate 8000 Hz, Mono

the recorded audio is fine. The audio file transcribes correctly.

If I then try live transcription using the same device so:

whisper-ctranslate2 --live_transcribe True --live_input_device 27 ....etc

the process starts okay:
Live stream device: usbmic
Listening.. (Ctrl+C to Quit)

But that's all. Nothing happens. Seems the capture is working differently from record?

@joshoreefe
Copy link
Author

Is live transcribing working for others? If so, please give some setup hints!

@Benjamin-Lee
Copy link

Can confirm on Mac as well. Feeding in mp3 works but live stream doesn't, even though the device is detected.

@965311532
Copy link

Live transcribing isn't working for me either

@pheraph
Copy link

pheraph commented Sep 2, 2024

The built-in microphone of the M-MacBooks is known to have problems with the input volume with various programs. Sometimes a sudo killall coreaudiod helps for a while, but not here. In fact, the threshold can be lowered, then it works for me:

whisper-ctranslate2 --live_transcribe True --live_volume_threshold 0.01

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants