Problems with german transcription #138
-
I'm experiencing problems with German Live transcription. Both opponents are speaking German (native German speakers), Deepgram configured like this:
When it is silence inside the stream Deepgram just sending words Wir or Ich in final messages. Tried to play a bit with timeSlice (increased it to 500) but this didn't really help. Not sure if endpointing option might help. Is it possible to somehow prevent receiving Wir or Ich in final messages? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
Hi @magic-tech-dev! Sorry you ran into this. Are you using a browser microphone for transcription? Sometimes that can introduce artifacts that our models incorrectly transcribe as words. Testing out different audio sources may help; I'll also talk with our research team to see if there's anything we can do on our end. I'm not aware of a |
Beta Was this translation helpful? Give feedback.
-
Hi @shirgoldbird! Yes, I'm using browser microphone for transcription. Our product is a web client, so using other resources is not possible unfortunately.
About
But why then it is working correct with 'en' language? I'm using the same config and microphone - never saw something like that. English model somehow ignores this artifacts? English transcription working great. Seems like the German recognition is not production ready - written POC is not working correctly. Please let me know if it will be any workaround available to make it working more stable in case of German language recognition. |
Beta Was this translation helpful? Give feedback.
@magic-tech-dev That's correct, Whisper does not support real-time transcription.
It seems like our German model has a recurring issue where it detects words in silence. This is due to the data the model was trained on and can't be fixed on your end by configuring Deepgram's parameters.
I've passed along your feedback to the research team so we can investigate improvements. Unfortunately, I don't have a timeline for you at the moment, but we usually announce language model updates on our changelog and social media .
Please let me know if you have any other questions!