Problems getting good speech to text for auctions and fundraisers #957
The issue is that Deepgram consistently mistranscribes an auctioneer calling dollar amounts. For example, the audio "seven thousand, seven, seventy five hundred, seven five, seventy five hundred, seven thousand, do I have seventy five hundred? At seven thousand, do I have seventy five hundred?" is transcribed as: 7 1000, 700,75100, 75, 7500, 10000, 7000. I find it particularly odd that Deepgram transcribes "seventy five hundred" as "75100". It does this often: no space, no comma, and the amount is not transcribed properly unless "thousand" is said after "seven". Any ideas? The current settings are: const deepgram = deepgramClient.listen.live({
Replies: 2 comments 5 replies
@johnnydanger could you try this with our Nova-2 model? It might handle it better, though numbers can always be tricky for STT, and the speed at which the auctioneer is saying the numbers could also make this challenging to get right. Depending on whether there are breaks between them, you might want to look at endpointing with interim results. Also check out our docs on utterance_end and speech_started. Playing with these different features of our Streaming STT API might get you what you need, but it might take some experimentation.
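As a starting point for that experimentation, here is a minimal sketch of a live options object that combines the features mentioned above. The helper name `buildAuctionLiveOptions` and the numeric values (300 ms endpointing, 1000 ms utterance end) are illustrative assumptions to tune, not recommended settings. `model`, `interim_results`, `endpointing`, `utterance_end_ms`, and `vad_events` are Deepgram streaming parameters as I understand them, so check the current docs before relying on them.

```javascript
// Hypothetical helper: builds an options object for a live transcription
// connection. The numeric defaults are guesses to tune, not recommendations.
function buildAuctionLiveOptions(overrides = {}) {
  return {
    model: "nova-2",        // the model suggested in this thread
    interim_results: true,  // stream partial transcripts while audio arrives
    endpointing: 300,       // ms of silence before a result is finalized (assumed value)
    utterance_end_ms: 1000, // ms gap before an utterance-end message (assumed value)
    vad_events: true,       // request speech-started events
    ...overrides,           // caller-supplied values win over the defaults
  };
}

// Try a longer endpointing window for a fast-talking auctioneer.
const options = buildAuctionLiveOptions({ endpointing: 500 });
console.log(options);

// The object would then be handed to the live client from the original post,
// e.g. deepgramClient.listen.live(options). That call is left out here so the
// sketch runs without the SDK or an API key.
```

Keeping the options in a plain function makes it easy to try several endpointing and utterance-end values against the same recording and compare the transcripts.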
Awesome, thanks John! I’ll dig into it and let you know the solution I land on. Thanks for all the help.
On Oct 19, 2024 at 9:24 AM -0700, John Vajda (JV) wrote:
@johnnydanger there are, actually. Try setting numerals or punctuate to true:
https://developers.deepgram.com/docs/numerals
https://developers.deepgram.com/docs/punctuation
These are the older formatting features that smart_format attempted to replace, but I know they sometimes work better independently.
@johnnydanger there are, actually. Try setting `numerals` or `punctuate` to `true`:
https://developers.deepgram.com/docs/numerals
https://developers.deepgram.com/docs/punctuation
These are the older formatting features that `smart_format` attempted to replace, but I know they sometimes work better independently.
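A minimal sketch of that suggestion follows. The original settings are truncated in the post, so the `current` object below is a hypothetical stand-in, and `withLegacyFormatting` is an illustrative helper name. Turning `smart_format` off is also an assumption on my part, made so that `numerals` and `punctuate` act on their own; the thread only says the older flags sometimes work better independently.

```javascript
// Hypothetical helper: returns a copy of the given live options with
// smart_format turned off and the older formatting flags turned on.
function withLegacyFormatting(liveOptions) {
  return {
    ...liveOptions,
    smart_format: false, // assumption: disabled so the older flags act alone
    numerals: true,      // convert spoken numbers to digits
    punctuate: true,     // add punctuation and capitalization
  };
}

// Hypothetical stand-in for the poster's settings, which are cut off in the post.
const current = { model: "nova-2", smart_format: true };
const candidate = withLegacyFormatting(current);
console.log(candidate);

// `current` is left unmodified, so both variants can be run against the same
// auction audio and the transcribed dollar amounts compared side by side.
```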