Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Non-English text shown on transcription console UI #1022

Open
asadullahnaeem-techtics opened this issue Nov 1, 2024 · 4 comments
Open

Non-English text shown on transcription console UI #1022

asadullahnaeem-techtics opened this issue Nov 1, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@asadullahnaeem-techtics
Copy link

The logs of the Multimodal agent with OpenAI Realtime API show correct English text on the console but on the UI, it sometimes shows the audio transcription in other languages like Hindi, Chinese, Russian etc while speaking English.

@longcw
Copy link
Contributor

longcw commented Nov 3, 2024

I noticed the same issue sometimes. According to the openai's document, the transcription is from a separate process run on a separate ASR model (currently whisper-1), and the transcript may diverge somewhat from the model's interpretation. So it seems it's from the speech recognition error. https://platform.openai.com/docs/api-reference/realtime-server-events/conversation/item/input_audio_transcription/completed

@longcw
Copy link
Contributor

longcw commented Nov 3, 2024

oh wait, you mean in the transcripts in logs it's correct but in UI it's wrong? maybe it's a different issue from what I mentioned above.

@asadullahnaeem-techtics
Copy link
Author

In the logs, transcription is correct but on the UI it is not.

@nbsp nbsp added the bug Something isn't working label Nov 5, 2024
@davidzhao
Copy link
Member

can you share an example with logs or screenshot?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants