Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seeking Guidance on Custom Urdu ASR Training Data and Vocabulary Expansion #4900

Open
Shaukataliii opened this issue Jan 3, 2024 · 1 comment

Comments

@Shaukataliii
Copy link

Hello,
I am a developer working on a project involving the development of an Urdu Automatic Speech Recognition (ASR) system using the Kaldi ASR toolkit. I am encountering two specific challenges and would greatly appreciate your insights.

Challenges

  1. Acquiring Transcriptions for Custom Urdu Dataset:
  • Issue: Obtaining accurate transcriptions for a substantial custom Urdu language dataset, tailored for industry-specific use, has proven challenging.

  • Request: Seeking guidance or suggestions on cost-effective solutions or resources that could assist in obtaining accurate transcriptions.

  1. Optimizing Kaldi ASR for Recognizing Unseen Words:
    • Issue: We aim to optimize the Kaldi ASR model to efficiently recognize new words it may encounter during inference, especially industry-specific jargon.
    • Request: Looking for insights or recommendations on approaches to handle previously unseen words and enhance the model's adaptability.

Thank you for your time and consideration.

@judyfong
Copy link

judyfong commented Oct 8, 2024

For two I recommend looking at the icelandic althingi recipe a bit. https://github.com/cadia-lvl/althingi-asr We use sub word modeling and also fst and regular expressions through Thrax. We're looking to merge the recipe into kaldi-asr soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants