Use ASpIRE Chain Model (By Dan Povey) #50

adx349 · 2017-04-20T05:46:26Z

Thank you for your work on Kaldi; it is very helpful to me.
I was wondering what changes I have to make to use the latest ASpIRE chain model.
I tried setting nnet-mode=3 and replacing the fst, mdl, and conf files with those from the new model, but I get no output.
What do you think the issue is?

arawind · 2017-04-29T04:26:33Z

@adx349 Try setting frame-subsampling-factor to 3 and acoustic-scale to 1.

Take a look at this thread for more details

fanskyer · 2017-05-09T18:18:04Z

I tried that too, and it did not work. My guess is that the ASpIRE model uses a BLSTM, which is not supported by this online decoder.

tshastry · 2017-05-09T18:21:21Z

@fanskyer @adx349 I actually think the issue is that the new Kaldi looped decoding is not working properly. If you roll back Kaldi to commit bcc71b67d489a1766922c9caf2a54306755f1861 and gst-kaldi-nnet2-online to commit 63b2cfd, the ASpIRE model works. You will still need to set nnet-mode to 3, acoustic-scale to 1, and frame-subsampling-factor to 3.

maxhawkins · 2017-05-28T19:04:30Z

Were you able to get this working? I tried rolling back to 63b2cfd and setting those options in my config. No luck, it just returns yeah yeah yeah over and over again.

Here's my config: https://gist.github.com/maxhawkins/24edbd87be0aa1601da5034acc27d7ee

I'm using the ASpIRE chain model from kaldi-asr.org with an HCLG.fst created using the documentation.

maxhawkins · 2017-05-28T19:28:27Z

Never mind. I was using the client incorrectly. When I converted my wav file to raw PCM it started working fine.

For anyone who encounters this in the future, here are the steps I took:

Compile Kaldi at kaldi-asr/kaldi@bcc71b6 and gst-kaldi-nnet2-online at 63b2cfd.
Compile the ASpIRE HCLG.fst and point worker.yaml to it.
Start the server and pass it raw audio using client.py:

python kaldigstserver/master_server.py --port=8888 &
env GST_PLUGIN_PATH=.. python kaldigstserver/worker.py -u ws://localhost:8888/worker/ws/speech -c worker.yaml &
sox audio.wav -r 8000 -e signed -b 16 -c 1 -t raw audio.raw remix 1
python kaldigstserver/client.py -r 16000 audio.raw
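
The two rates in the commands above are consistent if client.py's -r option is a byte rate rather than a sample rate, which is an assumption here and worth checking against the client's help text. Under that reading, 16-bit mono audio resampled to 8000 Hz is sent at 16000 bytes per second. A minimal sketch of the arithmetic (the helper name `client_byte_rate` is made up for illustration and is not part of client.py):

```python
def client_byte_rate(sample_rate_hz, bits_per_sample=16, channels=1):
    """Bytes per second of raw PCM audio with the given parameters."""
    return sample_rate_hz * (bits_per_sample // 8) * channels

# 8000 Hz, 16-bit, mono: the format produced by the sox command above.
print(client_byte_rate(8000))  # prints 16000
```

If the audio were resampled to a different rate, the value passed to -r would need to change accordingly.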

tshastry · 2017-06-14T01:03:02Z

Just an update on this: I did some testing on my side, and the ASpIRE model works with the latest commits if frame-subsampling-factor is set to 1 instead of 3. This seems to be necessary for Kaldi's most recent "looped decoding" implementation. However, accuracy appears to be worse than with both repositories rolled back to the earlier commits.

maxhawkins · 2017-06-14T02:09:16Z

Thanks, I'll give that a shot.

I'm also seeing some errors with word-level alignment (subtle drift noticeable on long recordings) with the ASPIRE model at 63b2cfd, but I think that's a separate issue. I'll keep troubleshooting and file another bug if I can't resolve it.

suhel-jaber · 2018-04-20T03:21:44Z

It works for me, but it keeps outputting "mhm" every few seconds, which TEDLIUM didn't do. Has anyone experienced the same issue?

maxhawkins · 2018-04-20T22:30:32Z

I've had that issue before. Usually it means your settings are wrong. Check the acoustic-scale and frame-subsampling-factor.
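
One way to run that check is to compare the decoder settings against the values reported earlier in this thread. The sketch below assumes the settings have already been loaded into a plain dict; the `EXPECTED` table and the `find_mismatches` helper are hypothetical and only encode what was reported here for the rolled-back commits (with the latest commits, a frame-subsampling-factor of 1 was reported instead).

```python
# Values reported in this thread for the ASpIRE chain model at the
# rolled-back commits; they are not taken from official documentation.
EXPECTED = {
    "nnet-mode": 3,
    "acoustic-scale": 1.0,
    "frame-subsampling-factor": 3,
}

def find_mismatches(decoder_settings, expected=EXPECTED):
    """Return {key: (actual, expected)} for settings that differ or are missing."""
    mismatches = {}
    for key, want in expected.items():
        got = decoder_settings.get(key)
        if got is None or float(got) != float(want):
            mismatches[key] = (got, want)
    return mismatches

# Hypothetical decoder settings, e.g. as loaded from worker.yaml.
settings = {"nnet-mode": 3, "acoustic-scale": 0.1, "frame-subsampling-factor": 3}
print(find_mismatches(settings))
```

An empty result means the three settings match the reported values; any entry shows the actual value next to the expected one.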

mgoldey mentioned this issue Feb 16, 2018

fix for kaldi chain models alumae/kaldi-gstreamer-server#116

Merged
