Word-alignment problem #56

Open
arielvsp opened this issue Aug 26, 2017 · 1 comment

Comments


arielvsp commented Aug 26, 2017

Hi,

Running the demo script (./transcribe-audio.sh dr_strangelove.mp3) produces the following output and hangs:

LOG ([5.2.64~1-2fbf2]:ComputeDerivedVars():ivector-extractor.cc:183) Computing derived variables for iVector extractor
LOG ([5.2.64~1-2fbf2]:ComputeDerivedVars():ivector-extractor.cc:204) Done.
WARNING ([5.2.64~1-2fbf2]:LatticeWordAligner():word-align-lattice.cc:263) [Lattice has input epsilons and/or is not input-deterministic (in Mohri sense)]-- i.e. lattice is not deterministic.  Word-alignment may be slow and-or blow up in memory.
WARNING ([5.2.64~1-2fbf2]:LatticeWordAligner():word-align-lattice.cc:263) [Lattice has input epsilons and/or is not input-deterministic (in Mohri sense)]-- i.e. lattice is not deterministic.  Word-alignment may be slow and-or blow up in memory.
WARNING ([5.2.64~1-2fbf2]:LatticeWordAligner():word-align-lattice.cc:263) [Lattice has input epsilons and/or is not input-deterministic (in Mohri sense)]-- i.e. lattice is not deterministic.  Word-alignment may be slow and-or blow up in memory.
huh i hello this is hello dimitri listen i i can't hear too well do you support you could turn the music down just a little
Caught SIGSEGV

Is that normal, given the Kaldi warnings? I see the same behavior with the streaming service (the worker hangs) when do-phone-alignment is set to "true". Is there anything I can do in Kaldi to prevent or mitigate this?

@nshmyrev
Contributor

I investigated this problem. The cause is that you can't call WordAlignLattice twice: the first run replaces silences with epsilons, so the second run emits a warning. A fix for a similar problem is here:

alphacep/vosk-api@558b4dd
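
As a rough illustration of the failure mode described above, here is a toy Python sketch. It is not the Kaldi API and is not necessarily what the linked commit does. The names `toy_word_align`, `AlignOnce`, `EPS` and `SIL` are hypothetical stand-ins: the toy pass warns when its input already contains epsilons and replaces silences with epsilons, and the small wrapper shows one possible way to make sure the pass runs only once.

```python
import warnings

EPS = "<eps>"  # hypothetical stand-in for an epsilon (empty) label
SIL = "SIL"    # hypothetical stand-in for a silence label


def toy_word_align(labels):
    """Toy stand-in for a word-alignment pass (NOT the Kaldi API).

    Warns if the input already contains epsilons, then replaces
    silence labels with epsilons.
    """
    if EPS in labels:
        warnings.warn("input has epsilons; alignment may be slow")
    return [EPS if label == SIL else label for label in labels]


class AlignOnce:
    """Caches the aligned result so the pass is applied only once."""

    def __init__(self, labels):
        self._raw = list(labels)
        self._aligned = None

    def aligned(self):
        # Run the pass on the raw input the first time, then reuse it.
        if self._aligned is None:
            self._aligned = toy_word_align(self._raw)
        return self._aligned
```

Feeding the output of `toy_word_align` back into itself triggers the warning, while repeated calls to `AlignOnce.aligned()` do not, because the second call returns the cached result instead of re-aligning.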
