This is the main repository to build an acoustic model for sphinx based on the Open Speech Corpus Isolated Word Corpus.
First execute the script 01_download_word_recordings.py
, this script will fetch all data from OPS.
Then execute the script 02_convert_mp4_to_wav.py
, to execute this script you must have FFMpeg
installed and on your path.
After you need to prepare sphinx configuration data, to achieve this:
Then you can call the script 03_configure_sphinx.py
, this script will configure almost all the files required by sphinx,
but to create a custom language model you need to execute 04_generate_language_model.sh
.
Make sure you have sphinxtrain installed on your pc
Now execute
sphinxtrain -t ops_isolated_words setup
After this in your etc folder you will have a full structure or what you need for your project
Please check this link for further information.
Search for $CFG_HMM_TYPE
and select .semi
If you are on a multicore machine change $CFG_QUEUE_TYPE
to Queue::POSIX
and $CFG_NPART
and $DEC_CFG_NPART
to your machine cores
Then execute the train
sphinxtrain run
This could take some time.
To check the results
pocketsphinx_continuous -hmm model_parameters/ops_isolated_words.ci_semi/ -lm etc/ops_isolated_words.lm.DMP -dict etc/ops_isolated_words.dic -inmic yes