Sphinx OPS Isolated Word Models

This is the main repository for building an acoustic model for Sphinx from the Open Speech Corpus (OPS) Isolated Word Corpus.

First, run the script 01_download_word_recordings.py, which fetches all the data from OPS.

Then run the script 02_convert_mp4_to_wav.py. This script requires FFmpeg to be installed and available on your PATH.
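As a rough illustration of what the conversion step involves, the sketch below builds an FFmpeg command for a single recording and applies it to a directory. It is not the code in 02_convert_mp4_to_wav.py: the function names, the directory arguments, and the 16 kHz mono 16-bit output format are assumptions (16 kHz mono is a common choice for Sphinx training), so check them against the actual script and your training configuration.

```python
import shutil
import subprocess
from pathlib import Path


def build_ffmpeg_command(src, dst, sample_rate=16000):
    """Return the ffmpeg argument list for one MP4 -> WAV conversion.

    The sample rate and mono/16-bit settings are assumptions, not values
    taken from the repository.
    """
    return [
        "ffmpeg",
        "-y",                     # overwrite dst if it already exists
        "-i", str(src),           # input recording
        "-vn",                    # ignore any video stream
        "-ac", "1",               # mono
        "-ar", str(sample_rate),  # sample rate in Hz
        "-acodec", "pcm_s16le",   # 16-bit signed little-endian PCM
        str(dst),
    ]


def convert_directory(src_dir, dst_dir):
    """Convert every .mp4 in src_dir to a .wav of the same name in dst_dir."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    dst_dir = Path(dst_dir)
    dst_dir.mkdir(parents=True, exist_ok=True)
    for src in sorted(Path(src_dir).glob("*.mp4")):
        dst = dst_dir / (src.stem + ".wav")
        # check=True raises CalledProcessError if ffmpeg exits with an error
        subprocess.run(build_ffmpeg_command(src, dst), check=True)
```

Calling `convert_directory("recordings", "wav")` with hypothetical folder names would convert each MP4 in `recordings` and stop at the first file FFmpeg fails on.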

Next, prepare the Sphinx configuration data. Run the script 03_configure_sphinx.py, which generates almost all the files required by Sphinx. To create a custom language model, you also need to run 04_generate_language_model.sh.

Make sure you have sphinxtrain installed on your machine.
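One way to confirm that the command-line tools mentioned in this README are reachable is to look them up on PATH. This is only a convenience sketch; `missing_tools` is a hypothetical helper and is not part of the repository.

```python
import shutil


def missing_tools(names):
    """Return the tool names that cannot be found on PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    # Tools referenced in this README; adjust the list to your setup.
    missing = missing_tools(["ffmpeg", "sphinxtrain", "pocketsphinx_continuous"])
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All tools found.")
```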

Now run:

sphinxtrain -t ops_isolated_words setup

After this, your etc folder will contain the full structure you need for your project.

Please check this link for further information.

In the generated configuration, search for $CFG_HMM_TYPE and select the .semi option. If you are on a multicore machine, change $CFG_QUEUE_TYPE to Queue::POSIX and set $CFG_NPART and $DEC_CFG_NPART to the number of cores on your machine.
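These edits can be made by hand, but a small script makes them repeatable. The sketch below rewrites the first uncommented assignment of each variable in a configuration text. Several things here are assumptions rather than facts from this repository: the helper names, the sample configuration text, the quoted spelling `'.semi.'` (compare it with the commented alternatives in your own file), and the idea that the settings live in etc/sphinx_train.cfg.

```python
import os
import re


def set_perl_scalar(cfg_text, name, value):
    """Replace the value on the first uncommented `$name = ...;` line."""
    pattern = re.compile(
        r"^([ \t]*\$" + re.escape(name) + r"\s*=\s*)[^;]*;", re.MULTILINE
    )
    # A function replacement avoids backslash handling in the new value.
    new_text, count = pattern.subn(
        lambda m: m.group(1) + value + ";", cfg_text, count=1
    )
    if count == 0:
        raise KeyError(name)
    return new_text


def apply_settings(cfg_text, cores=None):
    """Select the semi-continuous model type and enable parallel training."""
    cores = cores or os.cpu_count() or 1
    cfg_text = set_perl_scalar(cfg_text, "CFG_HMM_TYPE", "'.semi.'")
    cfg_text = set_perl_scalar(cfg_text, "CFG_QUEUE_TYPE", '"Queue::POSIX"')
    cfg_text = set_perl_scalar(cfg_text, "CFG_NPART", str(cores))
    cfg_text = set_perl_scalar(cfg_text, "DEC_CFG_NPART", str(cores))
    return cfg_text


# Made-up sample in the style of a sphinxtrain config; a real file may differ.
SAMPLE = '''#$CFG_HMM_TYPE = '.semi.'; # commented-out alternative
$CFG_HMM_TYPE  = '.cont.';
$CFG_QUEUE_TYPE = "Queue";
$CFG_NPART = 1;
$DEC_CFG_NPART = 1;
'''

if __name__ == "__main__":
    print(apply_settings(SAMPLE, cores=4))
```

Commented-out lines (those starting with `#`) are left untouched, so the alternatives remain visible in the file for reference.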

Then run the training:

sphinxtrain run

This could take some time.

To check the results, run:

pocketsphinx_continuous -hmm model_parameters/ops_isolated_words.ci_semi/ -lm etc/ops_isolated_words.lm.DMP -dict etc/ops_isolated_words.dic -inmic yes
