Skip to content

finetune the chain model based on cvte open source model without traing any GMM for frame alignment

Notifications You must be signed in to change notification settings

zhaoyi2/CVTE_chain_model_finetune

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CVTE_chain_model_finetune

  • finetune chain model base on cvte open source model using aishell1 data
  • using the open source GMM for frame align

Explanation

  • cvte supply a chain model trained using more than 2000h audio data
  • cvte supply a 3-gram LM model trained with 1000 GB text;
  • this project does not need training any GMM series model
  • this project support online cmvn, since "apply-cmvn-online" is used during the training and decoding

Install

  • kaldi

Usage

  • bash run_train_ft.sh to prepare the data、train LM(optinal,you can also use the cvte open source HCLG) and finetune chain model
  • bash run_test.sh to decode the test data using the finetune model with apply-cmvn-online

Workflow

  • prepare the lexicon if train LM with your own data(optional,you can also use the cvte open source HCLG)
  • prepare the train data
  • prepare phone sets, questions, L compilation(optinal,you can also use the cvte open source HCLG)
  • train LM using kaldi_lm or SRILM(optinal,you can also use the cvte open source HCLG)
  • make graph(optinal,you can also use the cvte open source HCLG)
  • extract the 40 fbank
  • generate speed-perturbed data (for alignment) and use it to align through cvte open source GMM(tri6b)
  • get the alignments as lattices
  • generate volume-perturbed data (for train)
  • prepare config file for finetune such as model、den.fst、phone_lm.fst and tree
  • funtune the chain model by some epoch with a little bit small learning rate
  • decode the test data use finetune model with apply-cmvn-online

Result(aishell1 test data)

  • cer 11.10%

Reference

About

finetune the chain model based on cvte open source model without traing any GMM for frame alignment

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published