Based on funcwj's uPIT, this repository adds training code with multi-GPU support and rebuilds the data loading on PyTorch's native DataLoader.
Demo Pages: results of the pure speech separation model
- Support Multi-GPU Training
- Use PyTorch's Built-in DataLoader
- Provide Pre-Trained Models
- PyTorch==1.3.0
- tqdm==4.32.1
- librosa==0.7.1
- scipy==1.3.0
- numpy==1.16.4
- PyYAML==5.1.1
- Generate the dataset using create-speaker-mixtures.zip with WSJ0 or TIMIT
- Prepare the scp files (each line of an scp file is "filename path"):
  python create_scp.py
- Prepare the cmvn statistics (cepstral mean and variance normalization, a computationally efficient normalization technique for robust speech recognition), calculated by the compute_cmvn.py script:
  python compute_cmvn.py ./tt_mix.scp ./cmvn.dict
- Modify the yaml config: update the scp and cmvn paths, and set num_spk in run_pit.py to the number of speakers.
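For illustration only, the fields to edit might look like the fragment below; the key names are assumptions based on the steps above, not the repository's exact yaml schema:

```yaml
# Hypothetical fragment: point these at the scp files and cmvn
# statistics produced in the previous steps.
data:
  train_scp: ./tr_mix.scp
  test_scp: ./tt_mix.scp
  cmvn: ./cmvn.dict
```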
- Training:
  sh train.sh
- Inference:
  sh test.sh
- Kolbæk M, Yu D, Tan Z H, et al. Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2017, 25(10): 1901-1913.
- https://github.com/funcwj/uPIT-for-speech-separation