GitHub

This is the repository for Speaker Change Detection algorithms which are based on different Deep Learning architectures.

Firstly, you can start with my blogpost to read summary of different algorithms.

We have tried BiLSTM, CNN+BiLSTM and Speaker2Vec. You can find the detailed Jupyter Notebooks for these approaches at this repository.

Also, you can find the scripted version of BiLSTM and CNN+BiLSTM. If you read their readme and follow whole steps, it should work. (If it does not work, you can open the issue or send the e-mail to me. However, for the visualization, you need to use Jupyter Notebooks. Unfortunately, some function names are different in the script and notebooks. Because, scripted version has little bit different architecture. For the Speaker2Vec, I have not done the scripted version, however, I am planning to provide that.

Our results are not so good, however, whole pipeline is ready. So that, you can tweak the paramaters, do fine-tuning and change the DL architecture to get better result.

Name		Name	Last commit message	Last commit date
Latest commit History 155 Commits
bilstm_CL		bilstm_CL
cnn_bilstm_CL		cnn_bilstm_CL
.DS_Store		.DS_Store
.gitignore		.gitignore
BiLSTM.ipynb		BiLSTM.ipynb
ISCI_extractor.ipynb		ISCI_extractor.ipynb
LICENSE		LICENSE
README.md		README.md
Speaker2Vec.ipynb		Speaker2Vec.ipynb
Speaker2Vec_dev.ipynb		Speaker2Vec_dev.ipynb
bilstm-crepe.ipynb		bilstm-crepe.ipynb
bilstm_minima.ipynb		bilstm_minima.ipynb
npy_extractor.ipynb		npy_extractor.ipynb
pyannote_reproduce.ipynb		pyannote_reproduce.ipynb
pyannote_reproduce_dev.ipynb		pyannote_reproduce_dev.ipynb
spec-file.txt		spec-file.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

License

hedonistrh/SpChangeDetect

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages