Memory Augmented Neural Network for Source Separation

In this project, we implement neural Turing machine (NTM) for sequential signals of speech and noise in presence of different speakers and noise types. NTM is a memory-augmented neural network which is equipped with external memory to learn long sequential data. The information is stored with attention mechanism and read-writing scheme. For more details about NTM, you can refer to Neural Turing Machine. The system architecture and experimental settings are shown in Memory Augmented Neural Network for Source Separation.

Setting

Hardware:
- CPU: Intel Core i7-4930K @3.40 GHz
- RAM: 64 GB DDR3-1600
- GPU: NVIDIA Tesla K20c 6 GB RAM
Tensorflow 0.12
Dataset
- Wall Street Journal Corpus
- Noises are collected from freeSFX and AudioMicro

Result

An example of demixed signal


Mixed signal


Clean signal


Demixed signal

STOI measure on other noises


Seen speakers	Unseen speakers

STOI measure on bus noises


Seen speakers	Unseen speakers

STOI measure on caf noises


Seen speakers	Unseen speakers

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Evaluation		Evaluation
Others		Others
Model.py		Model.py
NTMCell.py		NTMCell.py
README.md		README.md
data_process.py		data_process.py
demo.ipynb		demo.ipynb
ops.py		ops.py
t-sne.py		t-sne.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memory Augmented Neural Network for Source Separation

Setting

Result

About

Releases

Packages

Languages

KWTsou1220/mann-for-speech-separation

Folders and files

Latest commit

History

Repository files navigation

Memory Augmented Neural Network for Source Separation

Setting

Result

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages