NHNN Implementation

This is the PyTorch implementation for the paper "Accounting for Variations in Speech Emotion Recognition with NonParametric Hierarchical Neural Network".

The included training files are for IEMOCAP; the PRIORI datasets are not public. The metadata is stored in data.csv, which lists the audio segment ID, subject ID, gender label, and emotion label (valence rating). The eGeMAPS features extracted from the audio segments are stored in features.csv.
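As a quick sanity check on the metadata described above, the sketch below reads a CSV such as data.csv with Python's standard csv module and counts segments per subject. The column names `segment_id` and `subject_id` are assumptions made for illustration; check the header row of data.csv for the actual names.

```python
import csv
from collections import Counter

# Hypothetical column names; replace them with the real header names in data.csv.
SEGMENT_COL = "segment_id"
SUBJECT_COL = "subject_id"


def load_metadata(path, segment_col=SEGMENT_COL):
    """Read a metadata CSV into a dict keyed by audio segment id."""
    with open(path, newline="") as f:
        return {row[segment_col]: row for row in csv.DictReader(f)}


def segments_per_subject(metadata, subject_col=SUBJECT_COL):
    """Count how many audio segments each subject contributes."""
    return Counter(row[subject_col] for row in metadata.values())
```

For example, `segments_per_subject(load_metadata("data.csv"))` gives a per-subject count, provided the two column names have been adjusted to match the file.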

The Log-MFB features can be downloaded here (around 7 GB). They have already been extracted and concatenated into a single NumPy array.

To train the CNN, run

python3 Train_CNN.py

To train the NHNN, run

python3 Train_NHNN.py *version*

Here, version must be either FC or FC+Conv.
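The restriction on the version argument can be illustrated with a small argparse sketch. This is not taken from the repository's training script; it only shows one way a positional argument limited to FC or FC+Conv could be validated, and the names `parse_args` and `VALID_VERSIONS` are hypothetical.

```python
import argparse

# The two values the README allows for the version argument.
VALID_VERSIONS = ("FC", "FC+Conv")


def parse_args(argv=None):
    """Parse a single positional 'version' argument restricted to VALID_VERSIONS."""
    parser = argparse.ArgumentParser(description="Train the NHNN model (sketch).")
    parser.add_argument(
        "version",
        choices=VALID_VERSIONS,
        help="NHNN variant to train: FC or FC+Conv",
    )
    return parser.parse_args(argv)
```

With this approach, any other value makes argparse print a usage message and exit instead of starting a training run with an unsupported variant.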

Note that the current NHNN implementation requires a feature encoder. You must therefore train the CNN first; its feature encoder is then used to train the NHNN model.
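The required ordering can be scripted so that NHNN training never starts without the CNN having been trained. The sketch below is a hypothetical wrapper, not part of the repository; it assumes the scripts are named Train_CNN.py and Train_NHNN.py as in the repository's file list and that it is run from the repository root.

```python
import subprocess
import sys

# Script names as they appear in the repository; adjust if yours differ.
CNN_SCRIPT = "Train_CNN.py"
NHNN_SCRIPT = "Train_NHNN.py"


def training_commands(version, python=sys.executable):
    """Return the two training commands in the order they must run."""
    if version not in ("FC", "FC+Conv"):
        raise ValueError("version must be 'FC' or 'FC+Conv'")
    return [
        [python, CNN_SCRIPT],            # step 1: train the CNN feature encoder
        [python, NHNN_SCRIPT, version],  # step 2: train the NHNN on top of it
    ]


def run_pipeline(version):
    """Run both steps; check=True aborts before step 2 if step 1 fails."""
    for cmd in training_commands(version):
        subprocess.run(cmd, check=True)
```

Calling `run_pipeline("FC")` would then run the CNN training followed by the FC variant of the NHNN.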

Please feel free to email [email protected] for any questions.
