EAV: EEG-Audio-Video Dataset for Emotion Recognition in Conversational Contexts

We introduce a multimodal emotion dataset comprising 30-channel electroencephalography (EEG), audio, and video recordings from 42 participants. Each participant engaged in a cue-based conversation scenario eliciting five distinct emotions: neutral (N), anger (A), happiness (H), sadness (S), and calmness (C).

Participants, seated in front of a 27-inch monitor displaying visual stimuli, engaged in paired listen/speak sets with recordings of an experienced actor. The experiment follows a pseudo-random class-iteration sequence: [A, A, C, C, S, S, H, A, C, H, H, S, S, A, A, C, C, H, H, S]. Each participant contributed 200 interactions, for a cumulative total of 8,400 interactions across all participants. Please refer to the paper [TODO: add link] for more details.
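
As a quick sanity check on the counts above (plain arithmetic, using only what the paragraph states):

    # 5 emotions x 2 tasks (listen/speak) x 20 iterations per participant
    per_participant = 5 * 2 * 20
    assert per_participant == 200
    # 42 participants overall
    assert 42 * per_participant == 8400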

Domains

Video

Each 'Video' subfolder contains 200 video clips. Each clip is 20 seconds long, recorded at 30 fps, and shows either a 'listening' or a 'speaking' task. The video data follows the structure [5 emotion classes × 2 tasks × 20 iterations].

File format: .mp4

Baseline performance of DeepFace: Mean ACC = 52.8 %, Mean F1-score = 51.5 %
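
A minimal loading sketch (assuming OpenCV is installed; the file path below is hypothetical, so substitute a real clip from your copy of the dataset):

    import cv2  # pip install opencv-python

    # Hypothetical path; actual folder/file naming may differ in the release.
    cap = cv2.VideoCapture("Video/subject01/clip_001.mp4")
    fps = cap.get(cv2.CAP_PROP_FPS)                     # expected ~30
    n_frames = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    print(f"fps={fps:.1f}, duration={n_frames / fps:.1f} s")  # expected ~20 s

    frames = []
    ok, frame = cap.read()
    while ok:
        frames.append(frame)  # BGR uint8 array, shape (H, W, 3)
        ok, frame = cap.read()
    cap.release()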

Audio

Each 'Audio' subfolder contains 100 audio files. Each file is 20 seconds long and covers only the 'speaking' task. The audio data follows the structure [5 classes × 1 task ('speaking') × 20 conversations].

File format: .wav

Baseline performance of SCNN: Mean ACC = 36.7 %, Mean F1-score = 34.1 %
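
A minimal loading sketch (assuming SciPy; the path is hypothetical, and the audio sampling rate is not stated in this README, so read it from the file):

    from scipy.io import wavfile  # pip install scipy

    # Hypothetical path; actual folder/file naming may differ in the release.
    sr, samples = wavfile.read("Audio/subject01/speaking_001.wav")
    print(sr, samples.shape)  # a 20 s file should give samples.shape[0] == 20 * sr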

EEG

Each 'EEG' subfolder contains two EEG data files. Each instance is 20 seconds long, recorded at an initial sampling rate of 500 Hz. Because recording was continuous, the processed EEG data follows the structure [200 instances × 10,000 time points (20 s × 500 Hz) × 30 channels]. The labels are one-hot encoded, structured as 200 trials × 10 classes (5 emotions × 2 tasks).

File format: .mat

Baseline performance of EEGNet: Mean ACC = 36.7 %, Mean F1-score = 34.1 %

Note that the label information applies across all modalities: all recordings, regardless of modality, were made synchronously, ensuring uniform annotations throughout the dataset.
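
A minimal loading sketch (assuming SciPy; the file name and the variable names inside the .mat file are hypothetical, so inspect the keys first):

    from scipy.io import loadmat
    import numpy as np

    # Hypothetical path; actual folder/file naming may differ in the release.
    mat = loadmat("EEG/subject01/eeg_data.mat")
    print([k for k in mat if not k.startswith("__")])  # list stored variables

    # Per the structure above, expect data of shape (200, 10000, 30)
    # (instances x time points x channels) and one-hot labels of shape (200, 10).
    # eeg = mat["<data_key>"]      # substitute the real key printed above
    # labels = mat["<label_key>"]
    # classes = np.argmax(labels, axis=1)  # 10 classes = 5 emotions x 2 tasks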

Getting Started

  • Create and activate a conda environment
    conda create --name eav python=3.10
    conda activate eav

Installation

  1. Clone the repo
    git clone https://github.com/nubcico/EAV.git
    cd EAV
  2. Install requirements
    pip install -r requirements.txt

Usage

Run the demo:

python demo.py

Training

To train the Transformer for speech emotion recognition, run:

python Dataload_audio.py

Roadmap

  • CNN-based emotion recognition on the Video, Audio, and EEG domains using TensorFlow
  • CNN-based emotion recognition on the Video and EEG domains using PyTorch
  • Transformer-based emotion recognition on the Video, Audio, and EEG domains using PyTorch
  • Create demo file
  • Add .pkl files of preprocessed video data (Feature_vision folder)
  • Add inference files

Contact

Minho Lee - [email protected]

Adai Shomanov - [email protected]

Zhuldyz Kabidenova - [email protected]

Adnan Yazici - [email protected]

License

Distributed under the MIT License. See LICENSE.txt for more information.
