Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The correspondence of audio snippets between different emotions #12

Open
cyj907 opened this issue Jun 18, 2021 · 5 comments
Open

The correspondence of audio snippets between different emotions #12

cyj907 opened this issue Jun 18, 2021 · 5 comments

Comments

@cyj907
Copy link

cyj907 commented Jun 18, 2021

Hi,

I downloaded part of the datasets and found that the correspondence of the audio snippets are not arranged as what I have expected.

I thought the number in the filename indicates the content of the audio. E.g. 001.mp4 in disgusted should be the same content as 001.mp4 in neutral. But unfortunately, they are not the same in M003. And I don't know why 30 snippets are provided for emotions other than neutral, but there are 40 snippets for neutral.

Could you explain to me why is it? And could you provide the correspondence relations of different snippets?
It is really hard to use your dataset if the correspondence are not provided.

Thanks

@uniBruce
Copy link
Owner

Hi,

I downloaded part of the datasets and found that the correspondence of the audio snippets are not arranged as what I have expected.

I thought the number in the filename indicates the content of the audio. E.g. 001.mp4 in disgusted should be the same content as 001.mp4 in neutral. But unfortunately, they are not the same in M003. And I don't know why 30 snippets are provided for emotions other than neutral, but there are 40 snippets for neutral.

Could you explain to me why is it? And could you provide the correspondence relations of different snippets?
It is really hard to use your dataset if the correspondence are not provided.

Thanks

Sorry about this problem. It seems that M003 has got one audio&video lost, which, I think, makes the correspondence a mess. I will contact the related staff to check with that. Normally, the correspondence you want is in the supplementary of the paper (i.e., the orders of the sentences). In terms of the number of emotions, you could try to read the supplementary as well. Actually, we capture emotional data (except neutral) in three intensities, which means the amount of emotional data will be much larger than that of the neutral class if we don't capture more sentences for neutral.

@cyj907
Copy link
Author

cyj907 commented Jun 22, 2021

Thank you for the reply.

I have read the supp and realize why there are different numbers of snippets in neutral and other emotions. But there still exists other problems.

I have carefully listened to some of the audio snippets in the datasets. I found that not only the order of the sentences are wrong, some of the snippets actually contains more than one sentence. e.g. The actor read two sentences within one snippet. So I used asr to find the correspondence of different snippets........it is really painful.

If you have updated your datasets, would you please let me know? I am not sure if I have found the correspondence 100% correctly by asr.

@uniBruce
Copy link
Owner

Thank you for the reply.

I have read the supp and realize why there are different numbers of snippets in neutral and other emotions. But there still exists other problems.

I have carefully listened to some of the audio snippets in the datasets. I found that not only the order of the sentences are wrong, some of the snippets actually contains more than one sentence. e.g. The actor read two sentences within one snippet. So I used asr to find the correspondence of different snippets........it is really painful.

If you have updated your datasets, would you please let me know? I am not sure if I have found the correspondence 100% correctly by asr.

Ok, I will contact the annotation team asap. It seems that the mistakes are more likely from their work.

@hjzzju
Copy link

hjzzju commented Aug 4, 2021

Any change? it seems that the correspondence is still not correct now

@filby89
Copy link

filby89 commented Jul 31, 2022

Hi,
in my experience the correspondence is still broken, and some actors follow a completely different naming convention (the filenames go up to 60, skipping various numbers between them. I used MEAD in a research project of mine and annotated the transcriptions for all actors. You can find it here:

https://github.com/filby89/spectre/blob/master/data/list_full_mead_annotated.txt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants