Evaluating Speaker Identity Coding in Self-supervised Models (SSMs) and Humans

Masters Project Done by: Gasser Elbanna

Supervisor: Satrajit S. Ghosh

EPFL Advisor: Antoine Bosselut

This repository hosts all scripts developed for this project. As the project comprises three chapters, each directory contains the analysis carried out in the chapter.

Chapter 1:

The first chapter of the project is aimed to assess the suitability of SSMs for the purpose of speaker identification. In order to achieve this aim, we address the following research questions to carry out this assessment:

Are self-supervised models good candidates to study speaker identity coding?
What aspects of speech do self-supervised models encode?
What are the models’ invariances and equivariances when recognizing a speaker?

Chapter 2:

In this chapter, we study the models’ encoding spaces as analogous to the perceptual space of humans. Here are the research questions we are tackling in this chapter:

Is there a correlation between linear distances computed in the embeddings space and theperceptual space of humans?
Does learnable decision models explain human behavior better than linear distance metrics?
What are the commonalities and differences between the representational spaces of the models and humans?

Chapter 3:

Taking a step further, in this chapter, we aim to investigate the correspondence between models’ encoding spaces and human neural representational space. The main question we ask in this chapter:

Where is the information content of speech models best represented in the brain?

Further details regarding the code scripts are provided in the directory of each chapter.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
chapter1		chapter1
chapter2		chapter2
chapter3		chapter3
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluating Speaker Identity Coding in Self-supervised Models (SSMs) and Humans

Masters Project Done by: Gasser Elbanna

Supervisor: Satrajit S. Ghosh

EPFL Advisor: Antoine Bosselut

Chapter 1:

Chapter 2:

Chapter 3:

About

Releases

Packages

Languages

sensein/speaker_identity_perception

Folders and files

Latest commit

History

Repository files navigation

Evaluating Speaker Identity Coding in Self-supervised Models (SSMs) and Humans

Masters Project Done by: Gasser Elbanna

Supervisor: Satrajit S. Ghosh

EPFL Advisor: Antoine Bosselut

Chapter 1:

Chapter 2:

Chapter 3:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages