sensein/speaker_identity_perception


Evaluating Speaker Identity Coding in Self-supervised Models (SSMs) and Humans

Master's project by: Gasser Elbanna

Supervisor: Satrajit S. Ghosh

EPFL Advisor: Antoine Bosselut

This repository hosts all scripts developed for this project. The project comprises three chapters, and each chapter's directory contains the analysis carried out for that chapter.

Chapter 1:

The first chapter assesses the suitability of SSMs for speaker identification. To carry out this assessment, we address the following research questions:

  • Are self-supervised models good candidates to study speaker identity coding?
  • What aspects of speech do self-supervised models encode?
  • What are the models’ invariances and equivariances when recognizing a speaker?

Chapter 2:

In this chapter, we study the models’ encoding spaces as analogues of the human perceptual space. We address the following research questions:

  • Is there a correlation between linear distances computed in the embedding space and in the perceptual space of humans?
  • Do learnable decision models explain human behavior better than linear distance metrics?
  • What are the commonalities and differences between the representational spaces of the models and humans?
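
As a rough illustration of the first question above, the sketch below correlates pairwise distances between speaker embeddings with human dissimilarity ratings for the same speaker pairs. It is not taken from the repository's scripts: the choice of cosine distance and Spearman rank correlation, the toy embeddings, the `human_ratings` values, and all function names are assumptions made for the example.

```python
import math
from itertools import combinations


def cosine_distance(u, v):
    """Cosine distance (1 - cosine similarity) between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def pairwise_distances(embeddings):
    """Distances for all unordered pairs (i, j) with i < j, in combinations() order."""
    return [
        cosine_distance(embeddings[i], embeddings[j])
        for i, j in combinations(range(len(embeddings)), 2)
    ]


def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


if __name__ == "__main__":
    # Hypothetical 2-D embeddings for four speakers (toy data, not real model output).
    embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    # Hypothetical human dissimilarity ratings, one per speaker pair, in the
    # same pair order as pairwise_distances(): (0,1), (0,2), (0,3), (1,2), (1,3), (2,3).
    human_ratings = [0.1, 0.9, 0.8, 0.85, 0.7, 0.05]

    model_distances = pairwise_distances(embeddings)
    print("Spearman correlation:", spearman(model_distances, human_ratings))
```

A rank correlation is used here because it only assumes a monotonic relation between model distances and human judgments; other distance metrics or correlation measures could be substituted in the same structure.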

Chapter 3:

Taking this a step further, this chapter investigates the correspondence between the models’ encoding spaces and the human neural representational space. The main question we ask in this chapter is:

  • Where is the information content of speech models best represented in the brain?

Further details regarding the code scripts are provided in the directory of each chapter.
