This directory contains the source code and a notebook for the speech-to-text experiment, part of the Societal Biases in Slovak AI project.
- Install the FFmpeg library if it is not already installed. On Ubuntu, you can install it as follows:
apt install ffmpeg
- Create a conda environment from the conda.yml file and activate it:
conda env create -f conda.yml
conda activate biases-sk-speech
Alternatively, you may install the required dependencies via pip:
pip install -r requirements.txt
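Before running the notebook, it can help to confirm that FFmpeg is visible from the active environment. The following is a minimal sketch using only the Python standard library; the helper name `find_executable` is hypothetical and not part of this repository.

```python
import shutil


def find_executable(name: str = "ffmpeg"):
    """Return the full path of `name` if it is on PATH, otherwise None.

    Hypothetical helper for illustration; not part of this repository.
    """
    return shutil.which(name)


if __name__ == "__main__":
    path = find_executable()
    if path is None:
        print("ffmpeg not found on PATH; install it before running the notebook")
    else:
        print(f"ffmpeg found at {path}")
```

Running the script inside the activated environment prints either the resolved path or a reminder to install FFmpeg.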
The [notebook] contains usage examples that show how to download and preprocess the datasets and how to perform speech recognition. Because the datasets are large and prediction takes a long time, we recommend running the code on a dedicated machine.
The notebook also contains a brief analysis of the examined datasets and plots representing speech recognition results per dataset and category.
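To illustrate what a per-category result might look like, here is a rough sketch that aggregates word error rate (WER) by category. This README does not state which metric the notebook uses, so WER is an assumption here, and the function names, category labels, and sample sentences are made up for illustration only.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            # deletion, insertion, substitution (or match)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def wer_by_category(records):
    """Aggregate WER per category.

    records: iterable of (category, reference, hypothesis) strings.
    Returns {category: total word errors / total reference words}.
    Assumption: WER is the metric of interest; the notebook may differ.
    """
    errors = defaultdict(int)
    words = defaultdict(int)
    for category, reference, hypothesis in records:
        ref, hyp = reference.split(), hypothesis.split()
        errors[category] += edit_distance(ref, hyp)
        words[category] += len(ref)
    return {c: errors[c] / words[c] for c in words if words[c] > 0}


if __name__ == "__main__":
    # Made-up transcripts; the category names are placeholders.
    sample = [
        ("group_a", "dobrý deň svet", "dobrý deň svet"),
        ("group_b", "jedna dva tri štyri", "jedna dva päť"),
    ]
    for category, wer in wer_by_category(sample).items():
        print(f"{category}: WER = {wer:.2f}")
```

Summing errors and reference words per category before dividing weights longer utterances more heavily than averaging per-utterance rates would; either choice is defensible, and the notebook's own approach should take precedence.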