Project

Instructions

Must make dir called model_output

Dataset

spoken language dataset. Be mindful that it's a 16 GB dataset

Extra optional tools

You can check this thread of jupyter lab vs. jupyter notebooks

pip3 install jupyterlab

pip3 install notebook

These tools were only used to run .ipynb files and facilitate visualizations!

Jupyterlab guide

Python script to jupyter-notebook converter

Exploratory steps

How did we clean up the files?

ls <dir> | grep -o '.....$' | uniq
<dir> | grep -o '^es.*'  # finds the spanish ones

For our work, we used the test set found in local dirs such as

/media/andres/2D2DA2454B8413B5/test/test/

The final version is the file_cleaner script found in this dir. That one copies the Spanish files to a new given dir as its second argument

Theory

Tutorial on mel spectograms

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
speech_sources		speech_sources
.gitignore		.gitignore
1Loss_over_epochs.jpg		1Loss_over_epochs.jpg
All_model_losses.jpg		All_model_losses.jpg
Notebook_for_Paul.ipynb		Notebook_for_Paul.ipynb
README.md		README.md
dataloader.py		dataloader.py
eval.py		eval.py
experiment_runner.py		experiment_runner.py
file_cleaner.sh		file_cleaner.sh
model.py		model.py
requirements.txt		requirements.txt
train.py		train.py
tsne-1d.png		tsne-1d.png
tsne-2d.png		tsne-2d.png
tsne-3d.png		tsne-3d.png
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project

Instructions

Dataset

Extra optional tools

Exploratory steps

Theory

About

Releases

Packages

Contributors 3

Languages

Zappandy/spoken_language_detector

Folders and files

Latest commit

History

Repository files navigation

Project

Instructions

Dataset

Extra optional tools

Exploratory steps

Theory

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages