speech-recognition-with-python

Materials for "Speech Recognition with Python" lecture at PyConPL 2023 conference.

Materials include application examples of the following tools:

SpeechRecognition (Python module supporting several speech-to-text engines and APIs)
AssemblyAI (API)
OpenAI's Whisper (speech-to-text model)
Transformers (pretrained speech-to-text models)

Instructions for running a workshop using Google Colab

Go to speech_recognition_with_python.ipynb and click on Open in Colab button on top of the notebook.
Copy .wav files from audio_filesdirectory or Google Drive folder into your personal Google Drive. Suggested path of the directory with the audio files is: Colab Notebooks/speech_recognition_with_python. Note: If you will be storing these files in a different location, be sure to change the PATH constant in the notebook.
Mount your Google Drive to your Google Colab notebook.
And... that's all! 🥳 Have a great learning experience!

Please note: To use AssemblyAI you need to create your own account on https://www.assemblyai.com/. After creating your account, you will receive an AssemblyAI API Key, which you need to copy into notebook (change the value of ASSEMBLY_AI_API_KEY constant in the notebook).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
audio_files		audio_files
README.md		README.md
pyconopl2023_speech_recognition_with_python.ipynb		pyconopl2023_speech_recognition_with_python.ipynb
slides.pdf		slides.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech-recognition-with-python

Instructions for running a workshop using Google Colab

About

Languages

mpfmorawski/pyconpl2023-speech-recognition

Folders and files

Latest commit

History

Repository files navigation

speech-recognition-with-python

Instructions for running a workshop using Google Colab

About

Topics

Resources

Stars

Watchers

Forks

Languages