nilukush/bhashan2pathtak

bhashan2pathtak

Simple Python application to convert speech into text using Wit.ai

Prerequisites for macOS

  • brew install portaudio
  • brew install flac

Features

  • Continuous speech recognition
  • Improved accuracy using Wit.ai
  • Ambient noise adjustment
  • Error handling and user-friendly feedback

Setup

Before using bhashan2pathtak, you need to set up a Wit.ai account and obtain a token:

  1. Go to https://wit.ai/ and create an account if you haven't already.
  2. Create a new Wit.ai app and copy your Client Access Token.
  3. Set your token using one of these methods:

    a. Set an environment variable:

        export WIT_AI_TOKEN=your_token_here

    b. Create a config.json file in the directory where you'll run the application, with the following content:

        {
            "WIT_AI_TOKEN": "your_token_here"
        }
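
For illustration, the two token sources described above could be resolved roughly as follows. This is a sketch under assumptions, not the package's actual code: the function name load_wit_token is hypothetical, and checking the environment variable before config.json is an assumed precedence that the README does not specify.

```python
import json
import os
from pathlib import Path


def load_wit_token(config_path="config.json", env=None):
    """Return the Wit.ai token from the environment or from config.json.

    Assumed precedence: WIT_AI_TOKEN in the environment wins over the file.
    """
    env = os.environ if env is None else env

    # Method a: environment variable.
    token = env.get("WIT_AI_TOKEN")
    if token:
        return token

    # Method b: config.json in the directory the application runs from.
    path = Path(config_path)
    if path.is_file():
        with path.open(encoding="utf-8") as handle:
            token = json.load(handle).get("WIT_AI_TOKEN")
        if token:
            return token

    raise RuntimeError(
        "No Wit.ai token found: set WIT_AI_TOKEN or create config.json"
    )
```

Raising a clear error when neither source is present tends to be more helpful than letting a later request to Wit.ai fail with an authentication error.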

Installation

To use bhashan2pathtak as a package:

  1. Install the package:

    pip install bhashan2pathtak
    
  2. Run the application:

    bhashan2pathtak
    

Note: You still need to configure your Wit.ai token as described in the Setup section.

Development Setup

  1. Clone this repository
  2. Install the required packages:
    pip install -r requirements.txt
    
    Note: This project uses PyAudio 0.2.14. If you encounter issues with installation, try upgrading to this version.
  3. Sign up for a Wit.ai account and create a new app to get an access token, as described in the Setup section

Development Configuration

To run this application, you need to provide your Wit.ai token. You have two options:

  1. Environment Variable: Set the WIT_AI_TOKEN environment variable:

    export WIT_AI_TOKEN=your_wit_ai_token_here
    
  2. Configuration File: Copy config.json.example to config.json and add your Wit.ai token:

    cp config.json.example config.json
    

    Then edit config.json and replace your_wit_ai_token_here with your actual token.

Important: Never commit your config.json file to the repository. It's listed in .gitignore to prevent accidental commits.

How to run

After setting up the configuration:

python3 speech_to_text.py

Troubleshooting

If you encounter any issues with PyAudio, make sure you have version 0.2.14 installed:

pip install PyAudio==0.2.14
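
To see which PyAudio version is actually installed before reinstalling, a short check along these lines may help. It is only a sketch: the helper name pyaudio_status is hypothetical, and it relies on the standard-library importlib.metadata module (Python 3.8+).

```python
from importlib import metadata

REQUIRED_PYAUDIO = "0.2.14"  # version named in this README


def pyaudio_status(required=REQUIRED_PYAUDIO):
    """Describe whether the installed PyAudio matches the required version."""
    try:
        installed = metadata.version("PyAudio")
    except metadata.PackageNotFoundError:
        return f"PyAudio is not installed; run: pip install PyAudio=={required}"
    if installed != required:
        return (
            f"PyAudio {installed} is installed; "
            f"run: pip install PyAudio=={required}"
        )
    return f"PyAudio {required} is installed"


if __name__ == "__main__":
    print(pyaudio_status())
```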

For any other issues, please check the Wit.ai documentation or open an issue in this repository.
