Highlight and speak text from your Obsidian notes. Converts text to audio using lifelike voices from OpenAI.
tts-interaction-audio.mp4
Just add your OpenAI API key. Choose from 6 voices. OpenAI charges Audio at $0.015 per 1,000 characters. That's around $12 for the ~800,000 character text of A Tale of Two Cities.
Visual Feedback: Active sentence is highlighted and updated as playback progresses.
Listen immediately: Audio is streamed sentence-by-sentence. Jump back and forth by skipping by sentence.
Variable Speeds: On device playback rate adjustor for improved audio quality.
Caching: Audio is cached in your vault to reduce costs, and automatically removed. Cache duration is configurable. Audio may be cached device local or in a vault directory.
<img src="docs/cache-settings.png" width"400" >
Export and Embed Audio: Quickly export to audio files: export audio files from selection, or embed audio by pasting text from your clipboard.
Play text from anywhere: Lots of commands. Play text to speech directly from your clipboard.
OS Integration: Integrates with your mobile phone to play while locked. Pause/Play with OS controls on desktop.
You can also run alternate models if you have OpenAI compatible API server that has an /v1/audio/speech
endpoint. For example openedai-speech. Just configure the url in the plugin settings