Skip to content

adrianlyjak/obsidian-aloud-tts

Repository files navigation

Aloud Text To Speech Obsidian Plugin

Highlight and speak text from your Obsidian notes. Converts text to audio using lifelike voices from OpenAI.

tts-interaction-audio.mp4

Just add your OpenAI API key. Choose from 6 voices. OpenAI charges Audio at $0.015 per 1,000 characters. That's around $12 for the ~800,000 character text of A Tale of Two Cities.

Settings View

Features:

Visual Feedback: Active sentence is highlighted and updated as playback progresses.

Listen immediately: Audio is streamed sentence-by-sentence. Jump back and forth by skipping by sentence.

Variable Speeds: On device playback rate adjustor for improved audio quality.

Caching: Audio is cached in your vault to reduce costs, and automatically removed. Cache duration is configurable. Audio may be cached device local or in a vault directory.

<img src="docs/cache-settings.png" width"400" >

Export and Embed Audio: Quickly export to audio files: export audio files from selection, or embed audio by pasting text from your clipboard.

Play text from anywhere: Lots of commands. Play text to speech directly from your clipboard.

OS Integration: Integrates with your mobile phone to play while locked. Pause/Play with OS controls on desktop.

Alternate TTS Models

You can also run alternate models if you have OpenAI compatible API server that has an /v1/audio/speech endpoint. For example openedai-speech. Just configure the url in the plugin settings