Aloud

adrianlyjak, 15k downloads

Speak text from your notes. Converts text to speech in real time using lifelike voices from OpenAI.

Highlight and speak text from your Obsidian notes. Converts text to audio using lifelike voices from various providers.

Add your API key from a supported provider, then choose from the available voices.

Supported TTS Models:

OpenAI: (e.g., tts-1, tts-1-hd, gpt-4o-mini). OpenAI charges for audio at $0.015 per 1,000 characters.
Google Gemini: (Gemini 2.5 series)
Hume AI: (Hume voices with customization)
ElevenLabs: (Model selection, voice selection, stability/similarity options)
Fish Audio: (S2 Pro/S1 models with custom voice model IDs)
Azure Speech Services: (Region, voice and output format selection)
MiniMax: (speech-2.6-hd, speech-2.6-turbo, speech-02-hd, speech-02-turbo, speech-01-hd, speech-01-turbo)
AWS Polly: (Region, voice, neural/standard engine, output format)
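
To make the OpenAI rate quoted in the list above concrete, here is a minimal sketch that estimates the cost of speaking a note from its character count. It assumes the flat $0.015 per 1,000 characters figure applies to the whole text; the function name is hypothetical, and actual billing may differ by model or provider.

```python
from decimal import Decimal

# Rate quoted above for OpenAI audio (assumption: it applies uniformly).
USD_PER_1K_CHARS = Decimal("0.015")

def estimate_cost_usd(text: str) -> Decimal:
    """Estimate the cost of synthesizing `text` at the quoted rate."""
    return (Decimal(len(text)) / Decimal(1000)) * USD_PER_1K_CHARS

note = "a" * 5000  # stand-in for a 5,000-character note
print(estimate_cost_usd(note))  # 0.075
```

Because cached audio is not re-synthesized, replaying the same text should not add to this estimate.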

You can also configure a custom API endpoint if you have an OpenAI-compatible API server that exposes a /v1/audio/speech endpoint, for example openedai-speech.

Features:

Visual Feedback: The active sentence is highlighted, and the highlight moves as playback progresses.

Listen immediately: Audio is streamed sentence-by-sentence. Skip backward or forward one sentence at a time.

Variable Speeds: On-device playback rate adjuster for improved audio quality.

Caching: Audio is cached to reduce costs and automatically removed when it expires. Cache duration is configurable. Audio can be cached locally on the device or in a vault directory.

Export and Embed Audio: Quickly export audio files from a selection, or embed audio by pasting text from your clipboard.

Play text from anywhere: Many commands are available, including playing text to speech directly from your clipboard.

OS Integration: Keeps playing on your mobile phone while it is locked. Pause and play with OS media controls on desktop.
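
The sentence-by-sentence streaming and expiring cache described in the features above can be pictured with a small sketch. This is not the plugin's implementation: the regex splitter is deliberately naive, the cache lives in memory rather than in the vault or on the device, and every name here (split_sentences, AudioCache, stream_audio) is hypothetical.

```python
import hashlib
import re
import time
from typing import Callable, Dict, Iterator, List, Optional, Tuple

# Naive rule: a sentence ends at ., !, or ? followed by whitespace.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")

def split_sentences(text: str) -> List[str]:
    """Split text into sentences, dropping empty pieces."""
    return [s for s in _SENTENCE_BOUNDARY.split(text.strip()) if s]

class AudioCache:
    """In-memory cache whose entries expire after ttl_seconds."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, bytes]] = {}

    @staticmethod
    def key(voice: str, sentence: str) -> str:
        """Derive a stable key from the voice and the sentence text."""
        return hashlib.sha256(f"{voice}\n{sentence}".encode("utf-8")).hexdigest()

    def get(self, key: str) -> Optional[bytes]:
        entry = self._entries.get(key)
        if entry is None:
            return None
        stored_at, audio = entry
        if self._clock() - stored_at > self._ttl:
            del self._entries[key]  # expired, so remove it
            return None
        return audio

    def put(self, key: str, audio: bytes) -> None:
        self._entries[key] = (self._clock(), audio)

def stream_audio(text: str, voice: str,
                 synthesize: Callable[[str], bytes],
                 cache: AudioCache) -> Iterator[bytes]:
    """Yield audio one sentence at a time, calling the provider only on a cache miss."""
    for sentence in split_sentences(text):
        k = cache.key(voice, sentence)
        audio = cache.get(k)
        if audio is None:
            audio = synthesize(sentence)
            cache.put(k, audio)
        yield audio

# A fake synthesizer stands in for a real provider call.
calls: List[str] = []

def fake_synthesize(sentence: str) -> bytes:
    calls.append(sentence)
    return sentence.encode("utf-8")

cache = AudioCache(ttl_seconds=3600)
list(stream_audio("Hello there. How are you?", "alloy", fake_synthesize, cache))
list(stream_audio("Hello there. How are you?", "alloy", fake_synthesize, cache))
print(len(calls))  # 2: the second pass is served entirely from the cache
```

Yielding per sentence means playback can start as soon as the first sentence is ready, and keying the cache by voice and sentence lets a replay skip the provider call.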

Alternate TTS Models

You can also run alternate models if you have an OpenAI‑compatible API server that exposes /v1/audio/speech (for example, openedai-speech). Configure the URL and API key in the plugin settings under “OpenAI Compatible (Advanced)”.
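
For readers checking whether a server is compatible, the following sketch shows the shape of a request to /v1/audio/speech as defined by the OpenAI API: a JSON body with model, input, and voice, plus a bearer token. The base URL, API key, and helper names are placeholders, and whether a given server accepts the tts-1 model or the alloy voice is an assumption to verify against that server's documentation.

```python
import json
import urllib.request

def build_speech_request(base_url: str, api_key: str, text: str,
                         model: str = "tts-1",
                         voice: str = "alloy") -> urllib.request.Request:
    """Build a POST to the OpenAI-style speech endpoint (nothing is sent here)."""
    body = json.dumps({"model": model, "input": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/speech",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

def synthesize(request: urllib.request.Request) -> bytes:
    """Send the request and return the raw audio bytes from the response."""
    with urllib.request.urlopen(request) as response:
        return response.read()

# Hypothetical local server and key; replace with your own values.
req = build_speech_request("http://localhost:8000", "sk-example", "Hello from my notes.")
print(req.full_url)  # http://localhost:8000/v1/audio/speech
```

Calling synthesize(req) against a running server should return audio bytes; if that works from a script, the same URL and key are likely to work in the plugin settings.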

Support

If you find this plugin useful, you can support development on Ko-fi. Donations help cover API keys for testing provider integrations.

73%

Health: Excellent

Review: Caution

About

Highlight and play text from notes using lifelike voices from OpenAI, Google Gemini, ElevenLabs, Azure, AWS Polly, Hume, MiniMax or a custom OpenAI-compatible endpoint. Stream audio sentence-by-sentence with active-sentence highlighting, adjustable speed, local caching to reduce cost, quick export/embed, clipboard playback and OS media control support.

AI, Attachments, Export

Details

Payments: Optional
Current version: 0.15.1
Last updated: 3 weeks ago
Created: 2 years ago
Updates: 54 releases
Downloads: 15k
Compatible with: Obsidian 0.15.0+
Platforms: Desktop, Mobile
License: MIT
Sponsor: Ko-fi
Author: adrianlyjak (github.com/adrianlyjak)