Search...Search plugins and themes...
⌘K
Sign in
  • Get started
  • Download
  • Pricing
  • Enterprise
  • Account
  • Obsidian
  • Overview
  • Sync
  • Publish
  • Canvas
  • Mobile
  • Web Clipper
  • CLI
  • Learn
  • Help
  • Developers
  • Changelog
  • About
  • Roadmap
  • Blog
  • Resources
  • System status
  • License overview
  • Terms of service
  • Privacy policy
  • Security
  • Community
  • Plugins
  • Join the community
  • Discord
  • Forum / 中文论坛
  • Merch store
  • Brand guidelines
Follow us
DiscordTwitterBlueskyThreadsMastodonYouTubeGitHub
© 2026 Obsidian

Whisper

nikdanilovnikdanilov44k downloads

Speech-to-text using OpenAI Whisper.

Add to Obsidian
  • Overview
  • Scorecard
  • Updates26

Record or upload audio, transcribe with Whisper, and optionally post-process the result with an LLM. Works on desktop and mobile.

Works with OpenAI, Groq, Azure, or any other Whisper-compatible API.

Quick Start

  1. Install from Settings → Community Plugins → search "Whisper"
  2. Add your API key in the plugin settings
  3. Open a note, press Alt + Q, speak, press Alt + Q again

The transcription appears at your cursor.

Usage

Record — click the mic icon in the sidebar, or press Alt + Q to start/stop.

Upload — command palette → Upload audio file (mp3, mp4, m4a, wav, webm, ogg).

Right-click — right-click any audio file in your vault → Transcribe audio file.

All commands can be assigned custom hotkeys in Obsidian's hotkey settings:

  • Start/stop recording (Alt + Q by default)
  • Pause/resume recording
  • Open recording controls
  • Upload audio file

Automation

Trigger from iOS Shortcuts, Alfred, or any tool that can open URLs:

obsidian://whisper                 open controls
obsidian://whisper?command=start   start recording
obsidian://whisper?command=stop    stop and transcribe
obsidian://whisper?command=pause   pause/resume
obsidian://whisper?command=cancel  discard recording

Post-Processing

Enable Post-processing in settings to run transcriptions through an LLM — fix grammar, remove filler words, format as markdown, extract action items.

Supports Claude, GPT, or any OpenAI-compatible endpoint (Ollama, LM Studio, etc.).

You can also enable Auto-generate title to create descriptive filenames for your notes.

Note Templates

When Create note file is enabled, you can customize the filename and content using template variables:

Variable Example
{{title}} Meeting Notes
{{audioFile}} recordings/2026-04-05.webm
{{transcription}} the transcribed text
{{date}} 2026-04-05
{{time}} 14-30-00
{{datetime}} 2026-04-05 14:30:00

Example note template:

# {{title}}
![[{{audioFile}}]]

{{transcription}}

Use ![[{{audioFile}}]] to embed audio (playable) or [[{{audioFile}}]] to link.

Manual Installation

Download manifest.json, main.js, styles.css from releases into .obsidian/plugins/whisper/ in your vault.

Contributing

Issues and PRs welcome — GitHub Issues


Buy me a coffee · @nikdanilov_

80%
HealthExcellent
ReviewSatisfactory
About
Record or upload audio on desktop or mobile and transcribe using Whisper-compatible APIs (OpenAI, Groq, Azure, or local endpoints). Post-process results with an LLM to clean grammar, remove filler words, format as Markdown, extract action items, and auto-generate titled, templated notes with embedded audio.
AIAttachmentsTemplating
Details
Current version
1.9.1
Last updated
Last month
Created
3 years ago
Updates
26 releases
Downloads
44k
Compatible with
Obsidian 1.12.7+
License
MIT
Report bugRequest featureReport plugin
Sponsor
Ko-fi
Author
nikdanilovnikdanilov
nikdanilov.com
GitHubnikdanilov
  1. Community
  2. Plugins
  3. AI
  4. Whisper

Related plugins

Local GPT

Local Ollama and OpenAI-like GPT's assistance for maximum privacy and offline access.

Media DB

Query multiple APIs for movies, series, anime, games, music releases and wiki articles, and import them into your vault.

Smart Templates

AI powered templates for generating structured content. Works with Local Models, Anthropic Claude, Gemini, OpenAI & more.

Templater

Create and use dynamic templates.

Smart Connections

AI link discovery copilot. See related notes as you write. Lookup using semantic (vector) search across your vault. Zero-setup local model for embeddings, no API keys, private.

Copilot

Your AI Copilot: Chat with Your Second Brain, Learn Faster, Work Smarter.

Khoj

An AI personal assistant for your digital brain.

Text Generator

Generate text content using GPT-3 (OpenAI).

Smart Composer

AI chat with note context, smart writing assistance, and one-click edits for your vault.

Fantasy Statblocks

Create, manage and view a Fantasy Bestiary with Dungeons and Dragons style statblocks.