Search...Search plugins and themes...
⌘K
Sign in
  • Get started
  • Download
  • Pricing
  • Enterprise
  • Account
  • Obsidian
  • Overview
  • Sync
  • Publish
  • Canvas
  • Mobile
  • Web Clipper
  • CLI
  • Learn
  • Help
  • Developers
  • Changelog
  • About
  • Roadmap
  • Blog
  • Resources
  • System status
  • License overview
  • Terms of service
  • Privacy policy
  • Security
  • Community
  • Plugins
  • Themes
  • Discord
  • Forum / 中文论坛
  • Merch store
  • Brand guidelines
Follow us
DiscordTwitterBlueskyThreadsMastodonYouTubeGitHub
© 2026 Obsidian

Whisper

nikdanilovnikdanilov48k downloads

Speech-to-text using OpenAI Whisper.

Add to Obsidian
  • Overview
  • Scorecard
  • Updates26

Record or upload audio, transcribe with Whisper, and optionally post-process the result with an LLM. Works on desktop and mobile.

Works with OpenAI, Groq, Azure, or any other Whisper-compatible API.

Quick Start

  1. Install from Settings → Community Plugins → search "Whisper"
  2. Add your API key in the plugin settings
  3. Open a note, press Alt + Q, speak, press Alt + Q again

The transcription appears at your cursor.

Usage

Record — click the mic icon in the sidebar, or press Alt + Q to start/stop.

Upload — command palette → Upload audio file (mp3, mp4, m4a, wav, webm, ogg).

Right-click — right-click any audio file in your vault → Transcribe audio file.

All commands can be assigned custom hotkeys in Obsidian's hotkey settings:

  • Start/stop recording (Alt + Q by default)
  • Pause/resume recording
  • Open recording controls
  • Upload audio file

Automation

Trigger from iOS Shortcuts, Alfred, or any tool that can open URLs:

obsidian://whisper                 open controls
obsidian://whisper?command=start   start recording
obsidian://whisper?command=stop    stop and transcribe
obsidian://whisper?command=pause   pause/resume
obsidian://whisper?command=cancel  discard recording

Post-Processing

Enable Post-processing in settings to run transcriptions through an LLM — fix grammar, remove filler words, format as markdown, extract action items.

Supports Claude, GPT, or any OpenAI-compatible endpoint (Ollama, LM Studio, etc.).

You can also enable Auto-generate title to create descriptive filenames for your notes.

Note Templates

When Create note file is enabled, you can customize the filename and content using template variables:

Variable Example
{{title}} Meeting Notes
{{audioFile}} recordings/2026-04-05.webm
{{transcription}} the transcribed text
{{date}} 2026-04-05
{{time}} 14-30-00
{{datetime}} 2026-04-05 14:30:00

Example note template:

# {{title}}
![[{{audioFile}}]]

{{transcription}}

Use ![[{{audioFile}}]] to embed audio (playable) or [[{{audioFile}}]] to link.

Manual Installation

Download manifest.json, main.js, styles.css from releases into .obsidian/plugins/whisper/ in your vault.

Contributing

Issues and PRs welcome — GitHub Issues


Buy me a coffee · @nikdanilov_

79%
HealthExcellent
ReviewSatisfactory
About
Record or upload audio on desktop or mobile and transcribe using Whisper-compatible APIs (OpenAI, Groq, Azure, or local endpoints). Post-process results with an LLM to clean grammar, remove filler words, format as Markdown, extract action items, and auto-generate titled, templated notes with embedded audio.
AIAttachmentsTemplating
Details
Current version
1.9.1
Last updated
2 months ago
Created
3 years ago
Updates
26 releases
Downloads
48k
Compatible with
Obsidian 1.12.7+
Platforms
Desktop, Mobile
License
MIT
Report bugRequest featureReport plugin
Sponsor
Ko-fi
Author
nikdanilovnikdanilov
nikdanilov.com
GitHubnikdanilov
  1. Community
  2. Plugins
  3. AI
  4. Whisper

Related plugins

Local GPT

Local Ollama and OpenAI-like GPT's assistance for maximum privacy and offline access.

Nexus AI Chat Importer

Import AI chat conversations from ChatGPT, Claude, and Mistral Vibe (formerly Le Chat) exports into Obsidian as clean, readable Markdown files.

Smart Templates

AI powered templates for generating structured content. Works with Local Models, Anthropic Claude, Gemini, OpenAI & more.

Star Gate

Clip web pages into Obsidian with a multi-tab browser, AI analysis templates, and content capture.

Templater

Create and use dynamic templates.

Smart Connections

Find related notes and excerpts while writing. Your link building copilot displays relevant content in graph + list view. A local embedding model powers semantic search. Zero setup. No API key.

Copilot

Your AI Copilot: Chat with Your Second Brain, Learn Faster, Work Smarter.

QuickAdd

Quickly add new notes or content to your vault.

Claudian

Embeds Claude Code/Codex as an AI collaborator in your vault. Your vault becomes agent's working directory, giving it full agentic capabilities: file read/write, search, bash commands, and multi-step workflows.

Share Note

Instantly share/publish a note, with the full theme and content exactly like you see in Obsidian. Data is shared encrypted by default, and only you and the person you send it to have the key.