Local Dictation

By Alexander Brittain · 3 downloads

Dictate notes with Whisper or Cohere Transcribe; clean up with a local Ollama model. Private, on-device speech-to-text for Obsidian.

Private, on-device speech-to-text for Obsidian. Dictate notes with Whisper or Cohere Transcribe; clean up with a local Ollama model.

Features

  • Cohere Transcribe — a Hugging Face Open ASR Leaderboard-topping engine, running locally.
  • Whisper — mature offline transcription with a range of size/speed options.
  • Silero v6 VAD — enterprise-grade neural voice activity detection for real-time speech boundary detection.
  • Optional Ollama LLM cleanup — polish dictated text with a local LLM.
  • One-click model management — browse, download, and remove models from inside the plugin.
  • Hardware acceleration — Metal on macOS, CUDA on Linux/Windows (Turing-or-newer NVIDIA GPUs).
  • Private and offline — transcription stays on-device. No cloud, no telemetry, no account. Only model downloads need a network.

Platform Support

| Platform | Support Status | Hardware Acceleration |
| --- | --- | --- |
| macOS | Supported | Metal support for Whisper. |
| Linux Native | Supported | CUDA support for Whisper and Cohere on Turing-or-newer NVIDIA GPUs. |
| Linux Flatpak | Supported | CUDA supported on Turing-or-newer NVIDIA GPUs. See Flatpak GPU setup. |
| Windows | Supported | CUDA support for Whisper and Cohere on Turing-or-newer NVIDIA GPUs. See Windows CUDA setup. |

Runtime Dependencies

The CPU sidecar has no GPU runtime dependencies. macOS Whisper uses Metal through system frameworks automatically. Linux and Windows CUDA acceleration needs a Turing-or-newer NVIDIA GPU (RTX 20-series / GTX 16-series or newer) and a driver compatible with CUDA 12.9. Cohere CUDA additionally needs cuDNN 9 runtime libraries; without cuDNN, Cohere falls back to CPU.

See Platform Runtime Dependencies for the full contract.
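
The fallback rules above can be read as a small decision function. The following is a minimal sketch, not the plugin's actual logic: the `HostInfo` shape, the `selectBackend` name, and the use of NVIDIA compute capability 7.5 as the Turing threshold are assumptions made for illustration. It also assumes Cohere runs on CPU on macOS, since Metal is only listed for Whisper.

```typescript
// Hypothetical types for illustration; none of these names come from the plugin.
type Engine = "whisper" | "cohere";
type Platform = "darwin" | "linux" | "win32";
type Backend = "metal" | "cuda" | "cpu";

interface HostInfo {
  platform: Platform;
  // NVIDIA compute capability (7.5 for Turing); undefined if no NVIDIA GPU was found.
  computeCapability?: number;
  // True if the installed driver is compatible with CUDA 12.9.
  driverSupportsCuda129: boolean;
  // True if the cuDNN 9 runtime libraries can be loaded.
  hasCudnn9: boolean;
}

// Turing GPUs (RTX 20-series, GTX 16-series) report compute capability 7.5.
const TURING_COMPUTE_CAPABILITY = 7.5;

function selectBackend(engine: Engine, host: HostInfo): Backend {
  if (host.platform === "darwin") {
    // Metal is documented for Whisper only; Cohere on CPU is an assumption.
    return engine === "whisper" ? "metal" : "cpu";
  }
  const cudaUsable =
    host.computeCapability !== undefined &&
    host.computeCapability >= TURING_COMPUTE_CAPABILITY &&
    host.driverSupportsCuda129;
  if (!cudaUsable) {
    return "cpu";
  }
  // Cohere CUDA additionally needs cuDNN 9; without it, fall back to CPU.
  if (engine === "cohere" && !host.hasCudnn9) {
    return "cpu";
  }
  return "cuda";
}
```

Under these assumptions, a Linux machine with an RTX 20-series card and a suitable driver but no cuDNN would run Whisper on CUDA and Cohere on CPU.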

Quick Start

  1. Install Local Dictation from Obsidian's Community Plugins.
  2. Open Settings → Local Dictation and install the sidecar from the plugin settings. The plugin downloads it from the matching GitHub Release, verifies it, and stores it under the plugin's bin/ directory.
  3. Click Manage models and install a model.
  4. Open a note and start dictation from the ribbon button or via the Local Dictation: Start dictation session command.

The sidecar and model downloads are separate on purpose: Obsidian installs the plugin UI, the plugin installs the native sidecar, and the sidecar manages model downloads. Transcription runs locally after setup.
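
The README does not say how the downloaded sidecar is verified. One common approach is to compare a SHA-256 digest of the archive against a published value, sketched below. The `sha256Hex` and `verifyArchive` helpers and the choice of SHA-256 are assumptions for illustration, not the plugin's confirmed mechanism.

```typescript
import { createHash } from "node:crypto";

// Hypothetical helper: hex-encoded SHA-256 digest of the archive bytes.
function sha256Hex(data: Uint8Array): string {
  return createHash("sha256").update(data).digest("hex");
}

// Hypothetical check: throws if the archive does not match the expected digest.
// Where the expected digest comes from (for example, a release asset) is an assumption.
function verifyArchive(archive: Uint8Array, expectedSha256: string): void {
  const expected = expectedSha256.trim().toLowerCase();
  const actual = sha256Hex(archive);
  if (actual !== expected) {
    throw new Error(`Checksum mismatch: expected ${expected}, got ${actual}`);
  }
}
```

In a flow like this, the archive would only be unpacked into the plugin's bin/ directory after `verifyArchive` returns without throwing.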

Privacy & system access

Local Dictation runs transcription on your own machine. To do that, the plugin reaches beyond Obsidian's vault API in two specific ways, filesystem access and process execution, and it makes a small number of network requests for downloads. The first two are surfaced by Obsidian's community-plugin review as fs and child_process warnings; this section is the audit trail for what they cover.

  • Filesystem (fs) — used to install the native sidecar into the plugin's bin/ directory, to manage Whisper and Silero model files cached outside the vault, and to write transient audio dumps when transcription fails and you have diagnostics enabled. No vault content is read or written through fs; that goes through Obsidian's editor API.
  • Process execution (child_process) — used to spawn the local Rust sidecar (local-dictation-sidecar) and stream PCM audio to it over stdio. The command path is the installed binary; no shell is invoked and no part of the command is user-supplied.
  • Network — used only to download the sidecar archive once from this repository's GitHub Releases and to fetch model files from their official sources on demand. There is no telemetry, no analytics, no account, and no background traffic after setup.

The source of truth for these accesses is src/sidecar/sidecar-installer.ts and the IPC layer in src/sidecar/.
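
The process-execution bullet can be illustrated with a minimal sketch using Node's `child_process.spawn`. The `startSidecar` and `floatToPcm16` names are hypothetical, and the 16-bit little-endian sample format with no message framing is an assumption; the real wire format is defined by the IPC layer in src/sidecar/.

```typescript
import { spawn, type ChildProcessWithoutNullStreams } from "node:child_process";

// Convert float samples in [-1, 1] to 16-bit little-endian PCM.
// The sample format the real sidecar expects is an assumption here.
function floatToPcm16(samples: Float32Array): Buffer {
  const out = Buffer.alloc(samples.length * 2);
  samples.forEach((sample, i) => {
    const clamped = Math.max(-1, Math.min(1, sample));
    const scaled = clamped < 0 ? clamped * 0x8000 : clamped * 0x7fff;
    out.writeInt16LE(Math.round(scaled), i * 2);
  });
  return out;
}

// Spawn the installed binary directly. With shell: false no shell is invoked,
// and the argument list is fixed rather than built from user input.
function startSidecar(binaryPath: string): ChildProcessWithoutNullStreams {
  return spawn(binaryPath, [], { shell: false });
}

// Usage sketch (not executed here; the path is a placeholder):
//   const sidecar = startSidecar("/path/to/bin/local-dictation-sidecar");
//   sidecar.stdin.write(floatToPcm16(audioChunk));
//   sidecar.stdout.on("data", (chunk) => { /* handle sidecar output */ });
```

Passing the binary path and an argument array to `spawn`, rather than a command string to `exec`, is what keeps a shell out of the picture.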

Contributing

See CONTRIBUTING.md for prerequisites, setup, scripts, branching conventions, PR workflow, and architecture overview.

License

MIT. See LICENSE.

Scorecard: 89% (Health: Excellent, Review: Satisfactory)
About
Run private, GPU-accelerated dictation inside Obsidian using Whisper or Cohere Transcribe with Silero VAD for accurate speech boundary detection on macOS, Linux, and Windows. Process transcriptions locally with an optional Ollama LLM, manage models with one click, and keep everything offline for privacy.
Tags: Audio, AI, Writing
Details
  • Current version: 2026.5.19
  • Last updated: 3 hours ago
  • Created: last month
  • Updates: 1 release
  • Downloads: 3
  • Compatible with: Obsidian 1.8.7+
  • Platforms: desktop only
  • License: MIT
Author
Alexander Brittain (GitHub: brittain9)