Search...Search plugins and themes...
⌘K
Sign in
  • Get started
  • Download
  • Pricing
  • Enterprise
  • Account
  • Obsidian
  • Overview
  • Sync
  • Publish
  • Canvas
  • Mobile
  • Web Clipper
  • CLI
  • Learn
  • Help
  • Developers
  • Changelog
  • About
  • Roadmap
  • Blog
  • Resources
  • System status
  • License overview
  • Terms of service
  • Privacy policy
  • Security
  • Community
  • Plugins
  • Themes
  • Discord
  • Forum / 中文论坛
  • Merch store
  • Brand guidelines
Follow us
DiscordTwitterBlueskyThreadsMastodonYouTubeGitHub
© 2026 Obsidian

AI Audio Transcription and Summary

HackerHomeLabHackerHomeLab142 downloads

Record audio meetings locally, auto-chunk, and transcribe using Whisper or Gemini, then summarize using your LLM of choice.

Add to Obsidian
  • Overview
  • Scorecard
  • Updates3

Record voice notes and meetings directly within Obsidian (macOS & iOS), transcribe them using OpenAI Whisper or Google Gemini with automatic size-chunking, and summarize/polish them using your LLM of choice (Anthropic Claude, OpenAI GPT, or Google Gemini).

Support me: https://ko-fi.com/hackerh

🚀 Key Features

  • Direct Local Recording: Record high-quality audio directly inside Obsidian. Includes support for selecting specific input devices (microphones) via settings.
  • Dual-Provider Transcription:
    • OpenAI Whisper: Automatically chunks larger audio files at 22MB to safely stay under OpenAI's 25MB limit.
    • Google Gemini: Transcribes using the latest stable Gemini API (e.g., gemini-2.5-flash), chunking at 10MB to respect inline payload boundaries.
  • Intelligent Post-Processing: Automatically cleans up transcripts using Anthropic (Claude), OpenAI (GPT), or Google (Gemini) based on your custom formatting instructions.
  • Flexible Note Output:
    • Create a new Markdown file based on custom date/time templates and title generation.
    • Or paste the transcription/summary directly at your current cursor position.
  • Fail-Safe Resilience: If any network error, invalid key, or deprecated model causes transcription or post-processing to fail, the plugin will still save the recorded audio file to your vault and create the note with the media player link so you never lose a recording.
  • Premium Visuals: Features dynamic visual wave bars, recording status blinking alerts, and time counters designed to integrate seamlessly with modern Obsidian themes.

⚙️ Configuration & Setup

  1. Enable the plugin under Settings > Community Plugins.
  2. Go to the AI Audio Transcription & Summary settings tab.
  3. Fill in your API Keys:
    • Gemini API Key: Required if you use Google Gemini for transcription or post-processing.
    • Whisper API Key: Required for OpenAI Whisper transcription.
    • OpenAI / Anthropic API Key: Required if you use GPT or Claude for post-processing.
  4. Configure your Transcription Provider (OpenAI Whisper or Google Gemini) and select a recommended model from the dropdown.
  5. Set your output paths, file templates, and optional post-processing templates.

🛠️ Developer Commands

If you want to modify or compile the plugin yourself:

  1. Install dependencies:
    npm install
    
  2. Compile the production bundle:
    npm run build
    
    This compiles the TS source files into a single bundle file main.js.

☕ Support

If you find this plugin useful and want to support my work, you can buy me a coffee!

Your support helps me maintain the plugin and keep adding new features. Thank you!


📄 License

This project is licensed under the MIT License.

HealthExcellent
ReviewPassed
About
If you appreciate this - Please donate to continue to build the plugin: https://ko-fi.com/hackerh AI Audio Transcription & Summary is a premium Obsidian plugin (macOS & iOS) to record meetings, lectures, or voice notes directly inside your notes. It features dual-provider transcription and multi-LLM post-processing, combined with a robust local backup safeguard. Key Features: Direct Recording: Record high-quality audio inside Obsidian with active waveforms and timers. Whisper & Gemini Transcription: Automatic audio chunking (22MB for Whisper, 10MB for Gemini) to bypass API payload limits. LLM Summary & Formatting: Auto-format transcripts using Claude, GPT, or Gemini. Fail-Safe Recovery: If API or network requests fail, your raw audio recording is saved locally inside your vault and linked in your note so you never lose a file. Flexible Insertion: Insert summaries directly at your cursor or generate standalone notes from templates. Support development: https://ko-fi.com/hackerhomelab
AIAutomationReview
Details
Payments
Optional
Current version
1.0.3
Last updated
3 weeks ago
Created
3 weeks ago
Updates
3 releases
Downloads
142
Compatible with
Obsidian 1.4.0+
Platforms
Desktop, Mobile
License
MIT
Report bugRequest featureReport plugin
Payments
https://ko-fi.com/hackerhomelab
Author
HackerHomeLabHackerHomeLabhackerhomelab
GitHubhackerhomelab
  1. Community
  2. Plugins
  3. AI
  4. AI Audio Transcription and Summary

Related plugins

LearnKit

A native study system for your Obsidian vault. LearnKit turns notes into durable knowledge with flashcards, spaced repetition, tests, and more.

Karpathy LLM Wiki

Karpathy's LLM Wiki implementation - multi-page knowledge generation with entity/concept pages and conversational query.

Smart Templates

AI powered templates for generating structured content. Works with Local Models, Anthropic Claude, Gemini, OpenAI & more.

Templater

Create and use dynamic templates.

BRAT

Easily install a beta version of a plugin for testing.

Smart Connections

Find related notes and excerpts while writing. Your link building copilot displays relevant content in graph + list view. A local embedding model powers semantic search. Zero setup. No API key.

Homepage

Open a note, base, or workspace on startup, or set it for quick access later.

Claudian

Embeds Claude Code/Codex as an AI collaborator in your vault. Your vault becomes agent's working directory, giving it full agentic capabilities: file read/write, search, bash commands, and multi-step workflows.

Copilot

Your AI Copilot: Chat with Your Second Brain, Learn Faster, Work Smarter.

Lazy Loader

Load plugins with a delay on startup, so that you can get your app startup down into the sub-second loading time.