Audio Transcript

Jesús García · 40 downloads

Transcribe audio with speaker identification using Gladia, Deepgram, or AssemblyAI.

Record or transcribe audio files directly in Obsidian with speaker diarization — know who said what. Supports Gladia, Deepgram, AssemblyAI, OpenAI Whisper, Groq, and local Whisper.

Quick start

  1. Install from Community Plugins → search "Audio Transcript"
  2. Enable it in Settings → Community Plugins
  3. Open Settings → Audio Transcript, pick a provider, paste your API key
  4. Open a note, click the 🎙️ ribbon icon, and record or pick a file

The plugin detects your Obsidian interface language (Spanish or English) automatically, so there is no language setting to configure.

Providers

| Provider | Diarization | Free tier | Get API key |
| --- | --- | --- | --- |
| Gladia | Yes | Free credits | app.gladia.io |
| Deepgram | Yes | $200 credits | console.deepgram.com |
| AssemblyAI | Yes | Free hours | assemblyai.com |
| OpenAI Whisper | No | Pay-as-you-go | platform.openai.com |
| Groq (Whisper) | No | Free tier | console.groq.com |
| Whisper (local) | No | Self-hosted | whisper.cpp |

Providers without diarization produce a single text block. Use Gladia, Deepgram, or AssemblyAI for speaker separation.
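
As a rough illustration of the table above, the diarization column can be expressed as a small lookup. This is only a sketch: the `Provider` identifiers, the `DIARIZATION` map, and `supportsDiarization` are hypothetical names chosen for this example and are not taken from the plugin's source.

```typescript
// Hypothetical provider identifiers; the plugin's real setting keys may differ.
type Provider =
  | "gladia"
  | "deepgram"
  | "assemblyai"
  | "openai-whisper"
  | "groq"
  | "whisper-local";

// Diarization support, copied from the "Diarization" column of the table.
const DIARIZATION: Record<Provider, boolean> = {
  "gladia": true,
  "deepgram": true,
  "assemblyai": true,
  "openai-whisper": false,
  "groq": false,
  "whisper-local": false,
};

// Returns true when the provider can separate speakers.
function supportsDiarization(provider: Provider): boolean {
  return DIARIZATION[provider];
}

// Lists the providers to choose from when speaker separation is required.
function diarizingProviders(): Provider[] {
  return (Object.keys(DIARIZATION) as Provider[]).filter(supportsDiarization);
}
```

A caller could use `supportsDiarization` to decide whether to ask for speaker names before starting a transcription.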

Features

  • Record or upload — record from your mic or pick audio files (MP3, WAV, WebM, etc.)
  • Speaker diarization — automatically labels who spoke when (Gladia, Deepgram, AssemblyAI)
  • Batch transcription — queue multiple files at once
  • Configurable output — custom templates with {speaker}, {time}, {text} placeholders
  • Timestamps with audio links — click a timestamp to jump to that moment in the saved audio
  • Callout wrapping — output inside a foldable > [!transcription] block
  • Auto language detection — matches your Obsidian UI language (Spanish or English)
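
To make the template and callout features above more concrete, here is a minimal sketch of how `{speaker}`, `{time}`, and `{text}` placeholders could be filled in and the result wrapped in a callout. The `Segment` shape and the function names `formatTime`, `renderSegment`, and `wrapInCallout` are assumptions made for illustration, not the plugin's actual code. The `-` after `[!transcription]` is Obsidian's marker for a callout that starts folded; whether the plugin emits it is also an assumption.

```typescript
// Hypothetical shape of one transcribed segment; `start` is in seconds.
interface Segment {
  speaker: string;
  start: number;
  text: string;
}

// Formats a number of seconds as m:ss, for example 5 becomes "0:05".
function formatTime(totalSeconds: number): string {
  const whole = Math.max(0, Math.floor(totalSeconds));
  const minutes = Math.floor(whole / 60);
  const seconds = whole % 60;
  return `${minutes}:${seconds < 10 ? "0" : ""}${seconds}`;
}

// Replaces {speaker}, {time}, and {text} in the template with segment values.
// Any other text in braces is left untouched.
function renderSegment(template: string, segment: Segment): string {
  const values: Record<string, string> = {
    speaker: segment.speaker,
    time: formatTime(segment.start),
    text: segment.text,
  };
  return template.replace(
    /\{(speaker|time|text)\}/g,
    (match: string, key: string) => values[key] ?? match,
  );
}

// Prefixes every line with ">" so the body sits inside a folded callout.
function wrapInCallout(body: string, title: string = "Transcription"): string {
  const quoted = body
    .split("\n")
    .map((line) => (line.length > 0 ? `> ${line}` : ">"))
    .join("\n");
  return `> [!transcription]- ${title}\n${quoted}`;
}

// Example: a template with the speaker in bold and the time in backticks.
const exampleTemplate = "**{speaker}** `{time}`\n{text}";
const exampleBlock = renderSegment(exampleTemplate, {
  speaker: "Jesús",
  start: 5,
  text: "Buen día, ¿cómo estás?",
});
console.log(wrapInCallout(exampleBlock));
```

Restricting the regular expression to the three known placeholder names keeps braces that appear in the transcribed text itself from being altered.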

How it works

  1. Audio is sent to your chosen provider's API
  2. The provider transcribes and detects speakers
  3. Speaker labels are replaced with the names you provide
  4. The transcription is inserted into your active note

Example output:

**Jesús** `0:05`
Buen día, ¿cómo estás?

**María** `0:08`
Muy bien, gracias.
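
The speaker-renaming step (step 3) can be sketched as follows. This is one plausible approach, not the plugin's confirmed behavior: it assumes the provider returns generic labels such as "Speaker 1" and that the names you supply are assigned in order of first appearance. `RawSegment` and `renameSpeakers` are hypothetical names.

```typescript
// Hypothetical shape of a segment as returned by a diarizing provider.
interface RawSegment {
  speaker: string; // generic label, e.g. "Speaker 1"
  start: number;   // seconds from the beginning of the audio
  text: string;
}

// Maps each distinct provider label to a user-supplied name, in order of
// first appearance. Labels without a matching name are kept unchanged.
function renameSpeakers(segments: RawSegment[], names: string[]): RawSegment[] {
  const mapping = new Map<string, string>();
  return segments.map((segment) => {
    let name = mapping.get(segment.speaker);
    if (name === undefined) {
      const provided = names[mapping.size];
      name =
        provided !== undefined && provided.trim() !== ""
          ? provided
          : segment.speaker;
      mapping.set(segment.speaker, name);
    }
    return { ...segment, speaker: name };
  });
}

// Example: two provider labels replaced by the names from the output above.
const renamedExample = renameSpeakers(
  [
    { speaker: "Speaker 1", start: 5, text: "Buen día, ¿cómo estás?" },
    { speaker: "Speaker 2", start: 8, text: "Muy bien, gracias." },
  ],
  ["Jesús", "María"],
);
console.log(renamedExample.map((s) => `${s.speaker}: ${s.text}`).join("\n"));
```

Falling back to the original label means a recording with more speakers than supplied names still produces complete output.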

Support

Audio Transcript is free and open source. If it saves you hours of manual transcription, consider buying me a coffee.

Credits

Created by Jesús García & DeepSeek V4-Pro · GitHub

Health: 99% (Excellent)
Review: Passed
About
Record or transcribe audio directly in the note, with speaker identification and timestamps. It uses Gladia, Deepgram, or AssemblyAI, replaces "Speaker 1/2" with the names you provide, and automatically saves the audio in the note's folder.
Audio, Integrations
Details
Current version: 0.2.5
Last updated: Yesterday
Created: Last week
Updates: 9 releases
Downloads: 40
Compatible with: Obsidian 1.5.0+
Platforms: Desktop, Mobile
License: MIT
Author: Jesús García (je-bh91)
GitHub: jaliriogbarrios19