Audio Transcript

Transcribí audios con identificación de hablantes usando Gladia, Deepgram o AssemblyAI

Overview
Scorecard
Updates45

Record or transcribe audio files directly in Obsidian with speaker diarization and AI-powered summaries. Know who said what — and what it means.

Quick start

Install from Community Plugins → search "Audio Transcript"
Enable it in Settings → Community Plugins
Open Settings → Audio Transcript, pick a provider, paste your API key
Open the dashboard (📊 ribbon icon) or click the 🎙️ ribbon to record

The plugin auto-detects your Obsidian language. No manual language setting needed.

Dashboard & AI (v0.6.0)

Open the dashboard via the 📊 ribbon icon or Ctrl+P → "Abrir dashboard de transcripciones".

Transcription history — all your past transcriptions indexed automatically
AI Chat — chat with your transcriptions as context, powered by 8 LLM providers
One-click summaries — summarize any transcription with AI, inserted directly into the note
Prompt templates — create, edit, and reuse your own prompts
Flash / Advanced modes — choose different models for fast vs deep analysis
Status bar — live spob credits display

LLM Providers

Configure your LLM in Settings → Audio Transcript → IA (Proveedores LLM):

Provider	Requires API key
OpenAI (GPT-5.x)	Yes
Anthropic Claude	Yes
DeepSeek	Yes
Google Gemini	Yes
OpenRouter	Yes
Grok (xAI)	Yes
GLM (Z.ai)	Yes
Smart Plugins Obsidian (spob)	Yes — get one here

Transcription Providers

Provider	Diarization	Free tier	Get API key
Gladia	Yes	Free credits	app.gladia.io
Deepgram	Yes	$200 credits	console.deepgram.com
AssemblyAI	Yes	Free hours	assemblyai.com
OpenAI Whisper	No	Pay-as-you-go	platform.openai.com
Groq (Whisper)	No	Free tier	console.groq.com
Whisper (local)	No	Self-hosted	whisper.cpp
Smart Plugins Obsidian (spob)	Yes	Pay as you go	spob.fly.dev

spob es el proveedor por defecto. Obtené tu API key y empezá a transcribir con AssemblyAI al instante — sin configurar nada más.

Providers without diarization produce a single text block. Use Gladia, Deepgram, or AssemblyAI for speaker separation.

Features

Record or upload — record from your mic or pick audio files (MP3, WAV, WebM, etc.)
Speaker diarization — automatically labels who spoke when (Gladia, Deepgram, AssemblyAI)
Batch transcription — queue multiple files at once
Configurable output — custom templates with {speaker}, {time}, {text} placeholders
Timestamps with audio links — click a timestamp to jump to that moment in the saved audio
Callout wrapping — output inside a foldable > [!transcription] block
Auto language detection — matches your Obsidian UI language (Spanish or English)
Provider fallback — if one API fails, the plugin tries the next configured provider automatically
Audio preservation — recordings are saved before transcription, never lost on API failure
Dashboard & AI — transcription history, AI chat, summaries, and prompt templates (see above)

How it works

Audio is sent to your chosen provider's API
The provider transcribes and detects speakers
Speaker labels are replaced with the names you provide
The transcription is inserted into your active note

Example output:

**Jesús** `0:05`
Buen día, ¿cómo estás?

**María** `0:08`
Muy bien, gracias.

Support

Audio Transcript is free and open source. If it saves you hours of manual transcription, consider buying me a coffee:

Credits

Created by Jesús García & DeepSeek V4-Pro · GitHub

HealthExcellent

ReviewPassed

About

Graba o transcribe audios directamente en la nota con identificación de hablantes y timestamps. Usa Gladia, Deepgram o AssemblyAI, reemplaza "Speaker 1/2" por los nombres que indiques y guarda automáticamente el audio en la carpeta de la nota.

Audio Integrations

Details

Current version

1.1.1

Last updated

4 weeks ago

Created

2 months ago

Updates

45 releases

Downloads

395

Compatible with

Obsidian 1.8.7+

Platforms

Desktop, Mobile

License

MIT

Author

Jesús Garcíaje-bh91

jaliriogbarrios19

Quick start

Install from Community Plugins → search "Audio Transcript"

Enable it in Settings → Community Plugins

Open Settings → Audio Transcript, pick a provider, paste your API key

Open the dashboard (📊 ribbon icon) or click the 🎙️ ribbon to record

The plugin auto-detects your Obsidian language. No manual language setting needed.

Dashboard & AI (v0.6.0)

Open the dashboard via the 📊 ribbon icon or Ctrl+P → "Abrir dashboard de transcripciones".

Transcription history — all your past transcriptions indexed automatically

AI Chat — chat with your transcriptions as context, powered by 8 LLM providers

One-click summaries — summarize any transcription with AI, inserted directly into the note

Prompt templates — create, edit, and reuse your own prompts

Flash / Advanced modes — choose different models for fast vs deep analysis

Status bar — live spob credits display

LLM Providers

Configure your LLM in Settings → Audio Transcript → IA (Proveedores LLM):

Provider

Requires API key

OpenAI (GPT-5.x)

Yes

Anthropic Claude

Yes

DeepSeek

Yes

Google Gemini

Yes

OpenRouter

Yes

Grok (xAI)

Yes

GLM (Z.ai)

Yes

Smart Plugins Obsidian (spob)

Yes — get one here

Transcription Providers

Provider

Diarization

Free tier

Get API key

Gladia

Yes

Free credits

app.gladia.io

Deepgram

Yes

$200 credits

console.deepgram.com

AssemblyAI

Yes

Free hours

assemblyai.com

OpenAI Whisper

Pay-as-you-go

platform.openai.com

Groq (Whisper)

Free tier

console.groq.com

Whisper (local)

Self-hosted

whisper.cpp

Smart Plugins Obsidian (spob)

Yes

Pay as you go

spob.fly.dev

spob es el proveedor por defecto. Obtené tu API key y empezá a transcribir con AssemblyAI al instante — sin configurar nada más.

Providers without diarization produce a single text block. Use Gladia, Deepgram, or AssemblyAI for speaker separation.

Features

Record or upload — record from your mic or pick audio files (MP3, WAV, WebM, etc.)

Speaker diarization — automatically labels who spoke when (Gladia, Deepgram, AssemblyAI)

Batch transcription — queue multiple files at once

Configurable output — custom templates with {speaker}, {time}, {text} placeholders

Timestamps with audio links — click a timestamp to jump to that moment in the saved audio

Callout wrapping — output inside a foldable > [!transcription] block

Auto language detection — matches your Obsidian UI language (Spanish or English)

Provider fallback — if one API fails, the plugin tries the next configured provider automatically

Audio preservation — recordings are saved before transcription, never lost on API failure

Dashboard & AI — transcription history, AI chat, summaries, and prompt templates (see above)

Audio Transcript

Quick start

Dashboard & AI (v0.6.0)

LLM Providers

Transcription Providers

Features

How it works

Support

Credits

Audio Transcript

Quick start

Dashboard & AI (v0.6.0)

LLM Providers

Transcription Providers

Features

How it works

Support

Credits

Related plugins

BRAT

Local REST API with MCP

Fast Note Sync

Self-hosted LiveSync

Maps

Zotero Integration

Agent Client

Readwise Official

Custom Frames

LanguageTool Integration