Seshat VTT

By thematthiasleitner (22 downloads)

Transcribe audio files with multiple STT providers and insert transcripts beneath audio links in your notes.

Transcribe audio files from Obsidian notes using multiple providers and insert transcript text directly under the audio reference.

Supported providers

  • OpenAI (/v1/audio/transcriptions)
  • Google Gemini (/v1beta/models/{model}:generateContent)
  • Groq (/openai/v1/audio/transcriptions)
  • Deepgram (/v1/listen)
  • AssemblyAI (/v2/upload + /v2/transcript)
  • Rev AI (/speechtotext/v1/jobs)
  • Speechmatics (/v2/jobs)
  • OpenAI-compatible custom endpoint ({base}/audio/transcriptions)
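
As a rough illustration of how the endpoint paths above could be organized, here is a minimal sketch. The names `PROVIDER_PATHS`, `geminiPath`, and `customTranscriptionUrl` are hypothetical and not taken from the plugin's source. Host names are left out because the listing gives only paths.

```typescript
// Hypothetical lookup table of the request paths listed above.
// Host names are omitted on purpose: the listing states only paths.
const PROVIDER_PATHS: Record<string, string[]> = {
  openai: ["/v1/audio/transcriptions"],
  groq: ["/openai/v1/audio/transcriptions"],
  deepgram: ["/v1/listen"],
  assemblyai: ["/v2/upload", "/v2/transcript"],
  revai: ["/speechtotext/v1/jobs"],
  speechmatics: ["/v2/jobs"],
};

// Gemini's path contains the model name, so it is built per request.
function geminiPath(model: string): string {
  return `/v1beta/models/${encodeURIComponent(model)}:generateContent`;
}

// For an OpenAI-compatible endpoint, the path is appended to a user-supplied
// base URL. Trailing slashes are trimmed so the result has no "//" in it.
function customTranscriptionUrl(base: string): string {
  return `${base.replace(/\/+$/, "")}/audio/transcriptions`;
}
```

With this sketch, a base of `http://localhost:8000/v1/` would resolve to `http://localhost:8000/v1/audio/transcriptions`.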

Usage

  1. Open plugin settings and select your active provider.
  2. Configure only the fields shown for that provider.
  3. Open the markdown note you want to process.
  4. Click the ribbon audio icon (Transcribe audio in current note).
  5. The plugin scans only that note, transcribes each audio reference it finds, and inserts the transcript below the matching reference.

If the current note has no supported audio links, no transcripts are added.
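
The scan-and-insert step can be sketched as follows. This is an illustration under assumptions, not the plugin's actual code: the listing does not say which link syntaxes or file extensions are recognized, so the sketch assumes Obsidian embeds (`![[file.mp3]]`) and standard markdown links, plus a hypothetical extension list. The function names are also hypothetical.

```typescript
// Assumed set of audio extensions; the plugin's real list is not documented here.
const AUDIO_EXT = "mp3|wav|m4a|ogg|flac|webm";

// Return the audio paths referenced on one line, in order of appearance.
function findAudioRefs(line: string): string[] {
  const pattern = new RegExp(
    // Wiki-style [[path.ext]] or ![[path.ext|alias]] ...
    `!?\\[\\[([^\\]|#]+\\.(?:${AUDIO_EXT}))(?:[|#][^\\]]*)?\\]\\]` +
      // ... or markdown-style [label](path.ext)
      `|!?\\[[^\\]]*\\]\\(([^)\\s]+\\.(?:${AUDIO_EXT}))\\)`,
    "gi"
  );
  const refs: string[] = [];
  let m: RegExpExecArray | null;
  while ((m = pattern.exec(line)) !== null) {
    refs.push(m[1] || m[2]);
  }
  return refs;
}

// Insert each available transcript on the line below its audio reference.
// References without a transcript are left untouched.
function insertTranscripts(note: string, transcripts: Map<string, string>): string {
  const out: string[] = [];
  for (const line of note.split("\n")) {
    out.push(line);
    for (const ref of findAudioRefs(line)) {
      const text = transcripts.get(ref);
      if (text !== undefined) out.push(text);
    }
  }
  return out.join("\n");
}
```

A note without any matching audio reference passes through unchanged, which mirrors the behavior described above.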

Community Submission Disclosures

  • This plugin sends audio content (and optional prompt/language hints) to the configured third-party transcription provider over the network.
  • Using this plugin typically requires provider accounts and API keys, and may incur provider charges.
  • The plugin itself does not include telemetry, ads, or a self-update mechanism.
  • Provider API keys are stored in the plugin's local Obsidian data file (data.json) on your device. Do not commit that file to Git.

Notes

  • Request and poll timing uses fixed defaults defined in code and is no longer configurable in settings.
  • The Default language and Prompt settings are sent as hints to providers that support them.
  • Settings now show only options for the currently selected provider.
  • Dynamic model dropdowns auto-refresh for OpenAI, Gemini, Groq, and OpenAI-compatible providers.
  • The ribbon action processes only the currently open markdown note.
  • Repeated references to the same audio file in a run reuse the same transcript API result to avoid duplicate charges.
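
One way the per-run reuse of transcripts could work is a small cache keyed by audio path. This is a sketch, not the plugin's implementation; `Transcribe` and `withRunCache` are hypothetical names. Create a fresh wrapper for each run so results are not carried over between runs.

```typescript
// Hypothetical signature for a provider call: audio path in, transcript out.
type Transcribe = (path: string) => Promise<string>;

// Wrap a provider call so each distinct path is requested at most once.
// Repeated references receive the same pending or completed promise.
function withRunCache(transcribe: Transcribe): Transcribe {
  const requested = new Map<string, Promise<string>>();
  return (path: string) => {
    let pending = requested.get(path);
    if (pending === undefined) {
      pending = transcribe(path);
      requested.set(path, pending);
    }
    return pending;
  };
}
```

Caching the promise rather than the resolved text means two references to the same file that are processed concurrently still trigger only one API request.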

Release

  • GitHub Actions release workflow: .github/workflows/release.yml
  • Required release assets: main.js, manifest.json, and styles.css
  • Tag name must exactly match manifest.json.version (no v prefix)
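
The tag rule can be checked before publishing with a small helper. This is a minimal sketch; `tagMatchesManifest` is a hypothetical name, and the release workflow may enforce the rule differently or not at all.

```typescript
// Return true only when the Git tag equals the manifest version exactly.
// A "v" prefix on the tag therefore counts as a mismatch.
function tagMatchesManifest(tag: string, manifestJson: string): boolean {
  const manifest = JSON.parse(manifestJson) as { version?: unknown };
  return typeof manifest.version === "string" && manifest.version === tag;
}
```

For a manifest with `"version": "0.1.0"`, the tag `0.1.0` passes and `v0.1.0` does not.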

Details

  • Current version: 0.1.0
  • Last updated: 3 months ago
  • Created: 3 months ago
  • Updates: 1 release
  • Downloads: 22
  • Compatible with: Obsidian 1.5.0+
  • Platforms: Desktop, Mobile
  • License: MIT
  • Categories: AI, Attachments, Integrations
  • Author: thematthiasleitner (github.com/thematthiasleitner)