sourit200114 downloadsConvert the current note or selected text to MP3 audio through user-configured TTS APIs.
Note TTS is an Obsidian plugin that turns the current note or selected text into an MP3 audio file through your own TTS API provider. The generated audio is saved inside your vault and opened in a built-in playback modal.
Convert current note to speech and Convert selected text to speechPreview cleaned text for speech command to inspect the text before it is sent to TTSTTS Audio/ by defaultCopy these files into your vault plugin folder:
<your vault>/.obsidian/plugins/note-tts/
manifest.json
main.js
styles.css
Then enable Note TTS in Obsidian's community plugins settings.
Choose MiniMax in the plugin settings and fill in:
API KeyModel: defaults to speech-2.8-turboVoice ID: defaults to Chinese_Mandarin_Gentleman; you can also use another system voice or a cloned voice IDLanguage boost: use auto for automatic detection, or Chinese for Chinese textThe plugin calls https://api.minimax.io/v1/t2a_v2 and expects MiniMax to return MP3 audio data as hex.
Choose Replicate in the plugin settings and fill in:
API TokenModel: defaults to minimax/speech-2.8-turboVoice: choose a MiniMax Speech 2.8 Turbo system voice; choose Custom to use the next fieldCustom voice ID: used when Voice is set to Custom, usually for a MiniMax voice-cloning voice_idLanguage preference: defaults to AutoEmotion: defaults to AutoThe default model calls Replicate's official model endpoint:
https://api.replicate.com/v1/models/minimax/speech-2.8-turbo/predictions
To use another Replicate model, change Model to another owner/name, then fill in:
Model version: the Replicate model version hashInput JSON template: for example:{
"text": "{{text}}"
}
Different Replicate TTS models may require different input fields. You can edit the template to match the model, such as prompt, voice, or language. The plugin polls the prediction, finds the first audio URL in the output, downloads it, and saves it to your vault.
A custom provider can return:
Use Audio URL path, Audio hex path, or Audio base64 path to specify the JSON path, for example data.audio_url.