a2fsa2k38 downloadsChat with your vault using local vector search (RAG) and any AI provider — OpenAI, Claude, Gemini, Mistral, Groq, Ollama, and more.
Chat with your Obsidian vault using any AI provider. Ask questions in natural language and get answers grounded in your actual notes, with clickable source citations.
No Python backend, no external server, no setup required. Everything runs inside Obsidian.
| Provider | Notes |
|---|---|
| OpenAI | GPT-4o, GPT-4, GPT-3.5, etc. |
| Anthropic | Claude 3.5 Sonnet, Claude 3, etc. |
| Gemini 2.0 Flash, Gemini 1.5, etc. | |
| Mistral | Mistral Large, Mistral Small, etc. |
| Groq | Fast inference for Llama, Mixtral, etc. |
| xAI | Grok |
| DeepSeek | DeepSeek Chat, DeepSeek Coder |
| Cohere | Command R+ |
| Together AI | Open-source models |
| Perplexity | Sonar models |
| Ollama | Local — llama3, mistral, phi3, etc. |
| llama.cpp | Local — any GGUF model |
| LM Studio | Local — any model |
| Jan | Local — any model |
| Custom | Any OpenAI-compatible API |
Install via Obsidian's Community Plugins browser:
Select Local LLM as the provider, choose your server type (Ollama, llama.cpp, LM Studio, Jan, or other OpenAI-compatible), enter the server URL and model name, and you're done. No API key required.
Quick start with Ollama:
ollama serve
ollama pull llama3
Then set provider → Local LLM, type → Ollama, URL → http://localhost:11434, model → llama3.
No results / poor answers Go to Settings → RAG Chat and click Rebuild index. This re-indexes all notes from scratch.
API errors Check that your API key is correct and the selected model name matches what your provider offers.
Local LLM not reachable Use the Test button in settings to verify the server URL. Make sure your local server is running before sending a message.
MIT — see LICENSE