πΊοΈ Roadmap: Obsidian Vault Intelligence β
Mission: To transform your Obsidian vault from passive storage into an active, intelligent partner that connects, verifies, and serves knowledge anywhere.
This document outlines the strategic direction for Vault Intelligence. It is a living document that evolves as we learn from our users and the rapidly changing AI landscape.
π’ Phase 1: The Foundation (Completed) β
Goal: Removing vendor lock-in and establishing a "Batteries Included" privacy-first experience.
[x] Universal "Batteries included" embeddings
- The Vision: Install the plugin and it just works. No API keys required for basic search.
- The Tech: We bundle a lightweight, high-performance model (like
all-MiniLM-L6-v2) directly into the plugin. It runs 100% locally on your device, ensuring total privacy and zero cost. - Sovereign Intelligence: "Local-Only" switch that instantly cuts off all cloud API calls.
[x] "Sovereign intelligence" (user control & privacy)
- The Vision: The "Right Model for the Right Task," transparently controlled by you.
- The Tech: Smart routing between fast local models and powerful cloud reasoning.
π’ Phase 2: The Agentic Revolution (Completed) β
Goal: The AI takes initiative. It stops being a passive chatbot and becomes a proactive worker.
[x] "The Researcher" (deep reasoning agent)
- The Vision: A dedicated agent that can read long documents (200k+ tokens) and perform multi-step reasoning.
- Features: Auto-summarisation, cross-document comparison, and citation tracking using "Greedy Context Packing".
[x] "The Computational Solver" (code interpreting)
- The Vision: The agent can write and execute Python code to analyse your data.
- The Use Case: "Read my @Expenses note and forecast next month's spend."
- Privacy: Code runs in a sandboxed WASM environment or via the Gemini Code Execution API.
[x] "The Gardener" (vault hygiene agent)
- The Vision: An agent that proactively tidies your vault.
- The Workflow: Scans recent notes, proposes an interactive plan, and applies changes safely after user review.
- The Ontology: Introduces a formal structure (
Concepts/,Entities/) so the AI knows where things belong.
[x] "The Explorer" (semantic navigation)
- The Vision: A "See Also" sidebar that updates as you type.
- The Tech: High-recall similarity search that finds related notes even if they don't share keywords.
π‘ Phase 3: Breaking Silos (Current Focus) β
Goal: The Agent stops living in the sidebar. It works IN your editor and OUT with other apps.
[ ] "The Ghostwriter" (inline co-creation)
- The Vision: Break the "Chat Sidebar" silo. The agent works directly in your editor, acting as a collaborative writer.
- The Features: Inline Edit, Generative Insertion, and Smart File Creation.
[ ] Model Context Protocol (MCP) server
- The Vision: Use your vault notes inside Claude Desktop, Microsoft Copilot, or other AI tools.
- The Tech: Implement the MCP Standard to turn this plugin into a local server.
[ ] Multi-provider reasoning
- The Vision: Freedom of choice. Use the best model for the job, regardless of who makes it.
- The Tech: Abstraction layer allowing the Research Agent to run on OpenAI, Anthropic, or local open-weights models.
π Phase 4: Visual Intelligence (The "Excalidraw" Stream) β
Goal: Treating diagrams, sketches, and spatial layouts as first-class citizens.
[ ] "The Art Critic" (structure extraction)
- The Insight: Standard search tools cannot see the relationships (arrows, groups, flow) encoded in drawing data.
- The Tech: Parse
compressed-jsonblocks to extract explicit connections.
[ ] ExcaliBrain graph reasoning
- The Integration: Deep support for ExcaliBrain.
[ ] "Sketch-to-Structure" (de-rendering)
- The Vision: Turn a messy whiteboard sketch into a clean note.
[ ] "Text-to-Diagram" (generative UI)
- The Vision: Ask the agent to draw for you.
π΅ Phase 5: The Agentic Leap (Future Horizons) β
Goal: Moving from "Questions" to "Tasks." The agent goes off, does work, and comes back.
[ ] Voice interface (desktop first)
- The Vision: Talk to your vault while you work.
[ ] The "Analyst" (multimodal ingestion)
- The Vision: Drag images, PDFs, and audio recordings into the chat.
[ ] Autonomous research reports
- The Vision: Give the agent a job, not a prompt. "Research the current state of Solid State Batteries."
Phase 6: Blue sky (experimental) β
Goal: Novel interaction paradigms that define the future of PKM.
[ ] "Graph Gardener" (maintenance agent)
- A background agent that studies your vault's structure while you sleep.
[ ] Temporal intelligence ("vault evolution")
- Analyse how your opinion on a topic has changed over time.
Technical architecture & challenges β
1. The "Batteries included" embedding layer β
- Status: Delivered (Phase 1).
- Next: WebGPU transition.
2. Editor integration ("Ghostwriter") β
- Constraint: Concurrency safety.
- Strategy:
Editortransaction API.
3. Model Context Protocol (MCP) implementation β
- Constraint: Local server security.
4. Handling Excalidraw hybrid files β
- Strategy:
LZStringdecompression.
Contributing β
This roadmap is not set in stone. We welcome community feedback!
- Have an idea? Open a Feature Request.
- Want to build it? Look for issues tagged
help wantedorgood first issue.
Research horizons (2026) β
Experimental features targeting the new capabilities of Gemini 3, GPT-5, and Llama 4.
1. Visual vault indexing (multimodal RAG) β
Index every chart, whiteboard photo, and PDF diagram.
2. Autonomous verification layers (corrective RAG) β
The agent verifies its own retrieval quality before answering.
3. "Agent OS" orchestration (knowledge runtimes) β
Treat the Vault as a "Knowledge Runtime" with specialized agents.
4. Federated RAG (privacy & silos) β
Connect to data outside the Obsidian vault without importing it.