⚠️ Early stage. The storage format is still stabilizing and may change between releases. Pending migrations are surfaced automatically after each update — keep an eye on the changelog.
MCP memory server

Your AI assistant
remembers, project by project.

mnemonic is a local MCP server that stores memories as plain markdown in git. Project-scoped semantic search. No database. No always-on service.

agent — mnemonic project
# New session. Context restored before you type a word.   recall "project architecture and key decisions"   mnemonic-project-overview-and-purpose   score: 0.96 MCP memory server backed by plain markdown + JSON, synced via git. No database, no permanent services. Notes are .md files with YAML frontmatter; embeddings via Ollama, gitignored and recomputed locally. Server spawns on demand over stdio — nothing runs in background. tags: architecture, mcp, overview   mnemonic-key-design-decisions   score: 0.88 One file per note for git conflict isolation. Embeddings gitignored because float arrays can't be merged. Project ID derived from git remote URL, not local path — stays stable across machines. tags: design, decisions, architecture

Stop re-explaining your codebase.

Your AI assistant forgets everything between sessions. mnemonic fixes that — without a database or a SaaS subscription.

📄

Plain markdown, always

Every memory is a markdown file with YAML frontmatter — designed for removability from day one. Give it a try; we think you'll like it enough to stay. And if you ever do leave, all the knowledge you've gathered is yours: plain markdown, independent of mnemonic, no strings attached.

🎯

Project-scoped recall

Memories belong to projects. When you're in a repo, relevant notes surface first with a similarity boost. Global memories stay accessible but don't crowd out context.

🌿

Git is the database

No Postgres to babysit. Every remember, update, and consolidate creates a semantic git commit — your decision log and implementation plans travel with the code in the same history. Pushes are controlled so project-vault writes do not fail on unpublished branches by default. History is inspectable, conflicts are isolated to individual files, and sync works across machines.

🔍

Semantic search, locally

Embeddings generated via Ollama run entirely on your machine — no data leaves your network. recall finds the right note even when you don't remember the exact words; grep and file tools only match strings you already know. Embeddings are gitignored and recomputed on each machine.

🤝

Shared team memory

Project memories live in .mnemonic/ inside your repo. Commit them alongside code so the whole team benefits from captured decisions.

No always-on service

The MCP server starts on demand via stdio. Claude Code, Cursor, and other MCP clients invoke it per-session. Zero background processes when you're not using it.

🛡️

Safe schema upgrades

Pending migrations are visible per vault, dry-runs are built in, and failed runs roll staged note writes back instead of leaving half-migrated memories behind.

📌

Notes surface smarter over time

Mark notes temporary for plans and WIP, permanent for decisions worth keeping. Decision and summary notes rank higher automatically as they accumulate references and relationships — no manual tagging required. Pin any note as a session anchor to always bring it to the top.

📜

Understand how decisions evolved

Opt-in temporal mode adds compact git-backed history to recall results. Each change is described in plain language — "Expanded the note with additional detail", "Connected this note to related work" — and the overall arc is summarised: "The core decision remained stable while rationale expanded." See how a decision formed without wading through raw diffs.

Two vaults, one mental model.

mnemonic routes memories between a private main vault and a shareable project vault based on where you are and what you're storing.

main vault

Private global memories stored in ~/mnemonic-vault — its own git repo. Cross-project knowledge, user preferences, early brainstorming before a repo exists, and anything you don't want committed to a project repo.

~/mnemonic-vault/ .gitignore notes/ setup-notes-a1b2c3.md user-prefs-d4e5f6.md embeddings/ *.json ← gitignored projections/ *.json ← gitignored
project vault

Project memories committed into <git-root>/.mnemonic/ alongside your source code. Architecture decisions, bug fix context, and tribal knowledge travel with the repo.

your-project/ .mnemonic/ .gitignore notes/ auth-bug-fix-d4e5f6.md jwt-migration-a1b2c3.md embeddings/ *.json ← gitignored projections/ *.json ← gitignored

No project yet? Capture ideas in the main vault first, then move only project-specific notes into your repo once it exists.

Recall flow — hybrid semantic + lexical search

Before embedding, notes are projected into clean structured text: title, lifecycle, tags, summary, and h1–h3 headings — no prose noise. Semantic search finds the right note even when you don't remember exact words. When results are weak, lexical matching rescues by scanning projections for keyword overlap. Inferred role and importance further boost scores so summary and decision notes surface naturally.

recall query
your search
semantic search
Ollama local
lexical rerank
tiebreaker
+
lexical rescue
weak results
ranked results
best match first

Everything you need, nothing you don't.

All tools are text-first and optimized for LLM consumption — compact, semantically explicit, no unnecessary structure. On protected branches, write actions pause and ask before committing unless you choose a different policy.

Capture & Retrieve
remember
Save something worth keeping
recall
Ask what you've previously learned - or how it evolved
update
Correct or expand a memory
forget
Remove something no longer useful
get
Pull up a specific note
list
See all memories for a project or tag
recent_memories
Pick up where you left off
discover_tags
Suggest canonical tags for a note
Knowledge Graph
relate
Connect two memories
unrelate
Break a connection between memories
memory_graph
See how your memories are connected
consolidate
Clean up overlapping notes
Project Context
detect_project
Check which project this folder belongs to
project_memory_summary
What does mnemonic know about this project?
where_is_memory
Find where a specific note lives
move_memory
Move a note from personal to project (or back)
get_project_identity
See this project's identity
set_project_identity
Change which remote defines this project
get_project_memory_policy
Check default memory and protected-branch behavior
set_project_memory_policy
Set default memory and protected-branch behavior
Vault Operations
sync
Keep memories in sync across machines
list_migrations
Check if your vault needs updating
execute_migration
Update your vault to the latest format

Running in minutes.

mnemonic requires Node.js 18+ and Ollama for local embeddings. No other services needed.

1 — Prerequisites

bash
ollama pull nomic-embed-text-v2-moe

qwen3-embedding:0.6b is also a viable alternative for longer-context notes:

bash
ollama pull qwen3-embedding:0.6b

No code changes are required; set EMBED_MODEL=qwen3-embedding:0.6b.

2 — Install mnemonic

Pick one:

npm (recommended)
npm install @danielmarbach/mnemonic-mcp
or
homebrew
brew tap danielmarbach/mnemonic-mcp https://github.com/danielmarbach/mnemonic
brew install mnemonic-mcp
or
docker
docker pull danielmarbach/mnemonic-mcp:latest

Pre-built for linux/amd64 and linux/arm64. Tagged with the release version and latest.

or build from source
bash
npm install
npm run build

Claude Code / Cursor config

mcp config (native)
{
  "mcpServers": {
    "mnemonic": {
      "command": "npx",
      "args": ["@danielmarbach/mnemonic-mcp"],
      "env": {
        "VAULT_PATH": "/Users/you/mnemonic-vault"
      }
    }
  }
}

Docker alternative

mcp config (docker)
{
  "mcpServers": {
    "mnemonic": {
      "command": "docker",
      "args": [
        "compose", "-f",
        "/path/to/mnemonic/compose.yaml",
        "run", "--rm", "mnemonic"
      ]
    }
  }
}

OpenCode

Add to ~/.config/opencode/opencode.json or project-local opencode.json.

opencode.json
{
  "$schema": "https://opencode.ai/config.json",
  "mcp": {
    "mnemonic": {
      "type": "local",
      "command": ["npx", "@danielmarbach/mnemonic-mcp"],
      "environment": {
        "VAULT_PATH": "/Users/you/mnemonic-vault"
      }
    }
  }
}

Codex

Add to ~/.codex/config.toml or project-local .codex/config.toml.

config.toml
[mcp_servers.mnemonic]
command = "npx"
args = ["@danielmarbach/mnemonic-mcp"]

[mcp_servers.mnemonic.env]
VAULT_PATH = "/Users/you/mnemonic-vault"

VS Code

Open the command palette and search for MCP: Open User Configuration, or edit mcp.json directly, then add the snippet below.

mcp.json
{
  "servers": {
    "mnemonic": {
      "command": "npx",
      "args": ["@danielmarbach/mnemonic-mcp"],
      "env": {
        "VAULT_PATH": "/Users/YOU/mnemonic-vault"
      }
    }
  }
}
Variable Default Description
VAULT_PATH ~/mnemonic-vault Path to your markdown vault
OLLAMA_URL http://localhost:11434 Ollama server URL
EMBED_MODEL nomic-embed-text-v2-moe Ollama embedding model
DISABLE_GIT false Set true to skip all git operations

The runtime is compatible with other Ollama embedding models that support /api/embed. For example, qwen3-embedding:0.6b works as a drop-in EMBED_MODEL override and may be preferable for longer-context notes.

The main vault's ~/mnemonic-vault/config.json holds machine-local settings you can edit by hand. Two fields are user-tunable:

Field Default Description
reindexEmbedConcurrency 4 Parallel embedding requests during sync (capped 1–16)
mutationPushMode main-only all, main-only, or none — controls when writes auto-push to git

projectMemoryPolicies and projectIdentityOverrides are written automatically by MCP tools — no need to edit them by hand.

~/mnemonic-vault/config.json
{
  "reindexEmbedConcurrency": 8,
  "mutationPushMode": "none"
}

No system prompt required

Mnemonic's tools are self-describing — each includes "use when" / "do not use when" guidance, behavioral annotations, and typed schemas. Most models will use them correctly from tool metadata alone.

For on-demand workflow guidance, use the mnemonic-workflow-hint MCP prompt. It stays compact for weaker models: start by checking what is already known, inspect before writing, prefer updating existing memory, capture new memory only when nothing matches, and clean up related notes when work is done. It also keeps resumed work grounded in project orientation first, with short-lived work-in-progress context recovered only afterward.

Commands for your terminal.

Vault maintenance and onboarding without an MCP client. Run them directly from the shell.

mnemonic migrate

Apply pending schema migrations to your vaults. Always preview with --dry-run first — failed runs roll staged writes back automatically.

bash
# Preview what would change
mnemonic migrate --dry-run

# Apply and auto-commit
mnemonic migrate

# List available migrations
mnemonic migrate --list

# Limit to one project vault
mnemonic migrate --cwd=/path/to/project

mnemonic import-claude-memory

Import Claude Code auto-memory into your vault. Each ## heading in ~/.claude/projects/<project>/memory/*.md becomes a separate mnemonic note — independently searchable via recall. Duplicate titles are skipped, so it's safe to re-run.

bash
# Preview what would be imported
mnemonic import-claude-memory --dry-run

# Import for the current project
mnemonic import-claude-memory

# Specific project path
mnemonic import-claude-memory --cwd=/path/to/project

# Then embed and push
mnemonic sync

mnemonic remembers its own decisions.

The project stores its design log in its own .mnemonic/ vault. Every architectural call and deliberate trade-off lives there as a real note, captured through the MCP tools while building the project.

design architecture decisions
mnemonic-key-design-decisions-3f2a6273

mnemonic — key design decisions

Similarity boost, not hard filter. recall gives project notes +0.15 cosine similarity rather than excluding global notes. Global memories (user prefs, cross-project patterns) remain accessible in project context.

No auto-relationship via LLM. Decided against using a local model to auto-build relationships. Small models lack session context, produce spurious edges, and corrupt the graph silently. Instead: agent instructions prompt relate immediately after remember while session context is warm.

Embeddings gitignored. Derived data — always recomputable. Committing them causes unresolvable merge conflicts (can't merge float arrays).

recall architecture decision
recall-heuristic-instead-of-full-dynamic-context-12324717

Recall heuristic instead of full dynamic context loading

Do not implement the full runtime project-context loading/unloading architecture yet. Implement a lightweight recall heuristic instead: when scope is all, prefer current-project matches first and widen to global matches only if needed to fill the limit.

Rationale: this captures most of the practical benefit without introducing cache lifecycle, invalidation, active-project state, or long-lived runtime complexity. Keep the broader dynamic-loading plan as a future scaling option.

architecture performance decision
active-session-project-cache-single-in-memory-vault-cache-pe-7463f124

Active session project cache

A module-level singleton in src/cache.ts caches notes, embeddings, and projections per vault for the duration of the MCP session. Notes and embeddings are co-loaded in a single Promise.all() pass on first access, so both are always warm after one I/O round trip.

The cache is keyed by project ID and invalidated on every write-path tool. All cache functions are fail-soft: errors return undefined and callers fall back to direct storage. Instrumented with [cache:hit/miss/build/invalidate/fallback] events and per-tool timing.

architecture design decision
enrichment-layer-design-provenance-temporal-recall-projectio-7af26f06

Enrichment layer design: provenance, temporal recall, projections, and relationship expansion

Four post-processing layers added on top of semantic recall, each additive and fail-soft: provenance and confidence metadata from git; opt-in temporal history enriched after semantic selection; projection-based embedding from structured note representations; bounded 1-hop relationship previews scored by project affinity, anchor status, and recency.

Core recall ranking is unaffected by any layer. Failures degrade gracefully to basic results.

design temporal phase-8
temporal-interpretation-strategy-f8573d1d

Temporal Interpretation Strategy

Temporal mode now explains what kind of change happened, not just that one occurred. Each history entry is classified into one of eight semantic categories — create, refine, expand, clarify, connect, restructure, reverse, unknown — using structural signals: additions/deletions ratio, churn, relationship changes, and commit message prefixes.

Classification is language-independent. Conservative thresholds prefer unknown over misclassification. Raw diffs are intentionally excluded from default output.

Every architectural call, trade-off, and lesson — committed to the repo inside .mnemonic/notes/ as plain markdown. Human-readable. Open the folder and read them directly.

Browse the decision log on GitHub