mnemonic — AI memory that lives in your repo

Why mnemonic

Stop re-explaining your codebase.

Your AI assistant forgets everything between sessions. mnemonic fixes that — without a database or a SaaS subscription.

📄

Plain markdown, always

Every memory is a markdown file with YAML frontmatter — designed for removability from day one. Give it a try; we think you'll like it enough to stay. And if you ever do leave, all the knowledge you've gathered is yours: plain markdown, independent of mnemonic, no strings attached.

🎯

Project-scoped recall

Memories belong to projects. When you're in a repo, relevant notes surface first with a similarity boost. Global memories stay accessible but don't crowd out context.

🌿

Git is the database

No Postgres to babysit. Every remember, update, and consolidate creates a semantic git commit — your decision log and implementation plans travel with the code in the same history. Pushes are controlled so project-vault writes do not fail on unpublished branches by default. History is inspectable, conflicts are isolated to individual files, and sync works across machines.

🔍

Semantic search, locally

Embeddings generated via Ollama run entirely on your machine — no data leaves your network. recall finds the right note even when you don't remember the exact words; grep and file tools only match strings you already know. Embeddings are gitignored and recomputed on each machine.

🤝

Shared team memory

Project memories live in .mnemonic/ inside your repo. Commit them alongside code so the whole team benefits from captured decisions.

⚡

No always-on service

The MCP server starts on demand via stdio. Claude Code, Cursor, and other MCP clients invoke it per-session. Zero background processes when you're not using it.

🛡️

Safe schema upgrades

Pending migrations are visible per vault, dry-runs are built in, and failed runs roll staged note writes back instead of leaving half-migrated memories behind.

⏳

Temporary note lifecycle

Your LLM can plan and track complex feature work across sessions — implementation plans, WIP checkpoints, open questions — committed to git so they evolve alongside the code. Mark notes temporary and they naturally fade once the work is done; permanent for decisions and lessons worth keeping forever.

Setup

Running in minutes.

mnemonic requires Node.js 18+ and Ollama for local embeddings. No other services needed.

1 — Prerequisites

bash

ollama pull nomic-embed-text-v2-moe

qwen3-embedding:0.6b is also a viable alternative for longer-context notes:

bash

ollama pull qwen3-embedding:0.6b

No code changes are required; set EMBED_MODEL=qwen3-embedding:0.6b.

2 — Install mnemonic

Pick one:

npm (recommended)

npm install @danielmarbach/mnemonic-mcp

homebrew

brew tap danielmarbach/mnemonic-mcp https://github.com/danielmarbach/mnemonic
brew install mnemonic-mcp

docker

docker pull danielmarbach/mnemonic-mcp:latest

Pre-built for linux/amd64 and linux/arm64. Tagged with the release version and latest.

or build from source

bash

npm install
npm run build

Claude Code / Cursor config

mcp config (native)

{
  "mcpServers": {
    "mnemonic": {
      "command": "npx",
      "args": ["@danielmarbach/mnemonic-mcp"],
      "env": {
        "VAULT_PATH": "/Users/you/mnemonic-vault"
      }
    }
  }
}

Docker alternative

mcp config (docker)

{
  "mcpServers": {
    "mnemonic": {
      "command": "docker",
      "args": [
        "compose", "-f",
        "/path/to/mnemonic/compose.yaml",
        "run", "--rm", "mnemonic"
      ]
    }
  }
}

OpenCode

Add to ~/.config/opencode/opencode.json or project-local opencode.json.

opencode.json

{
  "$schema": "https://opencode.ai/config.json",
  "mcp": {
    "mnemonic": {
      "type": "local",
      "command": ["npx", "@danielmarbach/mnemonic-mcp"],
      "environment": {
        "VAULT_PATH": "/Users/you/mnemonic-vault"
      }
    }
  }
}

Codex

Add to ~/.codex/config.toml or project-local .codex/config.toml.

config.toml

[mcp_servers.mnemonic]
command = "npx"
args = ["@danielmarbach/mnemonic-mcp"]

[mcp_servers.mnemonic.env]
VAULT_PATH = "/Users/you/mnemonic-vault"

VS Code

Open the command palette and search for MCP: Open User Configuration, or edit mcp.json directly, then add the snippet below.

mcp.json

{
  "servers": {
    "mnemonic": {
      "command": "npx",
      "args": ["@danielmarbach/mnemonic-mcp"],
      "env": {
        "VAULT_PATH": "/Users/YOU/mnemonic-vault"
      }
    }
  }
}

Variable	Default	Description
`VAULT_PATH`	~/mnemonic-vault	Path to your markdown vault
`OLLAMA_URL`	http://localhost:11434	Ollama server URL
`EMBED_MODEL`	nomic-embed-text-v2-moe	Ollama embedding model
`DISABLE_GIT`	false	Set `true` to skip all git operations

The runtime is compatible with other Ollama embedding models that support /api/embed. For example, qwen3-embedding:0.6b works as a drop-in EMBED_MODEL override and may be preferable for longer-context notes.

The main vault's ~/mnemonic-vault/config.json holds machine-local settings you can edit by hand. Two fields are user-tunable:

Field	Default	Description
`reindexEmbedConcurrency`	4	Parallel embedding requests during `sync` (capped 1–16)
`mutationPushMode`	main-only	`all`, `main-only`, or `none` — controls when writes auto-push to git

projectMemoryPolicies and projectIdentityOverrides are written automatically by MCP tools — no need to edit them by hand.

~/mnemonic-vault/config.json

{
  "reindexEmbedConcurrency": 8,
  "mutationPushMode": "none"
}

No system prompt required

Mnemonic's tools are self-describing — each includes "use when" / "do not use when" guidance, behavioral annotations, and typed schemas. Most models will use them correctly from tool metadata alone.

For on-demand workflow guidance, use the mnemonic-workflow-hint MCP prompt. It now acts as a compact decision protocol: start with recall or list, inspect with get, prefer update for existing memories, use remember only when nothing matches, then organize with relate, consolidate, or move_memory.

Dogfooding

mnemonic remembers its own decisions.

The project stores its design log in its own .mnemonic/ vault. Every architectural call and deliberate trade-off lives there as a real note, captured through the MCP tools while building the project.

design architecture decisions

mnemonic-key-design-decisions-3f2a6273

mnemonic — key design decisions

Similarity boost, not hard filter. recall gives project notes +0.15 cosine similarity rather than excluding global notes. Global memories (user prefs, cross-project patterns) remain accessible in project context.

No auto-relationship via LLM. Decided against using a local model to auto-build relationships. Small models lack session context, produce spurious edges, and corrupt the graph silently. Instead: agent instructions prompt relate immediately after remember while session context is warm.

Embeddings gitignored. Derived data — always recomputable. Committing them causes unresolvable merge conflicts (can't merge float arrays).

recall architecture decision

recall-heuristic-instead-of-full-dynamic-context-12324717

Recall heuristic instead of full dynamic context loading

Do not implement the full runtime project-context loading/unloading architecture yet. Implement a lightweight recall heuristic instead: when scope is all, prefer current-project matches first and widen to global matches only if needed to fill the limit.

Rationale: this captures most of the practical benefit without introducing cache lifecycle, invalidation, active-project state, or long-lived runtime complexity. Keep the broader dynamic-loading plan as a future scaling option.

markdown linting dogfooding

markdown-linting-for-memory-content-259a1c85

Markdown linting for memory content

Lint markdown note bodies during remember and update so recalled content stays clean. Auto-apply fixable issues like malformed headings, list spacing, and extra blank lines. Reject non-fixable issues after auto-fix so low-quality markdown is never stored.

Disable MD013 (line length) and MD041 (first line must be H1) because note bodies are content fragments, not standalone documents.

Dogfooding note: this decision was captured through the local MCP server itself, not by writing vault files directly.

design consolidation mcp-tool

mnemonic-consolidate-tool-design-b9cbac6a

mnemonic consolidate tool design

Two modes. supersedes keeps sources and adds a relationship to the new consolidated note — preserves history, allows pruning later. delete hard-deletes sources via forget for clean, immediate results.

Cross-vault. Gathers all notes with matching project ID from both vaults, consolidates into the project vault. The consolidated note inherits all relationships from sources.

Safety. Dry-run by default for analysis strategies. Git commit per operation for easy revert.

These 25 notes — and everything mnemonic knows about itself — are committed to the repo inside .mnemonic/notes/. Plain markdown. Human-readable. Open the folder and read them directly.

Browse the decision log on GitHub

Your AI assistant
remembers, project by project.

Stop re-explaining your codebase.

Plain markdown, always

Project-scoped recall

Git is the database

Semantic search, locally

Shared team memory

No always-on service

Safe schema upgrades

Temporary note lifecycle

Two vaults, one mental model.

Recall flow — project-boosted semantic search

Everything you need, nothing you don't.

Running in minutes.

1 — Prerequisites

2 — Install mnemonic

Claude Code / Cursor config

Docker alternative

OpenCode

Codex

VS Code

No system prompt required

Commands for your terminal.

mnemonic migrate

mnemonic import-claude-memory

mnemonic remembers its own decisions.

mnemonic — key design decisions

Recall heuristic instead of full dynamic context loading

Markdown linting for memory content

mnemonic consolidate tool design

Your AI assistant remembers, project by project.

Stop re-explaining your codebase.

Plain markdown, always

Project-scoped recall

Git is the database

Semantic search, locally

Shared team memory

No always-on service

Safe schema upgrades

Temporary note lifecycle

Two vaults, one mental model.

Recall flow — project-boosted semantic search

Everything you need, nothing you don't.

Running in minutes.

1 — Prerequisites

2 — Install mnemonic

Claude Code / Cursor config

Docker alternative

OpenCode

Codex

VS Code

No system prompt required

Commands for your terminal.

mnemonic migrate

mnemonic import-claude-memory

mnemonic remembers its own decisions.

mnemonic — key design decisions

Recall heuristic instead of full dynamic context loading

Markdown linting for memory content

mnemonic consolidate tool design

Your AI assistant
remembers, project by project.