
Alex Newman c648d5d8d2 feat: Knowledge Agents — queryable corpora from claude-mem (#1653)

* feat: add knowledge agent types, store, builder, and renderer

Phase 1 of Knowledge Agents feature. Introduces corpus compilation
pipeline that filters observations from the database into portable
corpus files stored at ~/.claude-mem/corpora/.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: add corpus CRUD HTTP endpoints and wire into worker service

Phase 2 of Knowledge Agents. Adds CorpusRoutes with 5 endpoints
(build, list, get, delete, rebuild) and registers them during
worker background initialization alongside SearchRoutes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: add KnowledgeAgent with V1 SDK prime/query/reprime

Phase 3 of Knowledge Agents. Uses Agent SDK V1 query() with
resume and disallowedTools for Q&A-only knowledge sessions.
Auto-reprimes on session expiry. Adds prime, query, and reprime
HTTP endpoints to CorpusRoutes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: add MCP tools and skill for knowledge agents

Phase 4 of Knowledge Agents. Adds build_corpus, list_corpora,
prime_corpus, and query_corpus MCP tools delegating to worker
HTTP endpoints. Includes /knowledge-agent skill with workflow docs.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: handle SDK process exit in KnowledgeAgent, add e2e test

The Agent SDK may throw after yielding all messages when the
Claude process exits with a non-zero code. Now tolerates this
if session_id/answer were already captured. Adds comprehensive
e2e test script (31 assertions) orchestrated via tmux-cli.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: use settings model ID instead of hardcoded model in KnowledgeAgent

Reads CLAUDE_MEM_MODEL from user settings via getModelId(), matching
the existing SDKAgent pattern. No more hardcoded model assumptions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: improve knowledge agents developer experience

Add public documentation page, rebuild/reprime MCP tools, and actionable
error messages. DX review scored knowledge agents 4/10 — core engineering
works (31/31 e2e) but the feature was invisible. This addresses
discoverability (docs, cross-links), API completeness (missing MCP tools),
and error quality (fix/example fields in error responses).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* docs: add quick start guide to knowledge agents page

Covers the three main use cases upfront: creating an agent, asking a
single question, and starting a fresh conversation with reprime. Includes
keeping-it-current section for rebuild + reprime workflow.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: address code review issues — path traversal, session safety, prompt injection

- Block path traversal in CorpusStore with alphanumeric name validation and resolved path check
- Harden system prompt against instruction injection from untrusted corpus content
- Validate question field as non-empty string in query endpoint
- Only persist session_id after successful prime (not null on failure)
- Persist refreshed session_id after query execution
- Only auto-reprime on session resume errors, not all query failures
- Add fenced code block language tags to SKILL.md

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: address remaining code review issues — e2e robustness, MCP validation, docs

- Harden e2e curl wrappers with connect-timeout, fallback to HTTP 000 on transport failure
- Use curl_post wrapper consistently for all long-running POST calls
- Add runtime name validation to all corpus MCP tool handlers
- Fix docs: soften hallucination guarantee to probabilistic claim
- Fix architecture diagram: add missing rebuild_corpus and reprime_corpus tools

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: enforce string[] type in safeParseJsonArray for corpus data integrity

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: add blank line before fenced code blocks in SKILL.md maintenance section

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-08 17:30:20 -07:00


| name | description |
|------|-------------|
| mem-search | Search claude-mem's persistent cross-session memory database. Use when user asks "did we already solve this?", "how did we do X last time?", or needs work from previous sessions. |

Memory Search

Search past work across all sessions. Simple workflow: search -> filter -> fetch.

When to Use

Use when users ask about PREVIOUS sessions (not current conversation):

"Did we already fix this?"
"How did we solve X last time?"
"What happened last week?"

3-Layer Workflow (ALWAYS Follow)

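NEVER fetch full details without filtering first. An index row costs ~50-100 tokens versus ~500-1000 for a full observation, so filtering first saves roughly 10x tokens.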

Step 1: Search - Get Index with IDs

Use the search MCP tool:

search(query="authentication", limit=20, project="my-project")

Returns: Table with IDs, timestamps, types, titles (~50-100 tokens/result)

| ID | Time | T | Title | Read |
|----|------|---|-------|------|
| #11131 | 3:48 PM | 🟣 | Added JWT authentication | ~75 |
| #10942 | 2:15 PM | 🔴 | Fixed auth token expiration | ~50 |

Parameters:

query (string) - Search term
limit (number) - Max results, default 20, max 100
project (string) - Project name filter
type (string, optional) - "observations", "sessions", or "prompts"
obs_type (string, optional) - Comma-separated: bugfix, feature, decision, discovery, change
dateStart (string, optional) - YYYY-MM-DD or epoch ms
dateEnd (string, optional) - YYYY-MM-DD or epoch ms
offset (number, optional) - Skip N results
orderBy (string, optional) - "date_desc" (default), "date_asc", "relevance"
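
The parameter constraints above can be sketched as a small client-side helper. This is only an illustration: build_search_args is a hypothetical name, not part of claude-mem, and whether the real tool clamps or rejects out-of-range values is not stated here, so the clamping below is an assumption.

```python
import re

VALID_ORDER = {"date_desc", "date_asc", "relevance"}
DATE_RE = re.compile(r"^\d{4}-\d{2}-\d{2}$")


def build_search_args(query, project, limit=20, order_by="date_desc",
                      date_start=None, date_end=None):
    """Assemble an argument dict for the search tool (hypothetical helper)."""
    if order_by not in VALID_ORDER:
        raise ValueError(f"orderBy must be one of {sorted(VALID_ORDER)}")
    args = {
        "query": query,
        "project": project,
        # Documented default is 20 and max is 100; clamping is an assumption.
        "limit": max(1, min(int(limit), 100)),
        "orderBy": order_by,
    }
    for key, value in (("dateStart", date_start), ("dateEnd", date_end)):
        if value is None:
            continue
        # Accept YYYY-MM-DD strings or epoch milliseconds, as documented.
        if isinstance(value, int) or DATE_RE.match(str(value)):
            args[key] = value
        else:
            raise ValueError(f"{key} must be YYYY-MM-DD or epoch ms")
    return args
```

For example, build_search_args("authentication", "my-project", limit=500) yields a dict whose limit is capped at 100.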

Step 2: Timeline - Get Context Around Interesting Results

Use the timeline MCP tool:

timeline(anchor=11131, depth_before=3, depth_after=3, project="my-project")

Or find anchor automatically from query:

timeline(query="authentication", depth_before=3, depth_after=3, project="my-project")

Returns: up to depth_before + 1 + depth_after items in chronological order, with observations, sessions, and prompts interleaved around the anchor.

Parameters:

anchor (number, optional) - Observation ID to center around
query (string, optional) - Find anchor automatically if anchor not provided
depth_before (number, optional) - Items before anchor, default 5, max 20
depth_after (number, optional) - Items after anchor, default 5, max 20
project (string) - Project name filter
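
The window size follows directly from the two depth parameters. A minimal sketch of that arithmetic, assuming out-of-range depths are clamped to the documented max of 20 (timeline_window is a hypothetical helper, not a claude-mem function):

```python
def timeline_window(depth_before=5, depth_after=5, max_depth=20):
    """Return (depth_before, depth_after, max_items) after clamping.

    max_items counts the anchor itself plus the items on either side.
    """
    before = max(0, min(depth_before, max_depth))
    after = max(0, min(depth_after, max_depth))
    return before, after, before + 1 + after
```

With depth_before=3 and depth_after=3 the window holds at most 7 items; with the defaults it holds at most 11.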

Step 3: Fetch - Get Full Details ONLY for Filtered IDs

Review titles from Step 1 and context from Step 2. Pick relevant IDs. Discard the rest.

Use the get_observations MCP tool:

get_observations(ids=[11131, 10942])

ALWAYS batch 2+ observations into one get_observations call: a single request instead of N.

Parameters:

ids (array of numbers, required) - Observation IDs to fetch
orderBy (string, optional) - "date_desc" (default), "date_asc"
limit (number, optional) - Max observations to return
project (string, optional) - Project name filter

Returns: Complete observation objects with title, subtitle, narrative, facts, concepts, files (~500-1000 tokens each)
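
The "pick relevant IDs, discard the rest" step can be sketched as follows. This assumes the Step 1 index arrives as the pipe-delimited table shown earlier; parse_index and pick_ids are hypothetical helpers for illustration, not part of claude-mem.

```python
def parse_index(table_text):
    """Parse the search index table into a list of row dicts."""
    rows = []
    for line in table_text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Keep only data rows: five cells, the first one shaped like "#11131".
        if len(cells) != 5 or not cells[0].startswith("#"):
            continue
        if not cells[0][1:].isdigit():
            continue
        rows.append({
            "id": int(cells[0][1:]),
            "time": cells[1],
            "type": cells[2],
            "title": cells[3],
            "read": cells[4],  # kept as raw text, e.g. "~75"
        })
    return rows


def pick_ids(rows, keyword):
    """Return IDs whose title mentions keyword (case-insensitive)."""
    needle = keyword.lower()
    return [row["id"] for row in rows if needle in row["title"].lower()]
```

The resulting list is what would be passed as ids to get_observations in a single batched call.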

Examples

Find recent bug fixes:

search(query="bug", type="observations", obs_type="bugfix", limit=20, project="my-project")

Find what happened last week:

search(type="observations", dateStart="2025-11-11", limit=20, project="my-project")

Understand context around a discovery:

timeline(anchor=11131, depth_before=5, depth_after=5, project="my-project")

Batch fetch details:

get_observations(ids=[11131, 10942, 10855], orderBy="date_desc")

Why This Workflow?

Search index: ~50-100 tokens per result
Full observation: ~500-1000 tokens each
Batch fetch: 1 HTTP request vs N individual requests
10x token savings by filtering before fetching
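
A rough worked estimate of the savings, using the midpoints of the ranges above (75 tokens per index row, 750 per full observation). Both midpoints and the estimate_tokens helper are assumptions for illustration only.

```python
def estimate_tokens(n_results, n_fetched, index_cost=75, full_cost=750):
    """Compare filter-then-fetch against fetching every result in full.

    Returns (filtered_total, unfiltered_total) in tokens.
    """
    filtered = n_results * index_cost + n_fetched * full_cost
    unfiltered = n_results * full_cost
    return filtered, unfiltered
```

Under these assumptions, searching 20 results and fetching 2 costs about 3000 tokens against 15000 for fetching all 20, a 5x saving. The ratio approaches 10x as the share of results actually fetched shrinks, since an index row is about a tenth the cost of a full observation.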

Smart-Explore Language Support

Smart-explore tools (smart_search, smart_outline, smart_unfold) use tree-sitter AST parsing. The following languages are supported out of the box.

24 Bundled Languages

JS, TS, Python, Go, Rust, Ruby, Java, C, C++, Kotlin, Swift, PHP, Elixir, Lua, Scala, Bash, Haskell, Zig, CSS, SCSS, TOML, YAML, SQL, Markdown

Markdown Special Support

Markdown files get structure-aware parsing beyond generic tree-sitter:

Heading hierarchy -- #/##/### headings are extracted as nested symbols (sections contain subsections)
Code block detection -- fenced code blocks are surfaced as code symbols with language annotation
Section-aware unfold -- smart_unfold on a heading returns the full section content (heading through all subsections until the next heading of equal or higher level)
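
The section boundary rule can be sketched in a few lines. This is not the tree-sitter implementation used by smart_unfold; it is a plain line-based approximation, and unfold_section is a hypothetical name.

```python
import re

HEADING_RE = re.compile(r"^(#{1,6})\s+\S")
FENCE = "`" * 3  # fenced code block delimiter


def unfold_section(lines, start):
    """Return the section that begins with the heading at index start.

    The section runs until the next heading of equal or higher level
    (that is, the same number of '#' characters or fewer).
    """
    match = HEADING_RE.match(lines[start])
    if not match:
        raise ValueError("line at start is not a heading")
    level = len(match.group(1))
    end = len(lines)
    in_fence = False
    for i in range(start + 1, len(lines)):
        if lines[i].lstrip().startswith(FENCE):
            in_fence = not in_fence  # '#' lines inside code fences are not headings
            continue
        if in_fence:
            continue
        m = HEADING_RE.match(lines[i])
        if m and len(m.group(1)) <= level:
            end = i
            break
    return lines[start:end]
```

Unfolding a ## heading therefore includes any ### subsections beneath it and stops at the next ## or # heading.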

User-Installable Grammars via `.claude-mem.json`

Add custom tree-sitter grammars for languages not in the bundled set. Place .claude-mem.json in the project root:

{
  "grammars": {
    "gleam": {
      "package": "tree-sitter-gleam",
      "extensions": [".gleam"]
    },
    "protobuf": {
      "package": "tree-sitter-proto",
      "extensions": [".proto"],
      "query": ".claude-mem/queries/proto.scm"
    }
  }
}

Fields:

package (string, required) -- npm package name for the tree-sitter grammar
extensions (array of strings, required) -- file extensions to associate with this language
query (string, optional) -- path to a custom .scm query file for symbol extraction. If omitted, a generic query is used.

Rules:

User grammars do NOT override bundled languages. If a language is already bundled, the entry is ignored.
The npm package must be installed in the project (npm install tree-sitter-gleam).
Config is cached per project root. Changes to .claude-mem.json take effect on next worker restart.
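
The fields and the no-override rule can be sketched as a loader. This is an assumption-laden illustration, not claude-mem's code: load_user_grammars is a hypothetical name, the BUNDLED set is a partial stand-in keyed by language name, and skipping invalid entries silently is a guess at behavior the rules above do not specify.

```python
import json

# Assumed, partial stand-in for the bundled language set (illustration only).
BUNDLED = {"javascript", "typescript", "python", "go", "rust", "markdown"}


def load_user_grammars(config_text, bundled=BUNDLED):
    """Return user grammar entries that are valid and not already bundled."""
    config = json.loads(config_text)
    accepted = {}
    for name, entry in config.get("grammars", {}).items():
        if name in bundled:
            continue  # user grammars do not override bundled languages
        if not isinstance(entry, dict):
            continue
        package = entry.get("package")
        extensions = entry.get("extensions")
        # package and extensions are required; query is optional.
        if not isinstance(package, str):
            continue
        if not isinstance(extensions, list) or not all(
            isinstance(ext, str) for ext in extensions
        ):
            continue
        accepted[name] = {
            "package": package,
            "extensions": extensions,
            "query": entry.get("query"),  # None means the generic query is used
        }
    return accepted
```

Given the example config above plus a "python" entry, only gleam and protobuf would be accepted.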

Knowledge Agents

Want synthesized answers instead of raw records? Use /knowledge-agent to build a queryable corpus from your observation history. The knowledge agent reads all matching observations and answers questions conversationally.
