Files

T

Alex Newman 68290a9121 Performance improvements: Token reduction and enhanced summaries (#101 )

* refactor: Reduce continuation prompt token usage by 95 lines

Removed redundant instructions from continuation prompt that were originally
added to mitigate a session continuity issue. That issue has since been
resolved, making these detailed instructions unnecessary on every continuation.

Changes:
- Reduced continuation prompt from ~106 lines to ~11 lines (~95 line reduction)
- Changed "User's Goal:" to "Next Prompt in Session:" (more accurate framing)
- Removed redundant WHAT TO RECORD, WHEN TO SKIP, and OUTPUT FORMAT sections
- Kept concise reminder: "Continue generating observations and progress summaries..."
- Initial prompt still contains all detailed instructions

Impact:
- Significant token savings on every continuation prompt
- Faster context injection with no loss of functionality
- Instructions remain comprehensive in initial prompt

Files modified:
- src/sdk/prompts.ts (buildContinuationPrompt function)
- plugin/scripts/worker-service.cjs (compiled output)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

* refactor: Enhance observation and summary prompts for clarity and token efficiency

* Enhance prompt clarity and instructions in prompts.ts

- Added a reminder to think about instructions before starting work.
- Simplified the continuation prompt instruction by removing "for this ongoing session."

* feat: Enhance settings.json with permissions and deny access to sensitive files

refactor: Remove PLAN-full-observation-display.md and PR_SUMMARY.md as they are no longer needed

chore: Delete SECURITY_SUMMARY.md since it is redundant after recent changes

fix: Update worker-service.cjs to streamline observation generation instructions

cleanup: Remove src-analysis.md and src-tree.md for a cleaner codebase

refactor: Modify prompts.ts to clarify instructions for memory processing

* refactor: Remove legacy worker service implementation

* feat: Enhance summary hook to extract last assistant message and improve logging

- Added function to extract the last assistant message from the transcript.
- Updated summary hook to include last assistant message in the summary request.
- Modified SDKSession interface to store last assistant message.
- Adjusted buildSummaryPrompt to utilize last assistant message for generating summaries.
- Updated worker service and session manager to handle last assistant message in summarize requests.
- Introduced silentDebug utility for improved logging and diagnostics throughout the summary process.

* docs: Add comprehensive implementation plan for ROI metrics feature

Added detailed implementation plan covering:
- Token usage capture from Agent SDK
- Database schema changes (migration #8)
- Discovery cost tracking per observation
- Context hook display with ROI metrics
- Testing and rollout strategy

Timeline: ~20 hours over 4 days
Goal: Empirical data for YC application amendment

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

* feat: Add transcript processing scripts for analysis and formatting

- Implemented `dump-transcript-readable.ts` to generate a readable markdown dump of transcripts, excluding certain entry types.
- Created `extract-rich-context-examples.ts` to extract and showcase rich context examples from transcripts, highlighting user requests and assistant reasoning.
- Developed `format-transcript-context.ts` to format transcript context into a structured markdown format for improved observation generation.
- Added `test-transcript-parser.ts` for validating data extraction from transcript JSONL files, including statistics and error reporting.
- Introduced `transcript-to-markdown.ts` for a complete representation of transcript data in markdown format, showing all context data.
- Enhanced type definitions in `transcript.ts` to support new features and ensure type safety.
- Built `transcript-parser.ts` to handle parsing of transcript JSONL files, including error handling and data extraction methods.

* Refactor hooks and SDKAgent for improved observation handling

- Updated `new-hook.ts` to clean user prompts by stripping leading slashes for better semantic clarity.
- Enhanced `save-hook.ts` to include additional tools in the SKIP_TOOLS set, preventing unnecessary observations from certain command invocations.
- Modified `prompts.ts` to change the structure of observation prompts, emphasizing the observational role and providing a detailed XML output format for observations.
- Adjusted `SDKAgent.ts` to enforce stricter tool usage restrictions, ensuring the memory agent operates solely as an observer without any tool access.

* feat: Enhance session initialization to accept user prompts and prompt numbers

- Updated `handleSessionInit` in `worker-service.ts` to extract `userPrompt` and `promptNumber` from the request body and pass them to `initializeSession`.
- Modified `initializeSession` in `SessionManager.ts` to handle optional `currentUserPrompt` and `promptNumber` parameters.
- Added logic to update the existing session's `userPrompt` and `lastPromptNumber` if a `currentUserPrompt` is provided.
- Implemented debug logging for session initialization and updates to track user prompts and prompt numbers.

---------

Co-authored-by: Claude <noreply@anthropic.com>

2025-11-13 18:22:44 -05:00

9.1 KiB

Raw Blame History

Claude-Mem: AI Development Instructions

What This Project Is

Claude-mem is a Claude Code plugin providing persistent memory across sessions. It captures tool usage, compresses observations using the Claude Agent SDK, and injects relevant context into future sessions.

Your Role: You are working on the plugin itself. When users interact with Claude Code with this plugin installed, your observations get captured and become their persistent memory.

Current Version: 5.5.1

IMPORTANT: Skills Are Auto-Invoked

There is no /skill command. Skills auto-invoke based on description metadata matching user queries. Don't document manual invocation (e.g., "Run /skill troubleshoot"). Instead: "The troubleshoot skill auto-activates when issues are detected."

Critical Architecture Knowledge

The Lifecycle Flow

SessionStart → smart-install.js runs first (pre-hook), then context-hook.ts runs
- Smart installer checks dependencies (cached, only runs on version changes)
- Starts PM2 worker if not healthy
- Injects context from previous sessions (configurable observation count)
UserPromptSubmit → new-hook.ts runs
- Creates session record in SQLite
- Saves raw user prompt for FTS5 search
PostToolUse → save-hook.ts runs
- Captures your tool executions
- Sends to worker service for AI compression
Summary → Summary hook generates session summaries
SessionEnd → cleanup-hook.ts runs
- Marks session complete (graceful, not DELETE)
- Skips on /clear to preserve ongoing sessions

Note: smart-install.js is a pre-hook script (not a lifecycle hook). It's called before context-hook via command chaining in hooks.json and only runs when dependencies need updating.

Key Components

Hooks (src/hooks/*.ts)

Built to plugin/scripts/*-hook.js (ESM format)
Must output valid JSON to hookSpecificOutput field
Called by Claude Code lifecycle events

Worker Service (src/services/worker-service.ts)

Express.js API on port 37777 (configurable via CLAUDE_MEM_WORKER_PORT)
Managed by PM2 (auto-started by hooks)
Built to plugin/worker-service.cjs (CJS format)
Handles AI processing asynchronously to avoid hook timeouts

Database (src/services/sqlite/)

SQLite3 with better-sqlite3 (NOT bun:sqlite - that's legacy)
Location: ~/.claude-mem/claude-mem.db
FTS5 virtual tables for full-text search
SessionStore = CRUD, SessionSearch = FTS5 queries

Search Skill (plugin/skills/mem-search/SKILL.md)

Provides access to all search functionality via HTTP API + skill
Auto-invoked when users ask about past work, decisions, or history
Uses HTTP endpoints instead of MCP tools (~2,250 token savings per session)
10 search operations: observations, sessions, prompts, by-type, by-file, by-concept, timelines, etc.

Chroma Vector Database (src/services/sync/ChromaSync.ts)

Hybrid semantic + keyword search architecture
Automatic vector embedding synchronization
90-day recency filtering for relevant results
Combined with SQLite FTS5 for optimal search performance

Viewer UI (src/ui/viewer/)

React + TypeScript web interface accessible at http://localhost:37777
Real-time memory stream visualization via Server-Sent Events (SSE)
Infinite scroll pagination for observations, sessions, and user prompts
Project filtering and settings persistence
Built to plugin/ui/viewer.html (self-contained bundle via esbuild)
Auto-reconnection and error recovery

How to Make Changes

When You Modify Hooks

npm run build
npm run sync-marketplace

Changes take effect on next Claude Code session. No worker restart needed.

When You Modify Worker Service

npm run build
npm run sync-marketplace
npm run worker:restart

Must restart PM2 worker for changes to take effect.

When You Modify Search Skill

npm run sync-marketplace

Skill changes take effect immediately on next Claude Code session. No build or restart needed (skills are markdown).

When You Modify Viewer UI

npm run build
npm run sync-marketplace
npm run worker:restart

Changes to React components, styles, or viewer logic require rebuilding and restarting the worker. Refresh browser to see changes.

Build Pipeline

npm run build → Compiles TypeScript, outputs to plugin/
npm run sync-marketplace → Syncs to ~/.claude/plugins/marketplaces/thedotmack/
Changes are live for next session (hooks/skills) or after restart (worker)

Coding Standards

Philosophy: Write the dumb, obvious thing first. Add complexity only when you hit the problem.

Key Principles:

YAGNI: Don't build it until you need it
DRY: Extract patterns after second duplication, not before
Fail Fast: Explicit errors beat silent failures
Simple First: Write the obvious solution, optimize only if needed
Delete Aggressively: Less code = fewer bugs

Common anti-patterns to avoid:

Ceremonial wrapper functions for constants (just export the constant)
Unused default parameters (remove if never used)
Magic numbers without named constants
Silent failures instead of explicit errors
Fragile string parsing (use structured JSON output)
Copy-pasted promise wrappers (extract helper functions)
Overengineered "defensive" code for problems you don't have

Common Tasks

Adding a New Hook

Create src/hooks/new-hook.ts
Add to scripts/build-hooks.js build list
Add configuration to plugin/hooks/hooks.json
Build and sync: npm run build && npm run sync-marketplace

Note: smart-install.js is not a hook - it's a pre-hook dependency checker that runs before context-hook via command chaining.

Modifying Database Schema

Update schema in src/services/sqlite/schema.ts
Update SessionStore/SessionSearch classes
Migration strategy: The plugin currently recreates on schema changes (acceptable for alpha)
TODO: Add proper migrations for production

Debugging Worker Issues

pm2 list                    # Check worker status
npm run worker:logs         # View logs
npm run worker:restart      # Restart if needed
pm2 delete claude-mem-worker # Force clean start

Testing Changes Locally

Make changes in src/
npm run build && npm run sync-marketplace
Start new Claude Code session (hooks) or restart worker (worker changes)
Check ~/.claude-mem/claude-mem.db for database state
Use mem-search skill to verify behavior (auto-invoked when asking about past work)

Version Bumps

Use the version-bump skill (auto-invokes when requesting version updates). It handles:

Semantic version increments (patch/minor/major)
Updates all version references (package.json, plugin.json, CLAUDE.md, marketplace.json)
Creates git tags and GitHub releases
Auto-generates CHANGELOG.md from releases

Investigation Best Practices

When investigations fail persistently, use Task agents for comprehensive file analysis instead of repeated grep/search. Deploy agents to read full files and answer specific questions - more efficient than multiple rounds of searching.

Environment Variables

CLAUDE_MEM_MODEL - Model for observations/summaries (default: claude-haiku-4-5)
CLAUDE_MEM_CONTEXT_OBSERVATIONS - Observations injected at SessionStart (default: 50)
CLAUDE_MEM_WORKER_PORT - Worker service port (default: 37777)

Key Design Decisions

Why PM2 Instead of Direct Process

Hooks have strict timeout limits. PM2 manages a persistent background worker, allowing AI processing to continue after hooks complete.

Why SQLite FTS5

Enables instant full-text search across thousands of observations without external dependencies. Automatic sync triggers keep FTS5 tables synchronized.

Why Graceful Cleanup

Changed from aggressive DELETE requests to marking sessions complete. Prevents interrupting summary generation and other async operations.

Why Smart Install Caching

npm install is expensive (2-5s). Caching version state and only installing on changes makes SessionStart nearly instant (10ms).

Why Web-Based Viewer UI

Real-time visibility into memory stream helps users understand what's being captured and how context is being built. SSE provides instant updates without polling. Self-contained HTML bundle (esbuild) eliminates deployment complexity - everything served from a single file.

File Locations

Source: <project-root>/src/ - TypeScript source files Built Plugin: <project-root>/plugin/ - Compiled JavaScript outputs Installed Plugin: ~/.claude/plugins/marketplaces/thedotmack/ - User's installed plugin location Database: ~/.claude-mem/claude-mem.db - SQLite database with observations, sessions, summaries Chroma Database: ~/.claude-mem/chroma/ - Vector embeddings for semantic search Usage Logs: ~/.claude-mem/usage-logs/usage-YYYY-MM-DD.jsonl - Daily API usage tracking

Quick Reference

Build: npm run build Sync: npm run sync-marketplace Worker Restart: npm run worker:restart Worker Logs: npm run worker:logs Usage Analysis: npm run usage:today Viewer UI: http://localhost:37777 (auto-starts with worker)

9.1 KiB Raw Blame History