claude-mem

Author	SHA1	Message	Date
Alex Newman	2b8fbcf50e	Merge main into thedotmack/file-read-timeline-inject Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-05 03:00:06 -07:00
Alex Newman	76a27296f0	fix: wire up Cursor integration in installer (#1605 ) * fix: wire up Cursor integration in installer — was incorrectly marked "coming soon" CursorHooksInstaller.ts was fully built but never connected to the installer. Set supported: true in IDE detection and call installCursorHooks in the setup flow, matching the pattern used by other integrations. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: wire up Cursor MCP configuration during install PR review flagged that the hint says "hooks + MCP integration" but configureCursorMcp() was never called during install. Now invoked after hooks install with graceful fallback if MCP setup fails. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 22:44:49 -07:00
Alex Newman	9063c5d8a7	fix: block memory agent prose-skip responses at prompt and runtime levels Observer prompt now explicitly requires XML observation blocks or empty responses — prose explanations like "Skipping" are discarded. ResponseProcessor logs a warning when non-XML content is received. Recording focus expanded to include concrete debugging findings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 19:39:01 -07:00
Alex Newman	29ef3f5603	fix: downgrade concept-type cleanup log from error to debug (#1606 ) The parser correctly strips observation types from concepts arrays when the LLM ignores the prompt instruction. This is routine data normalization, not an error — downgrade to debug to reduce log noise. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 19:21:38 -07:00
Alex Newman	70a8edc5b1	fix: restore full interactive installer — Claude Code CLI delegation was Claude-Code-only The install simplification in `21b10b46` over-applied scope: it replaced the entire runInstallCommand (interactive IDE multi-select, --ide flag, 13 IDE setup dispatchers) with just two `claude` CLI commands. The intent was to simplify the Claude Code path only. Now: Claude Code uses `claude plugin marketplace add` + `claude plugin install`. All other IDEs get the full installer flow (file copy, registration, IDE-specific setup). Interactive multi-select and --ide flag are restored. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 18:57:53 -07:00
Alex Newman	76207fb8d6	Merge branch 'feat/tier-routing-feedback' into thedotmack/merge-alessandro-prs	2026-04-04 15:18:34 -07:00
Alessandro Costa	42cc863bf2	fix: address CodeRabbit review on PR #1569 Critical: - migrations: change version 8 → 25 to avoid collision with MigrationRunner.addObservationHierarchicalFields (uses version 8) - SessionRoutes: remove duplicate imports that prevent compilation Major: - SessionRoutes: call applyTierRouting() before every generator spawn (stale-recovery and crash-recovery paths were missing it) - applyTierRouting: clear session.modelOverride at top before re-evaluating to prevent stale tier from persisting across spawns Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:18:13 -07:00
Alessandro Costa	0fcc078873	feat: tier routing by queue complexity + observation feedback table Tier Routing: - Inspect pending queue before starting generator - Summarize messages → CLAUDE_MEM_TIER_SUMMARY_MODEL (e.g., Opus) - All simple tools (Read, Glob, Grep, LS) → CLAUDE_MEM_TIER_SIMPLE_MODEL (Haiku) - Mixed/complex → default model (no override) - session.modelOverride in ActiveSession, used by SDKAgent.getModelId() - peekPendingTypes() in PendingMessageStore for non-claiming inspection - Configurable via CLAUDE_MEM_TIER_ROUTING_ENABLED (default: true) Feedback Collection (schema only): - New observation_feedback table via MigrationRunner (schema version 24) - Tracks signal_type (semantic_inject_hit, search_accessed, etc.) - Indexes on observation_id and signal_type - Foundation for future Thompson Sampling optimization Production data (24h tier routing test): - 36 Haiku observations in 4 min, quality indistinguishable from Sonnet - Estimated ~52% cost reduction on SDK Agent usage - 835 → 6,695 feedback signals collected over 13 days Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:18:13 -07:00
Alex Newman	d11c0821bb	fix: correct semantic endpoint doc comment GET→POST, clamp limit 1-20 Follow-up to PR #1568: fix stale doc comment that still said GET, and add limit parameter validation (default 5, clamped to 1-20 range). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:17:11 -07:00
Alessandro Costa	876cc4d837	feat: semantic context injection via Chroma on UserPromptSubmit (#1568 ) * feat: semantic context injection via Chroma on every UserPromptSubmit On each prompt, queries ChromaDB for the top-N most relevant past observations and injects them as additionalContext. Replaces the recency-based "last N observations" approach with relevance-based semantic search. Changes: - session-init.ts: After session init, query /api/context/semantic with user's prompt text. If results found, return as hookSpecificOutput with hookEventName 'UserPromptSubmit'. - SearchRoutes.ts: New GET /api/context/semantic endpoint that queries SearchManager with format='json' and formats results as markdown. - SettingsDefaultsManager.ts: New settings CLAUDE_MEM_SEMANTIC_INJECT (default: true) and CLAUDE_MEM_SEMANTIC_INJECT_LIMIT (default: 5). Key behaviors: - Fires on every UserPromptSubmit (not just SessionStart) - Minimum prompt length: 20 chars (skips "ok", "yes", etc.) - Skips media-only prompts - Graceful degradation: if worker/Chroma unavailable, no injection - Survives /clear: re-injects on next prompt (not session-bound) - Uses workerHttpRequest (v10.6.3 API, not raw fetch) Production data (23 days, 3,400+ observations): - Before: 8 most recent observations (often irrelevant to current topic) - After: 5 most relevant observations (semantic match) - Token cost: ~1800 → ~800-1200 per injection Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address CodeRabbit review on PR #1568 - session-init: don't skip semantic injection when contextInjected=true (only skip agent re-init, semantic lookup must run every prompt) - session-init: normalize SEMANTIC_INJECT toggle via String().toLowerCase() - semantic endpoint: change from GET to POST to avoid URL-length limits and prompt exposure in access logs. Handler accepts both body and query for backwards compatibility. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Alessandro Costa <alessandro@claudio.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:16:46 -07:00
Alessandro Costa	64cce2bf10	fix: resolve 3 upstream bugs (summarize, ChromaSync, HealthMonitor) (#1566 ) * fix: resolve 3 upstream bugs in summarize, ChromaSync, and HealthMonitor 1. summarize.ts: Skip summary when transcript has no assistant message. Prevents error loop where empty transcripts cause repeated failed summarize attempts (~30 errors/day observed in production). 2. ChromaSync.ts: Fallback to chroma_update_documents when add fails with "IDs already exist". Handles partial writes after MCP timeout without waiting for next backfill cycle. 3. HealthMonitor.ts: Replace HTTP-based isPortInUse with atomic socket bind on Unix. Eliminates TOCTOU race when two sessions start simultaneously (HTTP check is non-atomic — both see "port free" before either completes listen()). Updated tests accordingly. All three bugs are pre-existing in v10.5.5. Confirmed via log analysis of 543K lines over 17 days of production usage across two servers. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: add CONTRIB_NOTES.md to gitignore Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address CodeRabbit review on PR #1566 - HealthMonitor: add APPROVED OVERRIDE annotation for Win32 HTTP fallback - ChromaSync: replace chroma_update_documents with delete+add for proper upsert (update only modifies existing IDs, silently ignores missing ones) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Alessandro Costa <alessandro@claudio.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:15:08 -07:00
Alessandro Costa	8958c3335d	feat: drain orphaned pending messages on SIGTERM session completion (#1567 ) * feat: drain orphaned pending messages on session completion (SIGTERM) When deleteSession() aborts the SDK agent via SIGTERM, pending messages in the queue are never processed. Without drain, they remain in 'pending' status forever — no future generator picks them up because the session is already completed. Adds markAllSessionMessagesAbandoned() call after deleteSession() in completeByDbId(). This reuses the existing PendingMessageStore method already used by worker-service.ts terminateSession(). Production evidence: 15 orphaned summarize messages found across completed sessions (ages 3h to 3 days) before this fix. After fix: 0 orphaned messages over 23 days of operation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: document best-effort drain limitation per CodeRabbit review #1567 Add comment noting the rare race condition when generators outlive the 30s SIGTERM timeout. Practical risk is negligible (0 orphans over 23 days). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Alessandro Costa <alessandro@claudio.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:14:25 -07:00
Alex Newman	c7c68e81f4	fix: address 10 unresolved PR review threads - README: add language specifier to fenced code block - paths.ts: guard npmPackageRootDirectory() against bundle structure drift - OpenCodeInstaller: resolve bundle from import.meta.url, not process.cwd() - OpenCodeInstaller: log warnings on AGENTS.md injection failures - WindsurfHooksInstaller: key registry by full workspace path, not basename - uninstall.ts: poll health endpoint to wait for worker exit before file deletion - uninstall.ts: call IDE-specific uninstallers (Gemini, Windsurf, OpenCode, OpenClaw, Codex) - opencode-plugin: cap session tracking Map at 1000 entries with LRU eviction - GeminiCliHooksInstaller: document intentional JSON double-escaping Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:45:53 -07:00
Alex Newman	21b10b4696	refactor: replace custom installer with native Claude plugin commands Delegates to `claude plugin marketplace add` + `claude plugin install` instead of manually copying files, registering marketplace/plugin JSON, running npm install, and dispatching IDE-specific setup. 536 → 36 lines. Also fixes double-shebang in npx-cli bundle (source + esbuild banner). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:35:06 -07:00
Alex Newman	4de417663c	fix: catch corrupt JSON in Gemini CLI status command readGeminiSettings() throws on corrupt JSON since `ae6915b`, but checkGeminiCliHooksStatus() called it without catching — violating its "returns 0 always" contract. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:29:08 -07:00
Alex Newman	190c74492f	fix: address second PR review — clean replace, IDE failure bubbling, bun validation - cpSync now does rmSync before copy to avoid stale file merges - setupIDEs() returns failed IDE list; install reports partial success - runSmartInstall() returns boolean status instead of void - Worker port in next-steps URL reads CLAUDE_MEM_WORKER_PORT env var - Goose YAML regex stops at column-0 keys (prevents eating sibling sections) - AGENTS.md uninstall removes header-only stub files - findBunPath() validated before use in WindsurfHooksInstaller - Cursor marked unsupported in ide-detection until installer is wired Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:17:18 -07:00
Alex Newman	ae6915b88e	fix: address PR review — shebang, double-escaping, data loss, uninstall scope - Add shebang banner to NPX CLI esbuild config so npx claude-mem works - Remove manual backslash pre-escaping in WindsurfHooksInstaller (JSON.stringify handles it) - Scope cache deletion to claude-mem only, not entire vendor namespace - Use getWorkerPort() in OpenCodeInstaller instead of hard-coded 37777 - Throw on corrupt JSON in readJsonSafe/readGeminiSettings/Windsurf to prevent data loss - Fix Cursor install stub to warn instead of silently succeeding - Fix Gemini uninstall to remove individual hooks within groups, not whole groups - Update tests for new corrupt-file-throws behavior Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 13:49:14 -07:00
Alex Newman	2495f98496	refactor: consolidate MCP factory, add non-TTY support, auto-detect transcript watchers - Phase 1: Replace 5 duplicate MCP installers with config-driven factory, extract shared context-injection and json-utils utilities, fix process.execPath usage - Phase 2: Add non-TTY fallback for @clack/prompts to prevent ENOENT in CI/Docker - Phase 3: Wire GeminiCliHooksInstaller through hook command framework with adapter - Phase 4: Auto-start transcript watchers on worker boot when config exists Net -107 lines via DRY consolidation of duplicated installer logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 00:35:55 -07:00
Alex Newman	a2ac116aac	fix: move summary wait + session-complete into Stop hook to prevent lost summaries SessionEnd has a 1.5s hardcoded cap from Claude Code (CLAUDE_CODE_SESSIONEND_HOOKS_TIMEOUT_MS), making it unsuitable for waiting on async work. Previously, the Stop hook would fire-and-forget the summarize request, then SessionEnd would immediately call deleteSession — aborting the SDK agent mid-summary. Now the Stop hook (120s timeout, no cap) owns the full lifecycle: 1. Queue summarize request 2. Poll new GET /api/sessions/status endpoint until queue drains 3. Call /api/sessions/complete after summary finishes SessionEnd is now a true fire-and-forget fallback (process.exit(0) immediately). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 14:05:53 -07:00
Alex Newman	8265fc7aa1	Merge remote-tracking branch 'origin/thedotmack/npx-gemini-cli' into thedotmack/npx-gemini-cli Resolve merge conflicts in adapter index, gemini-cli adapter, and rebuilt CJS artifacts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 13:47:49 -07:00
Alex Newman	76a880a3d6	feat: update install CLI, ESM compat, and Gemini CLI docs Fixes CursorHooksInstaller ESM compatibility, updates install command with improved path resolution, and refreshes built plugin artifacts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 12:38:45 -07:00
Alex Newman	67645041fa	Merge main into thedotmack/file-read-timeline-inject Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 16:11:41 -07:00
Alex Newman	80d1deedbe	fix: address PR review feedback from CodeRabbit - Add sessionId to summarize.ts warning log for easier triage - Add APPROVED OVERRIDE annotation to Windows spawn catch block Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 15:34:42 -07:00
Alex Newman	07ab7000a8	fix: patch 7 critical bugs affecting all non-dev-machine users and Windows 1. Fix esbuild inlining build-machine __dirname as string literal — use CJS-compatible runtime banner with require("node:url").fileURLToPath across worker-service, mcp-server, and context-generator builds. 2. Fix isMainModule check missing .cjs extension and Windows backslash path normalization. 3. Wrap extractLastMessage in try-catch to prevent infinite Stop hook feedback loop on malformed transcripts (exit 0 instead of exit 2). 4. Replace heavy SessionEnd hook (Node→Bun→1.7MB CJS→HTTP) with lightweight inline node -e one-liner (~200ms vs >1s). 5. Add 7 Gemini/OpenRouter error patterns to unrecoverablePatterns circuit breaker to prevent 77K+ retry loops on expired API keys. 6. Preserve CLAUDE_CODE_OAUTH_TOKEN and CLAUDE_CODE_GIT_BASH_PATH in sanitizeEnv instead of stripping them with the CLAUDE_CODE_ prefix. 7. Use PowerShell -EncodedCommand for spawnDaemon to fix path quoting when Windows usernames contain spaces. Closes #1515, #1495, #1475, #1465, #1500, #1513, #1512, #1450, #1460, #1486, #1449, #1481, #1451, #1480, #1453, #1445 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 15:20:29 -07:00
Conductor	5621b67ccd	Saving uncommitted changes before archiving	2026-03-26 19:35:27 -07:00
Alex Newman	a656af2bff	feat: improve Gemini CLI timeline display by stripping ANSI colors and providing markdown fallback	2026-03-25 23:51:56 -07:00
Alex Newman	88636ec012	feat: remove old installer, update docs to npx claude-mem Removes installer/ directory (16 files) — fully replaced by src/npx-cli/. Updates install.sh and installer.js to redirect to npx claude-mem. Adds npx claude-mem as primary install method in docs and README. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 23:02:18 -07:00
Alex Newman	031513d723	feat: add Codex CLI, OpenClaw, and MCP-based IDE integrations Codex CLI: transcript-based integration watching ~/.codex/sessions/, schema bumped to v0.3 with exec_command support, AGENTS.md context. OpenClaw: installer wires pre-built plugin to ~/.openclaw/extensions/, registers in openclaw.json with memory slot and sync config. MCP integrations (6 IDEs): Copilot CLI, Antigravity, Goose, Crush, Roo Code, and Warp — config writing + context injection. Goose uses string-based YAML manipulation (no parser dependency). All 13 IDE targets now supported in npx claude-mem install. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 23:02:18 -07:00
Alex Newman	f2cc33b494	feat: add Gemini CLI, OpenCode, and Windsurf IDE integrations Gemini CLI: platform adapter mapping 6 of 11 hooks, settings.json deep-merge installer, GEMINI.md context injection. OpenCode: plugin with tool.execute.after interceptor, bus events for session lifecycle, claude_mem_search custom tool, AGENTS.md context. Windsurf: platform adapter for tool_info envelope format, hooks.json installer for 5 post-action hooks, .windsurf/rules context injection. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 23:02:18 -07:00
Alex Newman	3a09c1bb1a	feat: add NPX CLI and OpenClaw build pipeline, optimize npm package size Adds esbuild steps for npx-cli (57KB, Node.js ESM) and openclaw plugin (12KB). Creates .npmignore to exclude node_modules and Bun binary from npm package, reducing pack size from 146MB to 2MB. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 23:02:18 -07:00
Alex Newman	85eb796b18	feat: add npx CLI entry point with install, runtime, and IDE detection commands Replaces the old git-clone installer with a direct npm package copy workflow. Supports 13 IDE auto-detection targets, runtime delegation to Bun worker, and pure Node.js install path (no Bun required for installation). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 23:02:18 -07:00
Alex Newman	4d7bec4d05	fix: stop spinner from spinning forever (#1440 ) * fix: stop spinner from spinning forever due to orphaned DB messages The activity spinner never stopped because isAnySessionProcessing() queried ALL pending/processing messages in the database, including orphaned messages from dead sessions that no generator would ever process. Root cause: isAnySessionProcessing() used hasAnyPendingWork() which is a global DB scan. Changed it to use getTotalQueueDepth() which only checks sessions in the active in-memory Map. Additional fixes: - Add terminateSession() to enforce restart-or-terminate invariant - Fix 3 zombie paths in .finally() handler that left sessions alive - Clean up idle sessions from memory on successful completion - Remove redundant bare isProcessing:true broadcast - Replace inline require() with proper accessor - Add 8 regression tests for session termination invariant Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address review findings — idle-timeout race, double broadcast, query amplification - Move pendingCount check before idle-timeout termination to prevent abandoning fresh messages that arrive between idle abort and .finally() - Move broadcastProcessingStatus() inside restart branch only — the else branch already broadcasts via removeSessionImmediate callback - Compute queueDepth once in broadcastProcessingStatus() and derive isProcessing from it, eliminating redundant double iteration Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 14:13:10 -07:00
Alex Newman	c80763390b	feat: file-read decision gate — block reads when observation history exists Add a PreToolUse gate that blocks file reads on first attempt when rich observation history exists, presenting the timeline as feedback. Claude then decides: use get_observations() (skip read, save tokens) or re-read (allowed on second attempt). - FileReadGate: in-memory session-scoped gate with 4h TTL - POST /api/file-context/gate endpoint in worker - stderrMessage plumbing in hook-command for exit code 2 - file-context handler uses gate to block/allow reads Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 12:11:02 -07:00
Alex Newman	47d6d51030	Merge main into thedotmack/file-read-timeline-inject Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 12:10:26 -07:00
Alex Newman	9f529a30f5	feat: strip <system_instruction> tags before DB storage (#1398 ) * feat: strip <system_instruction> tags before database storage Extends the existing tag-stripping mechanism (used for <private> and <claude-mem-context>) to also filter Conductor-injected system instructions, preventing them from being persisted in the observation database. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: also strip <system-instruction> (hyphen variant) before DB storage Conductor uses both <system_instruction> and <system-instruction> tag formats. This adds the hyphen variant to the same stripping mechanism. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 12:08:25 -07:00
Alex Newman	e07b13f7de	fix: proper project isolation and relative path matching for file-context hook - Use getProjectContext(cwd).allProjects for project scoping (same as SessionStart) - Convert absolute file_path to relative using cwd (observations store relative paths) - API accepts comma-separated projects param with IN() SQL filter - Remove basename matching — use full relative path to avoid cross-file collisions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 15:38:53 -07:00
Alex Newman	1d48f63b99	fix: remove project filter from file-context hook — cwd != stored project name The handler was passing input.cwd (full absolute path) as the project filter, but observations store short project names ('san-diego', not '/Users/.../san-diego'). This caused zero results for every query. Removing the filter entirely is better: cross-project observations about the same file are useful for duplicate prevention. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 15:24:34 -07:00
Alex Newman	fb9d917f8a	feat: inject file observation timeline on PreToolUse Read hook When Claude reads a file, the PreToolUse hook queries for existing observations about that file and injects the timeline into context via additionalContext + permissionDecision: allow. This prevents duplicate observations and saves tokens through active rediscovery. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 15:18:54 -07:00
Alex Newman	7e07210635	feat: add timeline-report skill with token economics, compress context output 53% ## Summary - New timeline-report skill for generating narrative project history reports - Compressed markdown context output ~53% (tables → flat compact lines, verbose labels → terse format) - Added `full=true` param to /api/context/inject for fetching all observations - Split TimelineRenderer into separate markdown/color rendering paths - Removed arbitrary file write vulnerability (dump_to_file param) - Fixed timestamp ditto marker leaking across session summary boundaries ## Review - Rebased on main (v10.6.0) to preserve OpenClaw system prompt injection - Reviewed by /review (gstack) + /octo:review (Codex, Gemini, Claude fleet) - Security fix (dump_to_file removal) confirmed by all 3 reviewers - Timestamp bug caught by Codex, fixed 🤖 Generated with [Claude Code](https://claude.com/claude-code)	2026-03-18 13:57:20 -07:00
Glucksberg	9361e33b6d	fix(openclaw): inject context via system prompt instead of overwriting MEMORY.md (#1386 ) * fix(openclaw): inject context via system prompt instead of overwriting MEMORY.md The OpenClaw plugin was overwriting each agent's MEMORY.md with a large auto-generated observation dump (~12-15KB) on every before_agent_start and tool_result_persist event. This conflicts with OpenClaw's design where MEMORY.md is agent-curated long-term memory. Migrate context injection from file-based (writeFile MEMORY.md) to OpenClaw's native before_prompt_build hook, which returns context via appendSystemContext. This keeps MEMORY.md under agent control while still providing cross-session observation context to the LLM. Changes: - Add before_prompt_build hook that returns { appendSystemContext } - Remove writeFile/MEMORY.md sync from before_agent_start - Remove MEMORY.md sync from tool_result_persist (observations still recorded) - Add 60s TTL cache to avoid re-fetching context on every LLM turn - Add syncMemoryFileExclude config for per-agent opt-out - Remove dead workspaceDirsBySessionKey tracking map - Rewrite test suite to verify prompt injection instead of file writes Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(ui): align settings defaults with backend and use nullish coalescing The web UI had two issues causing settings inflation: 1. DEFAULT_SETTINGS in the UI used FULL_COUNT='5' and all token columns 'true', while SettingsDefaultsManager (backend) uses FULL_COUNT='0' and token columns 'false'. Opening the settings modal and saving without changes would silently inflate the context. 2. useSettings used \|\| for fallback, which treats '0' and 'false' as falsy — even when the backend correctly returns these values, the UI would replace them with inflated defaults. Changed to ?? (nullish coalescing) so only null/undefined trigger the fallback. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs(openclaw): update integration docs for system prompt injection Reflect the migration from MEMORY.md file writes to before_prompt_build hook-based context injection: - Update architecture diagram and overview to show new hook flow - Replace "MEMORY.md Live Sync" section with "System Prompt Context Injection" - Update event lifecycle steps (before_agent_start, tool_result_persist) - Add before_prompt_build step with TTL cache description - Document new syncMemoryFileExclude config parameter - Update session tracking to reflect removed workspaceDirsBySessionKey Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: fix terminology and update SKILL.md for system prompt injection Replace "prompt injection" with "context injection" in docs to avoid confusion with the OWASP security term. Update openclaw/SKILL.md to reflect the new before_prompt_build hook and remove stale MEMORY.md references. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Alex Newman <thedotmack@gmail.com>	2026-03-17 17:14:30 -07:00
Alex Newman	80a8c90a1a	feat: add embedded Process Supervisor for unified process lifecycle (#1370 ) * feat: add embedded Process Supervisor for unified process lifecycle management Consolidates scattered process management (ProcessManager, GracefulShutdown, HealthMonitor, ProcessRegistry) into a unified src/supervisor/ module. New: ProcessRegistry with JSON persistence, env sanitizer (strips CLAUDECODE_* vars), graceful shutdown cascade (SIGTERM → 5s wait → SIGKILL with tree-kill on Windows), PID file liveness validation, and singleton Supervisor API. Fixes #1352 (worker inherits CLAUDECODE env causing nested sessions) Fixes #1356 (zombie TCP socket after Windows reboot) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add session-scoped process reaping to supervisor Adds reapSession(sessionId) to ProcessRegistry for killing session-tagged processes on session end. SessionManager.deleteSession() now triggers reaping. Tightens orphan reaper interval from 60s to 30s. Fixes #1351 (MCP server processes leak on session end) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add Unix domain socket support for worker communication Introduces socket-manager.ts for UDS-based worker communication, eliminating port 37777 collisions between concurrent sessions. Worker listens on ~/.claude-mem/sockets/worker.sock by default with TCP fallback. All hook handlers, MCP server, health checks, and admin commands updated to use socket-aware workerHttpRequest(). Backwards compatible — settings can force TCP mode via CLAUDE_MEM_WORKER_TRANSPORT=tcp. Fixes #1346 (port 37777 collision across concurrent sessions) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: remove in-process worker fallback from hook command Removes the fallback path where hook scripts started WorkerService in-process, making the worker a grandchild of Claude Code (killed by sandbox). Hooks now always delegate to ensureWorkerStarted() which spawns a fully detached daemon. Fixes #1249 (grandchild process killed by sandbox) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add health checker and /api/admin/doctor endpoint Adds 30-second periodic health sweep that prunes dead processes from the supervisor registry and cleans stale socket files. Adds /api/admin/doctor endpoint exposing supervisor state, process liveness, and environment health. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: add comprehensive supervisor test suite 64 tests covering all supervisor modules: process registry (18 tests), env sanitizer (8), shutdown cascade (10), socket manager (15), health checker (5), and supervisor API (6). Includes persistence, isolation, edge cases, and cross-module integration scenarios. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: revert Unix domain socket transport, restore TCP on port 37777 The socket-manager introduced UDS as default transport, but this broke the HTTP server's TCP accessibility (viewer UI, curl, external monitoring). Since there's only ever one worker process handling all sessions, the port collision rationale for UDS doesn't apply. Reverts to TCP-only, removing ~900 lines of unnecessary complexity. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: remove dead code found in pre-landing review Remove unused `acceptingSpawns` field from Supervisor class (written but never read — assertCanSpawn uses stopPromise instead) and unused `buildWorkerUrl` import from context handler. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * updated gitignore * fix: address PR review feedback - downgrade HTTP logging, clean up gitignore, harden supervisor - Downgrade request/response HTTP logging from info to debug to reduce noise - Remove unused getWorkerPort imports, use buildWorkerUrl helper - Export ENV_PREFIXES/ENV_EXACT_MATCHES from env-sanitizer, reuse in Server.ts - Fix isPidAlive(0) returning true (should be false) - Add shutdownInitiated flag to prevent signal handler race condition - Make validateWorkerPidFile testable with pidFilePath option - Remove unused dataDir from ShutdownCascadeOptions - Upgrade reapSession log from debug to warn - Rename zombiePidFiles to deadProcessPids (returns actual PIDs) - Clean up gitignore: remove duplicate datasets/, stale ~/ and http/ patterns - Fix tests to use temp directories instead of relying on real PID file Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-16 14:49:23 -07:00
Vincent Leraitre	237a4c37f8	fix: always pass --ssl flag to chroma-mcp in remote mode (#1286 ) * fix: always pass --ssl flag to chroma-mcp in remote mode The chroma-mcp CLI defaults to SSL when using --client-type http. When CLAUDE_MEM_CHROMA_SSL is false (the common case for local ChromaDB servers), buildCommandArgs() omitted --ssl entirely, causing chroma-mcp to attempt an SSL connection to a plain HTTP server and fail with "Could not connect to a Chroma server". Always pass --ssl with an explicit true/false value so the user's CLAUDE_MEM_CHROMA_SSL setting is faithfully forwarded. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * test: add regression tests for ChromaMcpManager SSL flag fix Adds 4 focused test cases verifying buildCommandArgs() produces correct --ssl args, covering SSL=false, SSL=true, unset (defaults to false), and local mode (no --ssl flag). Requested by @xkonjin in PR #1286 review. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: rebuild checked-in bundles to include SSL flag fix Rebuild all bundles against upstream/main so the --ssl <true\|false> fix is present in the runtime artifacts that hooks and the marketplace plugin actually execute. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 20:03:58 -07:00
laihenyi	626654f816	fix: prevent infinite restart loop on FOREIGN KEY constraint errors (#1334 ) The pending-work-restart logic had no retry limit, causing infinite loops when sessions encountered FOREIGN KEY constraint failures. This led to 2000+ error log entries per minute and eventual worker crash via SIGTERM. Two fixes: 1. Add 'FOREIGN KEY constraint failed' to unrecoverable error patterns so it short-circuits immediately instead of falling through to restart 2. Add MAX_PENDING_RESTARTS (3) limit to pending-work-restart path as a safety net for any future unhandled persistent errors Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 20:03:48 -07:00
enzoricciulli	e7ba9acaa7	fix: add content-hash dedup to batch observation store methods (#1302 ) storeObservations() and storeObservationsAndMarkComplete() were missing the content-hash deduplication that storeObservation() (singular) already had via computeObservationContentHash() and findDuplicateObservation(). This caused the Gemini provider (and potentially others that return multiple observations per response) to insert 2-10x duplicate rows per tool use, since the batch methods inserted unconditionally without checking content_hash. The fix adds the same dedup pattern from storeObservation() to both batch methods: 1. Compute content hash via computeObservationContentHash() 2. Check for existing observation within 30s window via findDuplicateObservation() 3. Skip insert and reuse existing ID if duplicate found 4. Include content_hash column in INSERT statement Fixes #1158 (duplicate observations with Gemini provider) Co-authored-by: Enzo Ricciulli <e.ricciulli@systhema.ai>	2026-03-12 20:01:53 -07:00
antmid	ad902bedd9	fix: auto-repair malformed database schema from cross-version sync (#1308 ) When a claude-mem DB is synced between machines running different versions, orphaned indexes can reference non-existent columns (e.g. idx_observations_content_hash referencing content_hash). This causes SQLite to throw "malformed database schema" on ALL queries, including PRAGMAs, creating a silent 503 failure loop. The fix detects this on startup, uses Python's sqlite3 module to drop the orphaned schema objects (bun:sqlite doesn't support writable_schema modifications), resets migration versions, and lets the idempotent migration system recreate everything properly. Fixes #1307 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 20:01:51 -07:00
GigiTiti-Kai	b88566dcdd	fix(ui): include SSE live data when project filter is active (#1315 ) When a project filter was selected in the Web UI, all SSE live data (observations, summaries, prompts) was completely discarded. Only paginated API data was shown, meaning new real-time events were invisible until the user refreshed the page. Fix: filter SSE data by project before merging with paginated data, instead of discarding it entirely. Fixes #1313 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 20:01:48 -07:00
Rajiv Sinclair	1fac57535e	fix: gracefully handle missing transcript files in worktree sessions (#1326 ) When Claude Code runs in a worktree (via Agent tool with isolation: "worktree"), the transcript path points to the worktree's project directory. After the worktree is cleaned up, the Stop hook fires but the transcript file no longer exists, causing extractLastMessage() to throw. This error triggers Claude to respond, which fires another Stop hook, creating an infinite error loop. Changed throws to warn-and-return-empty so the summarize hook exits cleanly with exit 0 instead of cascading errors. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 19:59:47 -07:00
AlexWorland	10e980cd69	fix: remove unrecognized fields from Claude Code Stop hook output (#1291 ) * fix: remove unrecognized fields from Claude Code Stop hook output Claude Code validates Stop hook JSON output against its hook contract schema which only accepts {decision?, reason?, systemMessage?}. The formatOutput() function was returning {continue, suppressOutput} which are not part of the Claude Code hook API, causing "JSON validation failed" errors on every session stop. Return an empty object {} for the default case (no hookSpecificOutput), preserving only systemMessage when present. This is valid for all hook event types and eliminates the schema validation error. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * test: add unhappy-path tests for formatOutput per PR review Add edge case coverage for malformed input (undefined/null), falsy systemMessage values, non-contract field stripping, and contract key allowlist. Also add defensive null guard to formatOutput matching normalizeInput pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Alex Worland <alexworland@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 19:59:45 -07:00
Nir Alfasi	38d9ac7adb	fix: prevent zombie subprocess accumulation by only trusting exitCode (#1226 ) (#1325 ) proc.killed only means Node sent a signal — the process can still be alive. This caused premature pool slot release, allowing unbounded process spawning. - ensureProcessExit: remove proc.killed from early-exit checks, only trust exitCode - Fix 3 call-site guards that skipped cleanup for signaled-but-alive processes - Add TOTAL_PROCESS_HARD_CAP=10 safety net in waitForSlot() - After SIGKILL, wait up to 1s via exit event instead of blind 200ms sleep - Reduce reaper interval from 5min to 1min, idle threshold from 2min to 1min Closes #1226 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 19:59:42 -07:00
Ben Younes	503bda4868	fix: add null guards for getChromaSync() when Chroma is disabled (#1336 ) When CLAUDE_MEM_CHROMA_ENABLED=false, getChromaSync() returns null. Two call sites were missing null guards, causing "null is not an object" errors on every UserPromptSubmit / session init. Fixes #1294 Vibe-coded by Ousama Ben Younes Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 19:58:03 -07:00

1 2 3 4 5 ...

570 Commits