f38b5b85bc
* docs: add investigation reports for 5 open GitHub issues

  Comprehensive analysis of issues #543, #544, #545, #555, and #557:
  - #557: settings.json not generated, module loader error (node/bun mismatch)
  - #555: Windows hooks not executing, hasIpc always false
  - #545: formatTool crashes on non-JSON tool_input strings
  - #544: mem-search skill hint shown incorrectly to Claude Code users
  - #543: /claude-mem slash command unavailable despite installation

  Each report includes root cause analysis, affected files, and proposed fixes.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix(logger): handle non-JSON tool_input in formatTool (#545)

  Wrap JSON.parse in try-catch to handle raw string inputs (e.g., Bash commands) that aren't valid JSON. Falls back to using the string as-is.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix(context): update mem-search hint to reference MCP tools (#544)

  Update hint messages to reference MCP tools (search, get_observations) instead of the deprecated "mem-search skill" terminology.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix(settings): auto-create settings.json on first load (#557, #543)

  When settings.json doesn't exist, create it with defaults instead of returning in-memory defaults. Creates parent directory if needed.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix(hooks): use bun runtime for hooks except smart-install (#557)

  Change hook commands from node to bun since hooks use bun:sqlite. Keep smart-install.js on node since it bootstraps bun installation.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* chore: rebuild plugin scripts

* docs: clarify that build artifacts must be committed

* fix(docs): update build artifacts directory reference in CLAUDE.md

* test: add test coverage for PR #558 fixes

  - Fix 2 failing tests: update "mem-search skill" → "MCP tools" expectations
  - Add 56 tests for formatTool() JSON.parse crash fix (Issue #545)
  - Add 27 tests for settings.json auto-creation (Issue #543)

  Test coverage includes:
  - formatTool: JSON parsing, raw strings, objects, null/undefined, all tool types
  - Settings: file creation, directory creation, schema migration, edge cases

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix(tests): clean up flaky tests and fix circular dependency

  Phase 1 of test quality improvements:
  - Delete 6 harmful/worthless test files that used problematic mock.module() patterns or tested implementation details rather than behavior:
    - context-builder.test.ts (tested internal implementation)
    - export-types.test.ts (fragile mock patterns)
    - smart-install.test.ts (shell script testing antipattern)
    - session_id_refactor.test.ts (outdated, tested refactoring itself)
    - validate_sql_update.test.ts (one-time migration validation)
    - observation-broadcaster.test.ts (excessive mocking)
  - Fix circular dependency between logger.ts and SettingsDefaultsManager.ts by using late binding pattern - logger now lazily loads settings
  - Refactor mock.module() to spyOn() in several test files for more maintainable and less brittle tests:
    - observation-compiler.test.ts
    - gemini_agent.test.ts
    - error-handler.test.ts
    - server.test.ts
    - response-processor.test.ts

  All 649 tests pass.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* refactor(tests): phase 2 - reduce mock-heavy tests and improve focus

  - Remove mock-heavy query tests from observation-compiler.test.ts, keep real buildTimeline tests
  - Convert session_id_usage_validation.test.ts from 477 to 178 lines of focused smoke tests
  - Remove tests for language built-ins from worker-spawn.test.ts (JSON.parse, array indexing)
  - Rename logger-coverage.test.ts to logger-usage-standards.test.ts for clarity

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* docs(tests): phase 3 - add JSDoc mock justification to test files

  Document mock usage rationale in 5 test files to improve maintainability:
  - error-handler.test.ts: Express req/res mocks, logger spies (~11%)
  - fallback-error-handler.test.ts: Zero mocks, pure function tests
  - session-cleanup-helper.test.ts: Session fixtures, worker mocks (~19%)
  - hook-constants.test.ts: process.platform mock for Windows tests (~12%)
  - session_store.test.ts: Zero mocks, real SQLite :memory: database

  Part of ongoing effort to document mock justifications per TESTING.md guidelines.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* test(integration): phase 5 - add 72 tests for critical coverage gaps

  Add comprehensive test coverage for previously untested areas:
  - tests/integration/hook-execution-e2e.test.ts (10 tests)
    Tests lifecycle hooks execution flow and context propagation
  - tests/integration/worker-api-endpoints.test.ts (19 tests)
    Tests all worker service HTTP endpoints without heavy mocking
  - tests/integration/chroma-vector-sync.test.ts (16 tests)
    Tests vector embedding synchronization with ChromaDB
  - tests/utils/tag-stripping.test.ts (27 tests)
    Tests privacy tag stripping utilities for both <private> and <meta-observation> tags

  All tests use real implementations where feasible, following the project's testing philosophy of preferring integration-style tests over unit tests with extensive mocking.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* context update

* docs: add comment linking DEFAULT_DATA_DIR locations

  Added NOTE comment in logger.ts pointing to the canonical DEFAULT_DATA_DIR in SettingsDefaultsManager.ts. This addresses PR reviewer feedback about the fragility of having the default defined in two places to avoid circular dependencies.

  🤖 Generated with [Claude Code](https://claude.com/claude-code)
  Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
179 lines
6.7 KiB
TypeScript
import { describe, it, expect, beforeEach, afterEach } from 'bun:test';
import { SessionStore } from '../src/services/sqlite/SessionStore.js';

/**
 * Session ID Usage Validation - Smoke Tests for Critical Invariants
 *
 * These tests validate the most critical behaviors of the dual session ID system:
 * - contentSessionId: User's Claude Code conversation session (immutable)
 * - memorySessionId: SDK agent's session ID for resume (captured from SDK response)
 *
 * CRITICAL INVARIANTS:
 * 1. Cross-contamination prevention: Observations from different sessions never mix
 * 2. Resume safety: Resume only allowed when memorySessionId is actually captured (non-NULL)
 * 3. 1:1 mapping: Each contentSessionId maps to exactly one memorySessionId
 */
describe('Session ID Critical Invariants', () => {
  let store: SessionStore;

  beforeEach(() => {
    store = new SessionStore(':memory:');
  });

  afterEach(() => {
    store.close();
  });

  describe('Cross-Contamination Prevention', () => {
    it('should never mix observations from different content sessions', () => {
      // Create two independent sessions
      const content1 = 'user-session-A';
      const content2 = 'user-session-B';
      const memory1 = 'memory-session-A';
      const memory2 = 'memory-session-B';

      const id1 = store.createSDKSession(content1, 'project-a', 'Prompt A');
      const id2 = store.createSDKSession(content2, 'project-b', 'Prompt B');
      store.updateMemorySessionId(id1, memory1);
      store.updateMemorySessionId(id2, memory2);

      // Store observations in each session
      store.storeObservation(memory1, 'project-a', {
        type: 'discovery',
        title: 'Observation A',
        subtitle: null,
        facts: [],
        narrative: null,
        concepts: [],
        files_read: [],
        files_modified: []
      }, 1);

      store.storeObservation(memory2, 'project-b', {
        type: 'discovery',
        title: 'Observation B',
        subtitle: null,
        facts: [],
        narrative: null,
        concepts: [],
        files_read: [],
        files_modified: []
      }, 1);

      // CRITICAL: Each session's observations must be isolated
      const obsA = store.getObservationsForSession(memory1);
      const obsB = store.getObservationsForSession(memory2);

      expect(obsA.length).toBe(1);
      expect(obsB.length).toBe(1);
      expect(obsA[0].title).toBe('Observation A');
      expect(obsB[0].title).toBe('Observation B');

      // Verify no cross-contamination: A's query doesn't return B's data
      expect(obsA.some(o => o.title === 'Observation B')).toBe(false);
      expect(obsB.some(o => o.title === 'Observation A')).toBe(false);
    });
  });

  describe('Resume Safety', () => {
    it('should prevent resume when memorySessionId is NULL (not yet captured)', () => {
      const contentSessionId = 'new-session-123';
      const sessionDbId = store.createSDKSession(contentSessionId, 'test-project', 'First prompt');

      const session = store.getSessionById(sessionDbId);

      // CRITICAL: Before SDK returns real session ID, memory_session_id must be NULL
      expect(session?.memory_session_id).toBeNull();

      // hasRealMemorySessionId check: only resume when non-NULL
      const hasRealMemorySessionId = session?.memory_session_id !== null;
      expect(hasRealMemorySessionId).toBe(false);

      // Resume options should be empty (no resume parameter)
      const resumeOptions = hasRealMemorySessionId ? { resume: session?.memory_session_id } : {};
      expect(resumeOptions).toEqual({});
    });

    it('should allow resume only after memorySessionId is captured', () => {
      const contentSessionId = 'resume-ready-session';
      const capturedMemoryId = 'sdk-returned-session-xyz';

      const sessionDbId = store.createSDKSession(contentSessionId, 'test-project', 'Prompt');

      // Before capture
      let session = store.getSessionById(sessionDbId);
      expect(session?.memory_session_id).toBeNull();

      // Capture memory session ID (simulates SDK response)
      store.updateMemorySessionId(sessionDbId, capturedMemoryId);

      // After capture
      session = store.getSessionById(sessionDbId);
      const hasRealMemorySessionId = session?.memory_session_id !== null;

      expect(hasRealMemorySessionId).toBe(true);
      expect(session?.memory_session_id).toBe(capturedMemoryId);
      expect(session?.memory_session_id).not.toBe(contentSessionId);
    });

    it('should maintain consistent memorySessionId across multiple prompts in same conversation', () => {
      const contentSessionId = 'multi-prompt-session';
      const realMemoryId = 'consistent-memory-id';

      // Prompt 1: Create session
      let sessionDbId = store.createSDKSession(contentSessionId, 'test-project', 'Prompt 1');
      store.updateMemorySessionId(sessionDbId, realMemoryId);

      // Prompt 2: Look up session (createSDKSession uses INSERT OR IGNORE + SELECT)
      sessionDbId = store.createSDKSession(contentSessionId, 'test-project', 'Prompt 2');
      let session = store.getSessionById(sessionDbId);
      expect(session?.memory_session_id).toBe(realMemoryId);

      // Prompt 3: Still same memory ID
      sessionDbId = store.createSDKSession(contentSessionId, 'test-project', 'Prompt 3');
      session = store.getSessionById(sessionDbId);
      expect(session?.memory_session_id).toBe(realMemoryId);
    });
  });

  describe('UNIQUE Constraint Enforcement', () => {
    it('should prevent duplicate memorySessionId (protects against multiple transcripts)', () => {
      const content1 = 'content-session-1';
      const content2 = 'content-session-2';
      const sharedMemoryId = 'shared-memory-id';

      const id1 = store.createSDKSession(content1, 'project', 'Prompt 1');
      const id2 = store.createSDKSession(content2, 'project', 'Prompt 2');

      // First session captures memory ID - should succeed
      store.updateMemorySessionId(id1, sharedMemoryId);

      // Second session tries to use SAME memory ID - should FAIL
      expect(() => {
        store.updateMemorySessionId(id2, sharedMemoryId);
      }).toThrow(); // UNIQUE constraint violation

      // First session still has the ID
      const session1 = store.getSessionById(id1);
      expect(session1?.memory_session_id).toBe(sharedMemoryId);
    });
  });

  describe('Foreign Key Integrity', () => {
    it('should reject observations for non-existent sessions', () => {
      expect(() => {
        store.storeObservation('nonexistent-session-id', 'test-project', {
          type: 'discovery',
          title: 'Invalid FK',
          subtitle: null,
          facts: [],
          narrative: null,
          concepts: [],
          files_read: [],
          files_modified: []
        }, 1);
      }).toThrow(); // FK constraint violation
    });
  });
});
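
The resume-safety tests in this file derive the resume options inline with a strict `!== null` check. A minimal sketch of the same check as a standalone helper follows, assuming a hypothetical `buildResumeOptions` function and `SessionRow` shape (neither appears in the source). It uses a loose `!= null` comparison, so a missing session row (`undefined`) is also treated as not resumable; whether the production code wants that behavior is an assumption.

```typescript
// Hypothetical row shape: only the column the tests in this file inspect.
interface SessionRow {
  memory_session_id: string | null;
}

// Hypothetical helper (not part of SessionStore): returns a resume option
// only when a memory session ID has actually been captured.
function buildResumeOptions(
  session: SessionRow | null | undefined
): { resume?: string } {
  const memoryId = session?.memory_session_id;
  // `!= null` rejects both null (not captured yet) and undefined (no row found).
  return memoryId != null ? { resume: memoryId } : {};
}

// Usage: only the last call yields a `resume` key.
console.log(buildResumeOptions({ memory_session_id: null }));
console.log(buildResumeOptions(undefined));
console.log(buildResumeOptions({ memory_session_id: 'sdk-returned-session-xyz' }));
```

Keeping the check in one function means the "non-NULL only" rule is stated once rather than repeated at each call site.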
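
The invariants this suite exercises can also be illustrated without SQLite. The following is a minimal in-memory sketch, assuming a hypothetical `InMemorySessionModel` class. It is not the real `SessionStore`; it only mimics the behaviors the tests rely on: idempotent session creation, a unique `memorySessionId`, and rejection of observations for unknown sessions.

```typescript
// Hypothetical in-memory model; all names here are illustrative only.
interface ModelRow {
  id: number;
  contentSessionId: string;
  memorySessionId: string | null; // null until "captured"
}

class InMemorySessionModel {
  private rows: ModelRow[] = [];
  private observations: { memorySessionId: string; title: string }[] = [];

  // Mimics INSERT OR IGNORE + SELECT: repeated calls return the same row id.
  createSession(contentSessionId: string): number {
    const existing = this.rows.filter(r => r.contentSessionId === contentSessionId)[0];
    if (existing) return existing.id;
    const row: ModelRow = { id: this.rows.length + 1, contentSessionId, memorySessionId: null };
    this.rows.push(row);
    return row.id;
  }

  // Mimics a UNIQUE constraint on memorySessionId.
  updateMemorySessionId(id: number, memorySessionId: string): void {
    const taken = this.rows.some(r => r.id !== id && r.memorySessionId === memorySessionId);
    if (taken) throw new Error('UNIQUE violation: memorySessionId already in use');
    const row = this.rows.filter(r => r.id === id)[0];
    if (!row) throw new Error('unknown session id ' + id);
    row.memorySessionId = memorySessionId;
  }

  getMemorySessionId(id: number): string | null {
    const row = this.rows.filter(r => r.id === id)[0];
    return row ? row.memorySessionId : null;
  }

  // Mimics a foreign key: some session must own the memorySessionId.
  storeObservation(memorySessionId: string, title: string): void {
    const known = this.rows.some(r => r.memorySessionId === memorySessionId);
    if (!known) throw new Error('FK violation: no session owns that memorySessionId');
    this.observations.push({ memorySessionId, title });
  }

  getObservationTitles(memorySessionId: string): string[] {
    return this.observations
      .filter(o => o.memorySessionId === memorySessionId)
      .map(o => o.title);
  }
}

// Usage: two sessions, one observation each, queried in isolation.
const model = new InMemorySessionModel();
const sessionA = model.createSession('user-session-A');
const sessionB = model.createSession('user-session-B');
model.updateMemorySessionId(sessionA, 'memory-session-A');
model.updateMemorySessionId(sessionB, 'memory-session-B');
model.storeObservation('memory-session-A', 'Observation A');
model.storeObservation('memory-session-B', 'Observation B');
console.log(model.getObservationTitles('memory-session-A'));
```

The sketch scans arrays linearly for brevity; a real store would rely on database constraints and indexes, as the tests' `// UNIQUE constraint violation` and `// FK constraint violation` comments indicate.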