docs: add smart-explore benchmark report and update skill with benchmark data

Add public benchmark report documenting the A/B comparison between Smart Explore and the standard Explore agent (17.8x cheaper discovery, 19.4x cheaper targeted reads). Update SKILL.md with benchmark-accurate token economics, completeness guarantee, map-first principle, and Explore agent escalation guidance. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 22:22:41 -05:00
parent 7015301d8f
commit 44a7b2fcb9
3 changed files with 208 additions and 7 deletions
@@ -7,6 +7,8 @@ description: Token-optimized structural code search using tree-sitter AST parsin

 Structural code exploration using AST parsing. **This skill overrides your default exploration behavior.** While this skill is active, use smart_search/smart_outline/smart_unfold as your primary tools instead of Read, Grep, and Glob.

+**Core principle:** Index first, fetch on demand. Give yourself a map of the code before loading implementation details. The question before every file read should be: "do I need to see all of this, or can I get a structural overview first?" The answer is almost always: get the map.
+
 ## Your Next Tool Call

 This skill only loads instructions. You must call the MCP tools yourself. Your next action should be one of:
@@ -71,7 +73,7 @@ Review symbols from Steps 1-2. Pick the ones you need. Unfold only those:
 smart_unfold(file_path="services/worker-service.ts", symbol_name="shutdown")
 ```

-**Returns:** Full source code of the specified symbol including JSDoc, decorators, and complete implementation (~1-7k tokens depending on symbol size)
+**Returns:** Full source code of the specified symbol including JSDoc, decorators, and complete implementation (~400-2,100 tokens depending on symbol size). AST node boundaries guarantee completeness regardless of symbol size — unlike Read + agent summarization, which may truncate long methods.

 **Parameters:**

@@ -85,6 +87,7 @@ Use these only when smart_* tools are the wrong fit:
 - **Grep:** Exact string/regex search ("find all TODO comments", "where is `ensureWorkerStarted` defined?")
 - **Read:** Small files under ~100 lines, non-code files (JSON, markdown, config)
 - **Glob:** File path patterns ("find all test files")
+- **Explore agent:** When you need synthesized understanding across 6+ files, architecture narratives, or answers to open-ended questions like "how does this entire system work end-to-end?" Smart-explore is a scalpel — it answers "where is this?" and "show me that." It doesn't synthesize cross-file data flows, design decisions, or edge cases across an entire feature.

 For code files over ~100 lines, prefer smart_outline + smart_unfold over Read.

@@ -132,10 +135,11 @@ Use smart_* tools for code exploration, Read for non-code files. Mix freely.

 | Approach | Tokens | Use Case |
 |----------|--------|----------|
-| smart_outline | ~1,500 | "What's in this file?" |
-| smart_unfold | ~1,600 | "Show me this function" |
-| smart_search | ~2,000-6,000 | "How does X work?" |
+| smart_outline | ~1,000-2,000 | "What's in this file?" |
+| smart_unfold | ~400-2,100 | "Show me this function" |
+| smart_search | ~2,000-6,000 | "Find all X across the codebase" |
+| search + unfold | ~3,000-8,000 | End-to-end: find and read (the primary workflow) |
 | Read (full file) | ~12,000+ | When you truly need everything |
-| Explore agent | ~20,000-40,000 | Same as smart_search, 6-12x more expensive |
+| Explore agent | ~39,000-59,000 | Cross-file synthesis with narrative |

-**8x savings** on file understanding (outline + unfold vs Read). **6-12x savings** on exploration vs Explore agent.
+**4-8x savings** on file understanding (outline + unfold vs Read). **11-18x savings** on codebase exploration vs Explore agent. The narrower the query, the wider the gap — a 27-line function costs 55x less to read via unfold than via an Explore agent, because the agent still reads the entire file.