Codex Debug-With-File Prompt

Overview

Enhanced evidence-based debugging with documented exploration process. Records understanding evolution, consolidates insights, and uses analysis to correct misunderstandings.

Core workflow: Explore → Document → Log → Analyze → Correct Understanding → Fix → Verify

Key enhancements over /prompts:debug:

understanding.md: Timeline of exploration and learning
Analysis-assisted correction: Validates and corrects hypotheses
Consolidation: Simplifies proven-wrong understanding to avoid clutter
Learning retention: Preserves what was learned, even from failed attempts

Target Bug

$BUG

Execution Process

Session Detection: ├─ Check if debug session exists for this bug ├─ EXISTS + understanding.md exists → Continue mode └─ NOT_FOUND → Explore mode

Explore Mode: ├─ Locate error source in codebase ├─ Document initial understanding in understanding.md ├─ Generate testable hypotheses with analysis validation ├─ Add NDJSON logging instrumentation └─ Output: Hypothesis list + await user reproduction

Analyze Mode: ├─ Parse debug.log, validate each hypothesis ├─ Use analysis to evaluate hypotheses and correct understanding ├─ Update understanding.md with: │ ├─ New evidence │ ├─ Corrected misunderstandings (strikethrough + correction) │ └─ Consolidated current understanding └─ Decision: ├─ Confirmed → Fix root cause ├─ Inconclusive → Add more logging, iterate └─ All rejected → Assisted new hypotheses

Fix & Cleanup: ├─ Apply fix based on confirmed hypothesis ├─ User verifies ├─ Document final understanding + lessons learned ├─ Remove debug instrumentation └─ If not fixed → Return to Analyze mode

Implementation Details

Session Setup & Mode Detection

Step 0: Determine Project Root

检测项目根目录，确保 .workflow/ 产物位置正确：

PROJECT_ROOT=$(git rev-parse --show-toplevel 2>/dev/null || pwd)

优先通过 git 获取仓库根目录；非 git 项目回退到 pwd 取当前绝对路径。存储为 {projectRoot} ，后续所有 .workflow/ 路径必须以此为前缀。

const getUtc8ISOString = () => new Date(Date.now() + 8 * 60 * 60 * 1000).toISOString() const projectRoot = bash('git rev-parse --show-toplevel 2>/dev/null || pwd').trim()

const bugSlug = "$BUG".toLowerCase().replace(/[^a-z0-9]+/g, '-').substring(0, 30) const dateStr = getUtc8ISOString().substring(0, 10)

const sessionId = DBG-${bugSlug}-${dateStr} const sessionFolder = ${projectRoot}/.workflow/.debug/${sessionId} const debugLogPath = ${sessionFolder}/debug.log const understandingPath = ${sessionFolder}/understanding.md const hypothesesPath = ${sessionFolder}/hypotheses.json

// Auto-detect mode const sessionExists = fs.existsSync(sessionFolder) const hasUnderstanding = sessionExists && fs.existsSync(understandingPath) const logHasContent = sessionExists && fs.existsSync(debugLogPath) && fs.statSync(debugLogPath).size > 0

const mode = logHasContent ? 'analyze' : (hasUnderstanding ? 'continue' : 'explore')

if (!sessionExists) { bash(mkdir -p ${sessionFolder}) }

Explore Mode

Step 1.1: Locate Error Source

// Extract keywords from bug description const keywords = extractErrorKeywords("$BUG")

// Search codebase for error locations const searchResults = [] for (const keyword of keywords) { const results = Grep({ pattern: keyword, path: ".", output_mode: "content", "-C": 3 }) searchResults.push({ keyword, results }) }

// Identify affected files and functions const affectedLocations = analyzeSearchResults(searchResults)

Step 1.2: Document Initial Understanding

Create understanding.md :

Understanding Document

Session ID: ${sessionId} Bug Description: $BUG Started: ${getUtc8ISOString()}

Exploration Timeline

Iteration 1 - Initial Exploration (${timestamp})

Current Understanding

Based on bug description and initial code search:

Error pattern: ${errorPattern}
Affected areas: ${affectedLocations.map(l => l.file).join(', ')}
Initial hypothesis: ${initialThoughts}

Evidence from Code Search

${searchResults.map(r => ` Keyword: "${r.keyword}"

Found in: ${r.results.files.join(', ')}
Key findings: ${r.insights} `).join('\n')}

Next Steps

Generate testable hypotheses
Add instrumentation
Await reproduction

Current Consolidated Understanding

${initialConsolidatedUnderstanding}

Step 1.3: Generate Hypotheses

Analyze the bug and generate 3-5 testable hypotheses:

// Hypothesis generation based on error pattern const HYPOTHESIS_PATTERNS = { "not found|missing|undefined|未找到": "data_mismatch", "0|empty|zero|registered": "logic_error", "timeout|connection|sync": "integration_issue", "type|format|parse": "type_mismatch" }

function generateHypotheses(bugDescription, affectedLocations) { // Generate targeted hypotheses based on error analysis // Each hypothesis includes: // - id: H1, H2, ... // - description: What might be wrong // - testable_condition: What to log // - logging_point: Where to add instrumentation // - evidence_criteria: What confirms/rejects it return hypotheses }

Save to hypotheses.json :

{ "iteration": 1, "timestamp": "2025-01-21T10:00:00+08:00", "hypotheses": [ { "id": "H1", "description": "Data structure mismatch - expected key not present", "testable_condition": "Check if target key exists in dict", "logging_point": "file.py:func:42", "evidence_criteria": { "confirm": "data shows missing key", "reject": "key exists with valid value" }, "likelihood": 1, "status": "pending" } ] }

Step 1.4: Add NDJSON Instrumentation

For each hypothesis, add logging at the specified location:

Python template:

region debug [H{n}]

try: import json, time _dbg = { "sid": "{sessionId}", "hid": "H{n}", "loc": "{file}:{line}", "msg": "{testable_condition}", "data": { # Capture relevant values here }, "ts": int(time.time() * 1000) } with open(r"{debugLogPath}", "a", encoding="utf-8") as _f: _f.write(json.dumps(_dbg, ensure_ascii=False) + "\n") except: pass

endregion

JavaScript/TypeScript template:

// region debug [H{n}] try { require('fs').appendFileSync("{debugLogPath}", JSON.stringify({ sid: "{sessionId}", hid: "H{n}", loc: "{file}:{line}", msg: "{testable_condition}", data: { /* Capture relevant values */ }, ts: Date.now() }) + "\n"); } catch(_) {} // endregion

Step 1.5: Output to User

Hypotheses Generated

Based on error "$BUG", generated {n} hypotheses:

{hypotheses.map(h => `

${h.id}: ${h.description}

Logging at: ${h.logging_point}
Testing: ${h.testable_condition}
Evidence to confirm: ${h.evidence_criteria.confirm}
Evidence to reject: ${h.evidence_criteria.reject} `).join('')}

Debug log: ${debugLogPath}

Next: Run reproduction steps, then come back for analysis.

Analyze Mode

Step 2.1: Parse Debug Log

// Parse NDJSON log const entries = Read(debugLogPath).split('\n') .filter(l => l.trim()) .map(l => JSON.parse(l))

// Group by hypothesis const byHypothesis = groupBy(entries, 'hid')

// Validate each hypothesis for (const [hid, logs] of Object.entries(byHypothesis)) { const hypothesis = hypotheses.find(h => h.id === hid) const latestLog = logs[logs.length - 1]

// Check if evidence confirms or rejects hypothesis const verdict = evaluateEvidence(hypothesis, latestLog.data) // Returns: 'confirmed' | 'rejected' | 'inconclusive' }

Step 2.2: Analyze Evidence and Correct Understanding

Review the debug log and evaluate each hypothesis:

Parse all log entries
Group by hypothesis ID
Compare evidence against expected criteria
Determine verdict: confirmed | rejected | inconclusive
Identify incorrect assumptions from previous understanding
Generate corrections

Step 2.3: Update Understanding with Corrections

Append new iteration to understanding.md :

Iteration ${n} - Evidence Analysis (${timestamp})

Log Analysis Results

${results.map(r => ` ${r.id}: ${r.verdict.toUpperCase()}

Evidence: ${JSON.stringify(r.evidence)}
Reasoning: ${r.reason} `).join('\n')}

Corrected Understanding

Previous misunderstandings identified and corrected:

${corrections.map(c => `

~~${c.wrong}~~ → ${c.corrected}
- Why wrong: ${c.reason}
- Evidence: ${c.evidence} `).join('\n')}

New Insights

${newInsights.join('\n- ')}

${confirmedHypothesis ? `

Root Cause Identified

${confirmedHypothesis.id}: ${confirmedHypothesis.description}

Evidence supporting this conclusion: ${confirmedHypothesis.supportingEvidence} :

Next Steps

${nextSteps} `}

Current Consolidated Understanding (Updated)

${consolidatedUnderstanding}

Step 2.4: Update hypotheses.json

{ "iteration": 2, "timestamp": "2025-01-21T10:15:00+08:00", "hypotheses": [ { "id": "H1", "status": "rejected", "verdict_reason": "Evidence shows key exists with valid value", "evidence": {...} }, { "id": "H2", "status": "confirmed", "verdict_reason": "Log data confirms timing issue", "evidence": {...} } ], "corrections": [ { "wrong_assumption": "...", "corrected_to": "...", "reason": "..." } ] }

Fix & Verification

Step 3.1: Apply Fix

Based on confirmed hypothesis, implement the fix in the affected files.

Step 3.2: Document Resolution

Append to understanding.md :

Iteration ${n} - Resolution (${timestamp})

Fix Applied

Modified files: ${modifiedFiles.join(', ')}
Fix description: ${fixDescription}
Root cause addressed: ${rootCause}

Verification Results

${verificationResults}

Lessons Learned

What we learned from this debugging session:

${lesson1}
${lesson2}
${lesson3}

Key Insights for Future

${insight1}
${insight2}

Step 3.3: Cleanup

Remove debug instrumentation by searching for region markers:

const instrumentedFiles = Grep({ pattern: "# region debug|// region debug", output_mode: "files_with_matches" })

for (const file of instrumentedFiles) { // Remove content between region markers removeDebugRegions(file) }

Session Folder Structure

{projectRoot}/.workflow/.debug/DBG-{slug}-{date}/ ├── debug.log # NDJSON log (execution evidence) ├── understanding.md # Exploration timeline + consolidated understanding └── hypotheses.json # Hypothesis history with verdicts

Understanding Document Template

Understanding Document

Session ID: DBG-xxx-2025-01-21 Bug Description: [original description] Started: 2025-01-21T10:00:00+08:00

Exploration Timeline

Iteration 1 - Initial Exploration (2025-01-21 10:00)

Current Understanding

...

Evidence from Code Search

...

Hypotheses Generated

...

Iteration 2 - Evidence Analysis (2025-01-21 10:15)

Log Analysis Results

...

Corrected Understanding

~~[wrong]~~ → [corrected]

Analysis Results

...

Current Consolidated Understanding

What We Know

[valid understanding points]

What Was Disproven

~~[disproven assumptions]~~

Current Investigation Focus

[current focus]

Remaining Questions

[open questions]

Debug Log Format (NDJSON)

Each line is a JSON object:

{"sid":"DBG-xxx-2025-01-21","hid":"H1","loc":"file.py:func:42","msg":"Check dict keys","data":{"keys":["a","b"],"target":"c","found":false},"ts":1734567890123}

Field Description

sid

Session ID

hid

Hypothesis ID (H1, H2, ...)

loc

Code location

msg

What's being tested

data

Captured values

Timestamp (ms)

Iteration Flow

First Call (BUG="error"): ├─ No session exists → Explore mode ├─ Extract error keywords, search codebase ├─ Document initial understanding in understanding.md ├─ Generate hypotheses ├─ Add logging instrumentation └─ Await user reproduction

After Reproduction (BUG="error"): ├─ Session exists + debug.log has content → Analyze mode ├─ Parse log, evaluate hypotheses ├─ Update understanding.md with: │ ├─ Evidence analysis results │ ├─ Corrected misunderstandings (strikethrough) │ ├─ New insights │ └─ Updated consolidated understanding ├─ Update hypotheses.json with verdicts └─ Decision: ├─ Confirmed → Fix → Document resolution ├─ Inconclusive → Add logging, document next steps └─ All rejected → Assisted new hypotheses

Output: ├─ {projectRoot}/.workflow/.debug/DBG-{slug}-{date}/debug.log ├─ {projectRoot}/.workflow/.debug/DBG-{slug}-{date}/understanding.md (evolving document) └─ {projectRoot}/.workflow/.debug/DBG-{slug}-{date}/hypotheses.json (history)

Error Handling

Situation Action

Empty debug.log Verify reproduction triggered the code path

All hypotheses rejected Generate new hypotheses based on disproven assumptions

Fix doesn't work Document failed fix attempt, iterate with refined understanding

5 iterations Review consolidated understanding, escalate with full context

Understanding too long Consolidate aggressively, archive old iterations to separate file

Consolidation Rules

When updating "Current Consolidated Understanding":

Simplify disproven items: Move to "What Was Disproven" with single-line summary
Keep valid insights: Promote confirmed findings to "What We Know"
Avoid duplication: Don't repeat timeline details in consolidated section
Focus on current state: What do we know NOW, not the journey
Preserve key corrections: Keep important wrong→right transformations for learning

Bad (cluttered):

Current Consolidated Understanding

In iteration 1 we thought X, but in iteration 2 we found Y, then in iteration 3... Also we checked A and found B, and then we checked C...

Good (consolidated):

Current Consolidated Understanding

What We Know

Error occurs during runtime update, not initialization
Config value is None (not missing key)

What Was Disproven

~~Initialization error~~ (Timing evidence)
~~Missing key hypothesis~~ (Key exists)

Current Investigation Focus

Why is config value None during update?

Key Features

Feature Description

NDJSON logging Structured debug log with hypothesis tracking

Hypothesis generation Analysis-assisted hypothesis creation

Exploration documentation understanding.md with timeline

Understanding evolution Timeline + corrections tracking

Error correction Strikethrough + reasoning for wrong assumptions

Consolidated learning Current understanding section

Hypothesis history hypotheses.json with verdicts

Analysis validation At key decision points

When to Use

Best suited for:

Complex bugs requiring multiple investigation rounds
Learning from debugging process is valuable
Team needs to understand debugging rationale
Bug might recur, documentation helps prevention

Now execute the debug-with-file workflow for bug: $BUG

debug-with-file

Safety Notice

Copy this and send it to your AI assistant to learn

Understanding Document

Exploration Timeline

Iteration 1 - Initial Exploration (${timestamp})

Current Understanding

Evidence from Code Search

Next Steps

Current Consolidated Understanding

region debug [H{n}]

endregion

Hypotheses Generated

${h.id}: ${h.description}

Iteration ${n} - Evidence Analysis (${timestamp})

Log Analysis Results

Corrected Understanding

New Insights

Root Cause Identified

Next Steps

Current Consolidated Understanding (Updated)

Iteration ${n} - Resolution (${timestamp})

Fix Applied

Verification Results

Lessons Learned

Key Insights for Future

Understanding Document

Exploration Timeline

Iteration 1 - Initial Exploration (2025-01-21 10:00)

Current Understanding

Evidence from Code Search

Hypotheses Generated

Iteration 2 - Evidence Analysis (2025-01-21 10:15)

Log Analysis Results

Corrected Understanding

Analysis Results

Current Consolidated Understanding

What We Know

What Was Disproven

Current Investigation Focus

Remaining Questions

Current Consolidated Understanding

Current Consolidated Understanding

What We Know

What Was Disproven

Current Investigation Focus

Source Transparency

Related Skills

skill-generator

review-code

ccw-help

skill-tuning