<quick_start> Start session:
pwd && git status && git log --oneline -5 cat PROJECT_CONTEXT.md 2>/dev/null
→ Auto-spawns Observer team (non-negotiable)
Feature development: Contract → Builder Team → Observer monitors → Security gate → Ship
End session:
-
Observer final report
-
Security sweep: gitleaks detect --source .
-
Update PROJECT_CONTEXT.md
-
Log costs to costs/daily-YYYY-MM-DD.json
</quick_start>
<success_criteria> Workflow is successful when:
-
Context scan completed at session start
-
Observer team spawned and monitoring (non-negotiable)
-
Contract defined before any feature implementation
-
Observer BLOCKER gate checked before phase transitions
-
Security sweep passes before any commits
-
Cost tracking updated (daily.json, mtd.json)
-
Observer final report generated at end of day
-
PROJECT_CONTEXT.md updated at session end
-
All security gates passed before shipping </success_criteria>
-
Session Management: "start day", "begin session", "what's the status", "end day", "wrap up", "done for today"
-
Feature Development: "new feature", "build", "implement"
-
Dual-Team: "dual team", "observer team", "builder team", "spawn observers", "devil's advocate"
-
Debugging: "debug", "investigate", "why is this broken"
-
Research: "research", "evaluate", "should we use"
DUAL-TEAM ARCHITECTURE
Every session runs two concurrent teams under the Orchestrator:
ORCHESTRATOR
┌────┴────┐
BUILDER TEAM OBSERVER TEAM
(ships fast) (watches quality)
├ Lead Builder ├ Code Quality (haiku)
├ Builder(s) └ Architecture (sonnet)
└ Devil's Adv. └─ (Devil's Advocate)
Observer team is non-negotiable. It always runs, even for "small" changes.
-
Builders optimize for velocity — ship features via worktrees or subagents
-
Observers optimize for correctness — detect drift, debt, gaps, scope creep
-
Devil's advocate on each team prevents groupthink and blind spots
Observers write findings to .claude/OBSERVER_*.md with severity levels:
-
🔴 BLOCKER — stop work, fix immediately
-
🟡 WARNING — fix before merge or log
-
🔵 INFO — backlog
Deep dive: See reference/dual-team-architecture.md
START DAY
Pre-Flight Checks
git status --short | head -5 [ -f package.json ] && [ ! -d node_modules ] && echo "Run npm install" [ -f requirements.txt ] && [ ! -d .venv ] && echo "Run pip install" [ -f .env.example ] && [ ! -f .env ] && echo "Copy .env.example to .env"
Context Scan (Mandatory)
pwd && git status && git log --oneline -5 cat PROJECT_CONTEXT.md 2>/dev/null || echo "No context file" cat CLAUDE.md 2>/dev/null cat PLANNING.md 2>/dev/null
Observer Team Spawn (Automatic)
Spawn Observer team concurrent with standup:
-
Code Quality Observer (haiku) — tech debt, test gaps, imports
-
Architecture Observer (sonnet) — contract drift, scope creep, design violations
Detect Native Agent Teams support
if [ "${CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS}" = "1" ]; then echo "Native Agent Teams enabled — using DAG task system"
Use TeamCreate + SendMessage for Observer coordination
else echo "Native Agent Teams not enabled — falling back to manual worktree spawn"
Use subagent-teams-skill Task tool pattern
fi
Sprint Plan
Include Observer BLOCKER gate in sprint plan — no phase transitions if active BLOCKERs in .claude/OBSERVER_ALERTS.md .
Output Format
Session Start: [PROJECT_NAME]
Completed (Last Session)
- Task 1
In Progress
| Task | Branch/Worktree | Status |
|---|---|---|
| API endpoint | feature/api | 70% |
Observer Status
- Team: ACTIVE (Code Quality + Architecture)
- Active blockers: 0
Today's Priority Queue
- [BUILDER] Feature implementation
- [OBSERVER] Continuous monitoring
Cost Context
- Today: $0.00 | MTD: $12.34 | Budget: $100
Deep dive: See reference/start-day-protocol.md
RESEARCH PHASE
Trigger: Before ANY feature development involving new frameworks, APIs, or architectural decisions.
Scan → Evaluate → Decide
-
Check existing solutions in your repos and MCP cookbook
-
Use research-skill checklist for framework selection
-
Cost projection before building
-
Create RESEARCH.md → FINDINGS.md with GO/NO-GO recommendation
⛔ Gate: Human checkpoint required before proceeding
Deep dive: See reference/research-workflow.md
FEATURE DEVELOPMENT
Phase 0: CONTRACT DEFINITION (Before Code)
Feature Contract: [NAME]
Endpoints / Interfaces
- POST /api/widgets → { id, name, created_at }
Scope Boundaries
- IN SCOPE: [list]
- OUT OF SCOPE: [list]
Observer Checkpoints
- Architecture Observer approves contract
- Code Quality Observer runs after each merge
⛔ Gate: Architecture Observer must approve contract before Phase 1
Phase 1: BUILDER TEAM SPAWN
Option A: Worktree sessions (long tasks, 30+ min)
git worktree add -b feature/api ~/tmp/worktrees/$(basename $(pwd))/api
Option B: Native Agent Teams (short tasks, <20 min)
Use TeamCreate + Task tool subagents
Option C: Hybrid — worktrees for Builders, Task tool for Observers
Each builder gets a WORKTREE_TASK.md with:
-
Scope boundary (what they CAN and CANNOT touch)
-
Contract reference
-
Devil's advocate mandate (for one builder per cycle)
Phase 2: OBSERVER MONITORING (Concurrent)
Observers run parallel to builders on a 5-10 minute loop:
-
Pull latest from builder branches
-
Run 7 drift detection patterns (see reference/observer-patterns.md )
-
Write findings to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md
-
Escalate BLOCKERs to .claude/OBSERVER_ALERTS.md
Phase 3: SECURITY + QUALITY GATE
Security scans
semgrep --config auto . gitleaks detect --source . npm audit --audit-level=critical || pip-audit pytest --cov=src || npm test -- --coverage
⛔ Gate: ALL must pass + no active Observer BLOCKERs
gate = ( sast_clean AND secrets_found == 0 AND critical_vulns == 0 AND test_coverage >= 80 AND observer_blockers == 0 # NEW: Observer gate )
Phase 4: SHIP
git diff main...HEAD git add . && git commit -m "feat: [description]" git push echo '{"feature": "X", "cost": 1.23}' >> costs/by-feature.jsonl
Deep dive: See reference/feature-development.md
DEBUG MODE
Trigger: When standard troubleshooting fails or issue is complex.
Evidence → Hypothesize → Test → Verify
-
Evidence gathering — exact error, reproduction steps, expected vs actual
-
Hypothesis formation — 3+ hypotheses with evidence for each
-
Systematic testing — one variable at a time
-
Verification — root cause confirmed before committing fix
Critical Rules
-
❌ NO DRIVE-BY FIXES — explain WHY before committing
-
❌ NO GUESSING — verify everything
-
✅ Use all tools: MCP servers, web search, extended thinking
-
✅ One variable at a time
Deep dive: See reference/debug-methodology.md
END DAY
Phase 1: Observer Final Report (Before Security Sweep)
Observers generate final reports:
-
Summary of all findings (resolved + open)
-
Metrics: debt items, test coverage delta, contract compliance
-
Write to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md
⛔ Gate: Review Observer report before proceeding
Phase 2: Security Sweep (Mandatory)
gitleaks detect --source . --verbose git log -p | grep -E "(password|secret|api.?key|token)" || echo "Clean" npm audit --audit-level=critical 2>/dev/null || pip-audit 2>/dev/null
⛔ Gate: ALL must pass before any commits
Phase 3: Context + Metrics + Cleanup
PROJECT_CONTEXT.md Update
Completed This Session
- Feature X
Observer Summary
- Blockers resolved: N | Warnings logged: N | Debt delta: +/-N
Tomorrow's Priorities
- Next task
Archive observer reports to .claude/OBSERVER_HISTORY/ , reset for next session.
Cost tracking
echo '{"total": 0.47}' >> costs/daily-$(date +%Y-%m-%d).json
Portfolio metrics
git diff --stat $(git log --since="today 00:00" --format="%H" | tail -1)..HEAD 2>/dev/null
Deep dive: See reference/end-day-protocol.md
COST GATE
Pre-Flight Budget Check
COST_FILE=~/.claude/daily-cost.json SPENT=$(jq '.spent' "$COST_FILE" 2>/dev/null || echo 0) BUDGET=$(jq '.budget_monthly' "$COST_FILE" 2>/dev/null || echo 100) echo "MTD: $$SPENT / $$BUDGET"
% of Budget Action
< 50% Proceed normally
50-80% Cost warning, suggest model downgrade
80-95% WARN — Ask user before proceeding
95% BLOCK — Require explicit override
Deep dive: See reference/cost-tracking.md
ROLLBACK / RECOVERY
-
Stash current work → find last known good → selective rollback or full revert → verify tests → investigate root cause
-
Use debug-like-expert-skill for root cause analysis
Deep dive: See reference/rollback-recovery.md
SKILL INVOCATION QUICK REFERENCE
Need Invoke Model
Spawn builder team agent-teams-skill or Task tool with team_name
sonnet
Spawn observer team Auto at START DAY, or workflow-orchestrator
haiku/sonnet
Debug a failing test debug-like-expert-skill
sonnet
Review code quality superpowers:requesting-code-review
sonnet
Run security sweep security-skill
sonnet
Track costs cost-metering-skill
haiku
Write tests testing-skill
sonnet
Design API contract api-design-skill
sonnet
Plan architecture planning-prompts-skill
opus
Capture metrics portfolio-artifact-skill
haiku
Parallel build (worktrees) agent-teams-skill
sonnet
Parallel build (in-session) subagent-teams-skill
sonnet
Map task → best agent agent-capability-matrix-skill
—
Native Agent Teams (Experimental)
export CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 claude --worktree # auto-creates worktree branch
Features: DAG task system, peer-to-peer messaging, TeammateIdle/TaskCompleted hooks, shared task lists, isolation: "worktree" for subagents.
Deep dive: See reference/agent-routing.md for complete 70+ agent catalog
PROGRESS RENDERING
Use TaskCreate with activeForm for live UI spinners:
TaskCreate({ subject: "Plan architecture", activeForm: "Planning architecture" }) TaskUpdate({ taskId: "1", status: "in_progress" }) // → live spinner TaskUpdate({ taskId: "1", status: "completed" }) // → checkmark
Use addBlockedBy for phase sequencing. Markdown tables for summaries.
CLAUDE CODE COMMANDS
Command Workflow
/start-day
Start Day — context scan, Observer spawn, cost status
/build-feature <name>
Feature Dev — contract → build → observe → ship
/end-day
End Day — Observer report, security sweep, context save
/quick-fix <issue>
Debug — evidence → hypothesis → fix
/cost-check
Display daily/MTD spend and budget status
GTME PERSPECTIVE
This dual-team workflow demonstrates advanced GTME capabilities:
-
Process Engineering — Quality control built into lifecycle, not bolted on
-
Parallel Orchestration — Managing concurrent teams with different objectives
-
Adversarial Thinking — Devil's advocate prevents "happy path only" trap
-
Cost Awareness — Observers use cheaper models; unit economics thinking
-
Measurable Output — Observer reports quantify debt, coverage, compliance
Portfolio Value: Produces auditable artifacts demonstrating engineering discipline + process thinking + cost optimization.