<quick_start> Start session:

pwd && git status && git log --oneline -5
cat PROJECT_CONTEXT.md 2>/dev/null

→ Auto-spawns Observer team (non-negotiable)

Feature development: Contract → Builder Team → Observer monitors → Security gate → Ship

End session:

Observer final report
Security sweep: gitleaks detect --source .
Update PROJECT_CONTEXT.md
Log costs to costs/daily-YYYY-MM-DD.json

</quick_start>

<success_criteria> Workflow is successful when:

Context scan completed at session start
Observer team spawned and monitoring (non-negotiable)
Contract defined before any feature implementation
Observer BLOCKER gate checked before phase transitions
Security sweep passes before any commits
Cost tracking updated (daily.json, mtd.json)
Observer final report generated at end of day
PROJECT_CONTEXT.md updated at session end
All security gates passed before shipping </success_criteria>
Session Management: "start day", "begin session", "what's the status", "end day", "wrap up", "done for today"
Feature Development: "new feature", "build", "implement"
Dual-Team: "dual team", "observer team", "builder team", "spawn observers", "devil's advocate"
Debugging: "debug", "investigate", "why is this broken"
Research: "research", "evaluate", "should we use"

DUAL-TEAM ARCHITECTURE

Every session runs two concurrent teams under the Orchestrator:

          ORCHESTRATOR
          ┌────┴────┐
    BUILDER TEAM   OBSERVER TEAM
    (ships fast)   (watches quality)
    ├ Lead Builder  ├ Code Quality (haiku)
    ├ Builder(s)    └ Architecture (sonnet)
    └ Devil's Adv.    └─ (Devil's Advocate)

Observer team is non-negotiable. It always runs, even for "small" changes.

Builders optimize for velocity — ship features via worktrees or subagents
Observers optimize for correctness — detect drift, debt, gaps, scope creep
Devil's advocate on each team prevents groupthink and blind spots

Observers write findings to .claude/OBSERVER_*.md with severity levels:

🔴 BLOCKER — stop work, fix immediately
🟡 WARNING — fix before merge or log
🔵 INFO — backlog

Deep dive: See reference/dual-team-architecture.md

START DAY

Pre-Flight Checks

git status --short | head -5
[ -f package.json ] && [ ! -d node_modules ] && echo "Run npm install"
[ -f requirements.txt ] && [ ! -d .venv ] && echo "Run pip install"
[ -f .env.example ] && [ ! -f .env ] && echo "Copy .env.example to .env"

Context Scan (Mandatory)

pwd && git status && git log --oneline -5
cat PROJECT_CONTEXT.md 2>/dev/null || echo "No context file"
cat CLAUDE.md 2>/dev/null
cat PLANNING.md 2>/dev/null

Observer Team Spawn (Automatic)

Spawn Observer team concurrent with standup:

Code Quality Observer (haiku) — tech debt, test gaps, imports
Architecture Observer (sonnet) — contract drift, scope creep, design violations

# Detect Native Agent Teams support
if [ "${CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS}" = "1" ]; then
  echo "Native Agent Teams enabled: using DAG task system"
  # Use TeamCreate + SendMessage for Observer coordination
else
  echo "Native Agent Teams not enabled: falling back to manual worktree spawn"
  # Use subagent-teams-skill Task tool pattern
fi

Sprint Plan

Include the Observer BLOCKER gate in the sprint plan: no phase transitions while .claude/OBSERVER_ALERTS.md lists active BLOCKERs.
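
The gate can be checked mechanically. The following is a minimal Python sketch, not the skill's actual implementation. The source defines the severity labels but not the layout of .claude/OBSERVER_ALERTS.md, so the one-finding-per-line format and the "[resolved]" marker are assumptions made for illustration, and the function names are hypothetical.

```python
import re

# ASSUMPTION: each finding is one line that starts with its severity label,
# e.g. "🔴 BLOCKER contract drift in /api/widgets", and a resolved finding
# carries the text "[resolved]". The source does not specify this layout.
BLOCKER_LINE = re.compile(r"^\s*(?:[-*]\s*)?🔴\s*BLOCKER\b(?P<rest>.*)$")


def active_blockers(alerts_text: str) -> list[str]:
    """Return the BLOCKER lines that are not marked as resolved."""
    found = []
    for line in alerts_text.splitlines():
        match = BLOCKER_LINE.match(line)
        if match and "[resolved]" not in match.group("rest").lower():
            found.append(line.strip())
    return found


def may_transition(alerts_text: str) -> bool:
    """A phase transition is allowed only when no BLOCKER is active."""
    return not active_blockers(alerts_text)
```

In practice the text would be read from .claude/OBSERVER_ALERTS.md before each phase change, and a missing file could be treated as an empty string.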

Output Format

Session Start: [PROJECT_NAME]

Completed (Last Session)

Task 1

In Progress

Task	Branch/Worktree	Status
API endpoint	feature/api	70%

Observer Status

Team: ACTIVE (Code Quality + Architecture)
Active blockers: 0

Today's Priority Queue

[BUILDER] Feature implementation
[OBSERVER] Continuous monitoring

Cost Context

Today: $0.00 | MTD: $12.34 | Budget: $100

Deep dive: See reference/start-day-protocol.md

RESEARCH PHASE

Trigger: Before ANY feature development involving new frameworks, APIs, or architectural decisions.

Scan → Evaluate → Decide

Check existing solutions in your repos and MCP cookbook
Use research-skill checklist for framework selection
Cost projection before building
Create RESEARCH.md → FINDINGS.md with GO/NO-GO recommendation

⛔ Gate: Human checkpoint required before proceeding

Deep dive: See reference/research-workflow.md

FEATURE DEVELOPMENT

Phase 0: CONTRACT DEFINITION (Before Code)

Feature Contract: [NAME]

Endpoints / Interfaces

POST /api/widgets → { id, name, created_at }

Scope Boundaries

IN SCOPE: [list]
OUT OF SCOPE: [list]

Observer Checkpoints

Architecture Observer approves contract
Code Quality Observer runs after each merge

⛔ Gate: Architecture Observer must approve contract before Phase 1

Phase 1: BUILDER TEAM SPAWN

Option A: Worktree sessions (long tasks, 30+ min)

git worktree add -b feature/api "$HOME/tmp/worktrees/$(basename "$(pwd)")/api"

Option B: Native Agent Teams (short tasks, <20 min)

Use TeamCreate + Task tool subagents

Option C: Hybrid — worktrees for Builders, Task tool for Observers

Each builder gets a WORKTREE_TASK.md with:

Scope boundary (what they CAN and CANNOT touch)
Contract reference
Devil's advocate mandate (for one builder per cycle)
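
A scope boundary is easiest to enforce when it is expressed as a list of directories a builder may touch. The Python sketch below shows one way to flag files outside that list. The function name and the idea of encoding the boundary as directory prefixes are assumptions for illustration; the source does not prescribe a format for WORKTREE_TASK.md.

```python
from pathlib import PurePosixPath


def out_of_scope(changed_files, allowed_dirs):
    """Return the changed files that fall outside every allowed directory.

    ASSUMPTION: the scope boundary is given as repository-relative
    directories such as "src/api"; this encoding is hypothetical.
    """
    allowed = [PurePosixPath(d) for d in allowed_dirs]
    violations = []
    for name in changed_files:
        path = PurePosixPath(name)
        # A file is in scope if an allowed directory is one of its ancestors.
        if not any(path == root or root in path.parents for root in allowed):
            violations.append(name)
    return violations
```

The list of changed files could come from `git diff --name-only main...HEAD` in the builder's worktree. Comparing path components rather than string prefixes keeps a directory such as src/apiv2 from being mistaken for src/api.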

Phase 2: OBSERVER MONITORING (Concurrent)

Observers run parallel to builders on a 5-10 minute loop:

Pull latest from builder branches
Run 7 drift detection patterns (see reference/observer-patterns.md)
Write findings to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md
Escalate BLOCKERs to .claude/OBSERVER_ALERTS.md

Phase 3: SECURITY + QUALITY GATE

Security scans

semgrep --config auto .
gitleaks detect --source .
npm audit --audit-level=critical || pip-audit
pytest --cov=src || npm test -- --coverage

⛔ Gate: ALL must pass + no active Observer BLOCKERs

gate = (
    sast_clean
    AND secrets_found == 0
    AND critical_vulns == 0
    AND test_coverage >= 80
    AND observer_blockers == 0  # NEW: Observer gate
)
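
The gate expression above is pseudocode. A runnable Python version might look like the sketch below, which also reports which conditions failed. The GateInputs type and the function names are hypothetical; the five conditions and the 80% coverage threshold come from the gate expression itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GateInputs:
    """Results collected from the scans; field names mirror the pseudocode."""
    sast_clean: bool
    secrets_found: int
    critical_vulns: int
    test_coverage: float  # percent, 0-100
    observer_blockers: int


def gate_failures(g: GateInputs, min_coverage: float = 80.0) -> list[str]:
    """Return the names of every failed condition (empty list means pass)."""
    failures = []
    if not g.sast_clean:
        failures.append("sast")
    if g.secrets_found != 0:
        failures.append("secrets")
    if g.critical_vulns != 0:
        failures.append("critical_vulns")
    if g.test_coverage < min_coverage:
        failures.append("coverage")
    if g.observer_blockers != 0:
        failures.append("observer_blockers")
    return failures


def gate_passes(g: GateInputs) -> bool:
    """True only when all five conditions hold."""
    return not gate_failures(g)
```

Returning the list of failures rather than a bare boolean makes it clear which check to fix before retrying Phase 4.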

Phase 4: SHIP

git diff main...HEAD
git add . && git commit -m "feat: [description]"
git push
echo '{"feature": "X", "cost": 1.23}' >> costs/by-feature.jsonl

Deep dive: See reference/feature-development.md

DEBUG MODE

Trigger: When standard troubleshooting fails or issue is complex.

Evidence → Hypothesize → Test → Verify

Evidence gathering — exact error, reproduction steps, expected vs actual
Hypothesis formation — 3+ hypotheses with evidence for each
Systematic testing — one variable at a time
Verification — root cause confirmed before committing fix

Critical Rules

❌ NO DRIVE-BY FIXES — explain WHY before committing
❌ NO GUESSING — verify everything
✅ Use all tools: MCP servers, web search, extended thinking
✅ One variable at a time

Deep dive: See reference/debug-methodology.md

END DAY

Phase 1: Observer Final Report (Before Security Sweep)

Observers generate final reports:

Summary of all findings (resolved + open)
Metrics: debt items, test coverage delta, contract compliance
Write to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md

⛔ Gate: Review Observer report before proceeding

Phase 2: Security Sweep (Mandatory)

gitleaks detect --source . --verbose
git log -p | grep -E "(password|secret|api.?key|token)" || echo "Clean"
npm audit --audit-level=critical 2>/dev/null || pip-audit 2>/dev/null

⛔ Gate: ALL must pass before any commits

Phase 3: Context + Metrics + Cleanup

PROJECT_CONTEXT.md Update

Completed This Session

Feature X

Observer Summary

Blockers resolved: N | Warnings logged: N | Debt delta: +/-N

Tomorrow's Priorities

Next task

Archive observer reports to .claude/OBSERVER_HISTORY/, then reset for the next session.

Cost tracking

echo '{"total": 0.47}' >> costs/daily-$(date +%Y-%m-%d).json
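
Because the command above appends, a day with several sessions leaves one JSON object per line in the file rather than a single JSON document. The Python sketch below writes entries the same way and sums them back. The function names and the costs_dir and day parameters are hypothetical; only the file naming and the "total" key come from the command above.

```python
import json
from datetime import date
from pathlib import Path


def log_cost(total, costs_dir="costs", day=None):
    """Append one {"total": ...} line to costs/daily-YYYY-MM-DD.json."""
    day = day or date.today()
    path = Path(costs_dir) / f"daily-{day.isoformat()}.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps({"total": total}) + "\n")
    return path


def daily_total(path):
    """Sum the "total" field across every non-empty line in the file."""
    with Path(path).open(encoding="utf-8") as fh:
        return sum(json.loads(line)["total"] for line in fh if line.strip())
```

Reading the file line by line avoids the parse error that `json.load` would raise on a file holding more than one object.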

Portfolio metrics

git diff --stat $(git log --since="today 00:00" --format="%H" | tail -1)..HEAD 2>/dev/null

Deep dive: See reference/end-day-protocol.md

COST GATE

Pre-Flight Budget Check

COST_FILE=~/.claude/daily-cost.json
SPENT=$(jq '.spent' "$COST_FILE" 2>/dev/null || echo 0)
BUDGET=$(jq '.budget_monthly' "$COST_FILE" 2>/dev/null || echo 100)
# Escape the literal dollar sign; an unescaped $$ expands to the shell's PID
echo "MTD: \$${SPENT} / \$${BUDGET}"

% of Budget	Action
< 50%	Proceed normally
50-80%	Cost warning, suggest model downgrade
80-95%	WARN: ask user before proceeding
> 95%	BLOCK: require explicit override
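
The tiers above translate directly into a threshold function. This is a minimal Python sketch under one stated assumption: the table's ranges share their endpoints, so exactly 50% and exactly 80% are assigned here to the stricter tier, and exactly 95% stays in the "ask user" tier. The function name and return labels are hypothetical.

```python
def budget_action(spent: float, budget: float) -> str:
    """Map month-to-date spend to the action for its budget tier."""
    if budget <= 0:
        raise ValueError("budget must be positive")
    pct = 100.0 * spent / budget
    if pct < 50:
        return "proceed"
    if pct < 80:
        return "warn_suggest_downgrade"
    if pct <= 95:  # ASSUMPTION: exactly 95% still asks rather than blocks
        return "ask_user"
    return "block"
```

For the sample figures used elsewhere in this document, `budget_action(12.34, 100)` falls in the lowest tier.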

Deep dive: See reference/cost-tracking.md

ROLLBACK / RECOVERY

Stash current work → find last known good → selective rollback or full revert → verify tests → investigate root cause
Use debug-like-expert-skill for root cause analysis

Deep dive: See reference/rollback-recovery.md

SKILL INVOCATION QUICK REFERENCE

Need	Invoke	Model
Spawn builder team	agent-teams-skill or Task tool with team_name	sonnet
Spawn observer team	Auto at START DAY, or workflow-orchestrator	haiku/sonnet
Debug a failing test	debug-like-expert-skill	sonnet
Review code quality	superpowers:requesting-code-review	sonnet
Run security sweep	security-skill	sonnet
Track costs	cost-metering-skill	haiku
Write tests	testing-skill	sonnet
Design API contract	api-design-skill	sonnet
Plan architecture	planning-prompts-skill	opus
Capture metrics	portfolio-artifact-skill	haiku
Parallel build (worktrees)	agent-teams-skill	sonnet
Parallel build (in-session)	subagent-teams-skill	sonnet
Map task → best agent	agent-capability-matrix-skill	-

Native Agent Teams (Experimental)

export CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1
claude --worktree # auto-creates worktree branch

Features: DAG task system, peer-to-peer messaging, TeammateIdle/TaskCompleted hooks, shared task lists, isolation: "worktree" for subagents.

Deep dive: See reference/agent-routing.md for the complete catalog of 70+ agents

PROGRESS RENDERING

Use TaskCreate with activeForm for live UI spinners:

TaskCreate({ subject: "Plan architecture", activeForm: "Planning architecture" })
TaskUpdate({ taskId: "1", status: "in_progress" }) // → live spinner
TaskUpdate({ taskId: "1", status: "completed" })   // → checkmark

Use addBlockedBy for phase sequencing and Markdown tables for summaries.

CLAUDE CODE COMMANDS

Command	Workflow
/start-day	Start Day: context scan, Observer spawn, cost status
/build-feature <name>	Feature Dev: contract → build → observe → ship
/end-day	End Day: Observer report, security sweep, context save
/quick-fix <issue>	Debug: evidence → hypothesis → fix
/cost-check	Display daily/MTD spend and budget status

GTME PERSPECTIVE

This dual-team workflow demonstrates advanced GTME capabilities:

Process Engineering — Quality control built into lifecycle, not bolted on
Parallel Orchestration — Managing concurrent teams with different objectives
Adversarial Thinking — Devil's advocate prevents "happy path only" trap
Cost Awareness — Observers use cheaper models; unit economics thinking
Measurable Output — Observer reports quantify debt, coverage, compliance

Portfolio Value: Produces auditable artifacts demonstrating engineering discipline + process thinking + cost optimization.

