# Context Engineering for AI-Augmented Development

## Quick Reference
| Task | Primary Skill | Reference |
|---|---|---|
| Write AGENTS.md / CLAUDE.md | agents-project-memory | memory-patterns.md |
| Create implementation plan | dev-workflow-planning | — |
| Write PRD / spec | docs-ai-prd | agentic-coding-best-practices.md |
| Create subagents | agents-subagents | — |
| Set up hooks | agents-hooks | — |
| Configure MCP servers | agents-mcp | — |
| Git workflow + worktrees | dev-git-workflow | ai-agent-worktrees.md |
| Orchestrate parallel agents | agents-swarm-orchestration | — |
| Application security | software-security-appsec | — |
| Assess repo maturity | this skill | maturity-model.md |
| Full idea-to-ship lifecycle | this skill | — |
| Multi-repo coordination | this skill | multi-repo-strategy.md |
| Regulated environment setup | this skill | regulated-environment-patterns.md |
| Fast-track onboarding | this skill | fast-track-guide.md |
| Context lifecycle (CDLC) | this skill | context-development-lifecycle.md |
| Convert existing repos | this skill | repo-conversion-playbook.md |
| Team transformation | this skill | team-transformation-patterns.md |
| Measure AI coding impact | dev-ai-coding-metrics | — |
## The Paradigm Shift

Software development is shifting from tool-centric workflows to context-driven development:
| Dimension | Traditional | Context-Driven |
|---|---|---|
| Source of truth | Jira + Confluence | Repository (AGENTS.md + docs/) |
| Standards | Wiki page | `.claude/rules/` (loaded every session) |
| Execution | Human writes code | Agent writes code with structured context |
| Knowledge transfer | Onboarding meetings | AGENTS.md = instant context |
| Planning | Sprint board | `docs/plans/` with dependency graphs |
| Review | Humans only | Humans + AI disclosure checklist |
**Why it matters:** Unstructured AI coding ("vibe coding") is 19% slower with 1.7x more issues (METR). Structured context engineering inverts this — agents become faster and more reliable than solo coding. But context quality matters more than quantity: ETH Zurich research (March 2026) shows LLM-generated context files degrade performance by 3%, while human-written files help only when limited to non-inferable details.
**Cross-platform convention:** AGENTS.md is the primary file. CLAUDE.md is always a symlink (`ln -s AGENTS.md CLAUDE.md`). Codex reads AGENTS.md directly; Claude Code reads the symlink. One file, two agents, zero drift.
See: references/paradigm-comparison.md for full mapping + migration playbook.
## Complete Lifecycle: Idea to Ship

```mermaid
flowchart LR
    P1["1 CAPTURE\n─────────\nIdea → Spec\n(docs-ai-prd)"]
    P2["2 PLAN\n─────────\nSpec → Plan\n(dev-workflow-planning)"]
    P3["3 CONTEXT\n─────────\nPlan → Repo Context\n(agents-project-memory)"]
    P4["4 EXECUTE\n─────────\nContext → Code\n(agents-swarm-orchestration)"]
    P5["5 VERIFY\n─────────\nCode → Quality Gate\n(agents-hooks)"]
    P6["6 SHIP\n─────────\nVerified → Merged\n(dev-git-workflow)"]
    P7["7 LEARN\n─────────\nShipped → Better Context\n(CDLC)"]

    P1 --> P2 --> P3 --> P4 --> P5 --> P6 --> P7
    P7 -.->|"feedback\nloop"| P1

    style P1 fill:#e8daef,color:#4a235a
    style P2 fill:#d6eaf8,color:#1b4f72
    style P3 fill:#d5f5e3,color:#1e8449
    style P4 fill:#fdebd0,color:#7e5109
    style P5 fill:#fadbd8,color:#922b21
    style P6 fill:#d4efdf,color:#1e8449
    style P7 fill:#fef9e7,color:#7d6608
```
Seven phases from idea capture to learning. Each phase references the primary skill and key actions.
### Phase 1: CAPTURE — Idea to Spec

**Skill:** docs-ai-prd

- Capture the idea in `docs/specs/feature-name.md`
- Use docs-ai-prd to generate a structured PRD
- Include: problem statement, success criteria, constraints, non-goals
- Architecture extraction: `docs-ai-prd/references/architecture-extraction.md`
- Convention mining: `docs-ai-prd/references/convention-mining.md`
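A minimal scaffold for the spec file can be created up front so the four required sections are never skipped; the path and feature name here are illustrative:

```shell
# Scaffold a spec with the four required sections (path/name are illustrative)
mkdir -p docs/specs
cat > docs/specs/feature-name.md <<'EOF'
# Spec: Feature Name

## Problem statement

## Success criteria

## Constraints

## Non-goals
EOF
```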
### Phase 2: PLAN — Spec to Implementation Plan

**Skill:** dev-workflow-planning

- Create `docs/plans/feature-name.md` from the spec
- Break into tasks with dependencies and verification steps
- Identify parallelizable tasks for multi-agent execution
- Estimate token budget for the implementation
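A plan skeleton that captures tasks, dependencies, verification steps, and parallelism might look like this; the task names and grouping are illustrative, not a prescribed template:

```shell
# Scaffold a plan with tasks, dependencies, and verification steps
# (task names and the parallel grouping are illustrative)
mkdir -p docs/plans
cat > docs/plans/feature-name.md <<'EOF'
# Plan: Feature Name

## Tasks
- [ ] T1: add schema migration (verify: migration runs on a clean DB)
- [ ] T2: implement service layer (depends: T1; verify: unit tests pass)
- [ ] T3: update API docs (no dependencies; parallelizable with T1/T2)

## Token budget
Estimated context per task: ...
EOF
```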
### Phase 3: CONTEXT SETUP — Plan to Repository Context

**Skills:** agents-project-memory, agents-subagents

- Update AGENTS.md if the feature introduces new patterns
- Add/update `.claude/rules/` for any new conventions
- Create specialized subagents if needed (e.g., test-writer, migration-helper)
- For multi-repo: ensure the coordination repo is updated if shared context changes
### Phase 4: EXECUTE — Context to Working Code

**Skills:** agents-swarm-orchestration, dev-git-workflow

- Create a feature branch and worktree for isolation
- Execute plan tasks — use subagents for parallel work
- Follow plan verification steps after each task
- Use `--add-dir` for cross-repo context if needed
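The branch-plus-worktree isolation step is plain git; branch and path names below are illustrative, and the throwaway-repo setup exists only to make the sketch runnable anywhere:

```shell
# Setup for the sketch: any existing repo works; here we make a throwaway one
cd "$(mktemp -d)" && git init -q . \
  && git -c user.name=dev -c user.email=dev@example.com commit -q --allow-empty -m init

# Create the feature branch in its own worktree so the agent session
# cannot disturb the main checkout (branch/path names are illustrative)
git worktree add ../feature-x -b feature-x

# When the work is merged, remove the worktree
git worktree remove ../feature-x
```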
### Phase 5: VERIFY — Code to Quality + Compliance Gate

**Skills:** agents-hooks, dev-git-workflow

- Run automated verification: tests, lint, type-check
- Run compliance gates (if regulated): signed commits, secrets scan, SAST, PII check
- AI disclosure: complete PR template with AI involvement
- Human review: code reviewer verifies AI-generated code
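The automated verification chain can be expressed as a small fail-fast gate script. The gate commands below are placeholders, not this skill's actual gates — substitute your stack's test, lint, and type-check invocations:

```shell
#!/usr/bin/env bash
# Fail-fast verification gate: each gate must pass before the next runs
set -euo pipefail

run_gate() {
  local name=$1; shift
  echo "gate: $name"
  "$@" || { echo "FAILED: $name" >&2; return 1; }
}

# Placeholder commands — swap in e.g. npm test / pytest / tsc --noEmit
run_gate tests     sh -c 'exit 0'
run_gate lint      sh -c 'exit 0'
run_gate typecheck sh -c 'exit 0'
echo "all gates passed"
```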
### Phase 6: SHIP — Verified to Merged + Deployed

**Skill:** dev-git-workflow

- PR approved by reviewer (different person from author)
- Security review for critical paths (`auth/`, `payments/`, `crypto/`)
- Merge to main via merge commit (not squash — audit trail)
- Deployment approved by DevOps (separate from code approval)
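The merge-commit step is standard git; the throwaway-repo setup below only makes the sketch self-contained, and the branch names are illustrative:

```shell
# Setup for the sketch: a throwaway repo with a feature branch
cd "$(mktemp -d)" && git init -q -b main . \
  && git -c user.name=dev -c user.email=dev@example.com commit -q --allow-empty -m init
git checkout -q -b feature-x
git -c user.name=dev -c user.email=dev@example.com commit -q --allow-empty -m "feat: work"
git checkout -q main

# Merge with an explicit merge commit — never squash — so the full branch
# history and review trail stay reachable in the audit log
git -c user.name=dev -c user.email=dev@example.com \
  merge --no-ff feature-x -m "merge: feature-x (reviewed; AI disclosure in PR)"
```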
### Phase 7: LEARN — Shipped to Better Context

**Framework:** CDLC (context-development-lifecycle.md)

- Session retrospective: what context was missing or misleading?
- Update AGENTS.md and rules based on learnings
- Extract patterns: if you repeated the same instruction 3+ times, make it a rule
- Track metrics: agent success rate, rework rate, token cost
## SDLC Compression

Traditional regulated SDLC: Requirements (14d) → Dev (3w) → QA (6-8w) → Deploy (1-2w) = 12-16 weeks.

The 2-month QA is a late-discovery problem, not a QA problem. CDLC shifts verification left into every phase:
| Phase | Traditional | With CDLC | Key Enabler |
|---|---|---|---|
| Requirements | 14 days | 3-5 days | AI-assisted specs, architecture extraction |
| Development | 3 weeks | 2-3 weeks | Structured context = fewer mistakes |
| QA | 6-8 weeks | 1-2 weeks | Automated gates + verification per task |
| Deployment | 1-2 weeks | 1-3 days | Pre-verified compliance, audit trail |
| **Total** | **12-16 weeks** | **4-6 weeks** | **60-65% compression** |
QA compresses the most because convention violations, integration bugs, compliance gaps, and missing tests are caught during development — not discovered weeks later. Automated compliance gates mean QA focuses on what humans are good at: exploratory testing and edge cases.
See: references/context-development-lifecycle.md § SDLC Compression for full analysis with caveats.
## Repository Maturity Quick Assessment

| Level | Per-Repo | Org-Wide (100 repos) | Key Action |
|---|---|---|---|
| L0 No Context | No AGENTS.md | No shared standards | Create AGENTS.md (30 min) |
| L1 Basic | AGENTS.md <50 lines | Template repo exists, 10% adoption | Add rules + docs (2-4 hrs) |
| L2 Structured | + rules + docs/specs | Shared rules, 50% adoption | Add agents + hooks (1-2 days) |
| L3 Automated | + agents + hooks + CI gates | Compliance gates, 80% adoption | Start CDLC (2-4 weeks) |
| L4 Full CE | + CDLC active + metrics | InnerSource governance, 95%+ | Sustain + optimize |

Quick self-assessment: 14 yes/no questions in `references/maturity-model.md`.
## Multi-Repo at Scale

For organizations with many repositories, use a coordination layer pattern:

### Coordination Repo (recommended for polyrepo)
```mermaid
flowchart TD
    CR["Coordination Repo\n━━━━━━━━━━━━━━\nOrg AGENTS.md\nShared rules\nSync scripts"]
    R1["Service A\n─────────\nLocal AGENTS.md\nLocal rules"]
    R2["Service B\n─────────\nLocal AGENTS.md\nLocal rules"]
    R3["Service C\n─────────\nLocal AGENTS.md\nLocal rules"]
    RN["... 97 more"]

    CR -->|"mandatory rules\n(CI/CD sync)"| R1
    CR -->|"mandatory rules\n(CI/CD sync)"| R2
    CR -->|"mandatory rules\n(CI/CD sync)"| R3
    CR -.->|sync| RN

    DEV["Developer Session\nclaude --add-dir coordination-repo"]
    DEV -->|"reads shared"| CR
    DEV -->|"reads local"| R2

    style CR fill:#d6eaf8,color:#1b4f72
    style DEV fill:#d5f5e3,color:#1e8449
    style R1 fill:#fef9e7,color:#7d6608
    style R2 fill:#fef9e7,color:#7d6608
    style R3 fill:#fef9e7,color:#7d6608
    style RN fill:#f5f5f5,color:#666666
```
One meta-repo holds shared context: org-wide AGENTS.md, mandatory rules, shared skills, sync scripts. Individual repos maintain focused local context.

```shell
# Load shared context into any repo session
claude --add-dir ../coordination-repo
```
### Shared vs Local Context

| Category | Scope | Distribution |
|---|---|---|
| Mandatory (compliance, security, data handling) | All repos | CI/CD sync (automated) |
| Recommended (coding standards, commit conventions) | Most repos | Template sync or `--add-dir` |
| Local (architecture, domain patterns, subagents) | Per-repo | Maintained by repo team |
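A CI/CD sync step in each service repo might look like the sketch below; the side-by-side checkout layout and the `rules/mandatory/` directory in the coordination repo are assumptions, not a prescribed structure:

```shell
# (setup so the sketch runs standalone: fake checkouts side by side)
cd "$(mktemp -d)"
mkdir -p coordination-repo/rules/mandatory service-a
echo "# org-wide rule" > coordination-repo/rules/mandatory/compliance-fca-emi.md

# In each service repo's CI, copy mandatory rules in — overwriting local
# copies so the org-wide version always wins (layout is assumed, not prescribed)
cd service-a
mkdir -p .claude/rules
cp ../coordination-repo/rules/mandatory/*.md .claude/rules/
```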
### Symlink Convention (enforced everywhere)

```shell
# Every repo, every time
ln -s AGENTS.md CLAUDE.md

# CI validates the symlink, or the build fails
[ -L CLAUDE.md ] || exit 1
```
See: references/multi-repo-strategy.md for full patterns, sync scripts, token budgets, and InnerSource governance.
## Regulated Environments

For FCA-regulated EMIs and similar organizations:

### Mandatory Compliance Rules

Install these in every repo (copy from the assets/ directory):
| Asset File | Install To | Purpose |
|---|---|---|
| compliance-fca-emi.md | .claude/rules/compliance-fca-emi.md | Audit trail, separation of duties, SM&CR |
| data-handling-gdpr-pci.md | .claude/rules/data-handling-gdpr-pci.md | Safe/prohibited data categories |
| ai-agent-governance.md | .claude/rules/ai-agent-governance.md | Approved tools, disclosure, training |
| pr-template-ai-disclosure.md | .github/pull_request_template.md | AI involvement checklist per PR |
| fca-compliance-gate.yml | .github/workflows/fca-compliance-gate.yml | Signed commits, secrets, SAST, PII, AI disclosure |
### Core Regulatory Principles

- **Audit trail:** Signed commits, merge commits, immutable history (PS21/3)
- **Separation of duties:** AI cannot approve/merge/deploy; different reviewer required
- **No sensitive data in context:** PII, card data, credentials never in agent prompts or files
- **AI disclosure:** Every PR declares AI involvement and human verification
- **Accountability:** Named Senior Manager accountable for AI governance (SM&CR)
- **Portability:** Dual-agent strategy (Claude Code + Codex) avoids vendor lock-in (PS24/16)
- **Agent isolation:** Sandbox execution for automated agent runs (microVM/gVisor for CI/CD)
- **Platform audit:** GitHub Agent HQ audit logs with `actor_is_agent` identifiers (Feb 2026)
Also track: NIST AI Agent Standards Initiative (Feb 2026) — US framework for agent identity, security, governance. FINRA 2026 — first financial regulator to require AI agent action logging and human-in-the-loop oversight.
See: references/regulated-environment-patterns.md for full regulatory mapping and incident response.
## Agent and Tool Selection

### Primary Agents (use both)
Both Claude Code and Codex are available as first-class agents on GitHub Agent HQ (Feb 2026), with enterprise audit logging (actor_is_agent identifiers), MCP allowlists, and organization-wide policy management.
| Capability | Claude Code | Codex |
|---|---|---|
| Best for | Interactive planning, complex refactoring | Async batch tasks, issue triage |
| Context file | Reads CLAUDE.md (symlink) | Reads AGENTS.md (direct) |
| Execution | Local, interactive | Cloud, sandboxed |
| GitHub Agent HQ | Yes (cloud sessions) | Yes (cloud sessions) |
| Subagents | Yes (`.claude/agents/`) | No |
| Hooks | Yes (`.claude/hooks/`) | No |
| MCP servers | Yes | No |
| Worktrees | Yes | Branches |
| Multi-repo | `--add-dir` | Single repo per task |
### Decision Tree

```mermaid
flowchart TD
    Q1{"Interactive task?\n(needs back-and-forth)"}
    Q2{"Batch of independent\ntasks?"}
    Q3{"Complex refactor\nneeding subagents?"}
    CC1["Claude Code"]
    CX1["Codex\n(parallel async)"]
    CC2["Claude Code"]
    EITHER["Either works\n(prefer Claude Code\nfor regulated envs)"]

    Q1 -->|Yes| CC1
    Q1 -->|No| Q2
    Q2 -->|Yes| CX1
    Q2 -->|No| Q3
    Q3 -->|Yes| CC2
    Q3 -->|No| EITHER

    style CC1 fill:#d5f5e3,color:#1e8449
    style CC2 fill:#d5f5e3,color:#1e8449
    style CX1 fill:#d6eaf8,color:#1b4f72
    style EITHER fill:#fef9e7,color:#7d6608
```
### Supplementary Tools

| Tool | Use When | Context File |
|---|---|---|
| Cursor | IDE-embedded editing, quick fixes | .cursor/rules |
| GitHub Copilot | Inline suggestions during manual coding | — |
## Context as Infrastructure

Six principles for treating context like production infrastructure:

1. **Version it** — AGENTS.md and rules live in git, reviewed in PRs
2. **Review it** — Context changes get the same review rigor as code changes
3. **Test it** — Run a task with new context to verify it works before committing
4. **Scope it** — One concern per rule file; clear sections in AGENTS.md
5. **Budget it** — Monitor token cost; compress or split when context grows
6. **Retire it** — Remove stale rules quarterly; outdated context is worse than no context
## Anti-Patterns

| Anti-Pattern | Problem | Fix |
|---|---|---|
| Vibe coding | No spec, no plan, just "build it" | Start with Phase 1 (CAPTURE) |
| Context bloat | 2000-line AGENTS.md nobody reads | Split into rules/ and references; keep AGENTS.md <200 lines |
| Over-specification | Rules for every edge case | Write rules for patterns, not exceptions |
| Tool accumulation | 5 AI tools, no coordination | Pick 2 primary (Claude Code + Codex), standardize context |
| Parallel Jira+context | Maintaining specs in both Jira and repo | Jira for portfolio; repo for execution context |
| Static context | Write AGENTS.md once, never update | CDLC: monthly review, retire stale rules |
| God agent | One agent does everything | Specialized subagents for distinct tasks |
| Skipping verification | Trust AI output without review | Phase 5 (VERIFY) is mandatory, not optional |
| Compliance bypass | "We'll add gates later" | Install mandatory rules from day 1 (assets/) |
| Separate CLAUDE.md | CLAUDE.md and AGENTS.md with different content | Always symlink: `ln -s AGENTS.md CLAUDE.md` |
| LLM-generated context | Auto-generated AGENTS.md duplicates discoverable info (-3% perf) | Write only non-inferable details (ETH Zurich 2026) |
| Single-file at scale | One massive file can't scale beyond modest codebases | Three-tier architecture: hot memory → agents → cold knowledge |
## Do / Avoid

**Do:**

- Start with maturity assessment before investing in automation
- Use the lifecycle (7 phases) — skipping CAPTURE and PLAN is the #1 cause of rework
- Install compliance rules before development starts (not after)
- Run context retrospectives — context without feedback loops decays
- Use both Claude Code and Codex for their respective strengths

**Avoid:**

- Don't migrate from Jira overnight — use the incremental playbook
- Don't create 500-line AGENTS.md files — use progressive disclosure
- Don't skip the symlink convention — drift between AGENTS.md and CLAUDE.md causes bugs
- Don't let context go stale — if it hasn't been updated in 90 days, it's suspect
- Don't treat AI-generated code differently from human code in review rigor
## Navigation

### References

| File | Content | Lines |
|---|---|---|
| paradigm-comparison.md | Old vs new paradigm mapping, 2026 industry validation | ~200 |
| maturity-model.md | 5-level maturity, adoption data, research caveats | ~280 |
| fast-track-guide.md | 30-min, 2-hour, batch tracks + quality research insight | ~250 |
| context-development-lifecycle.md | CDLC + three-tier architecture, Manus patterns, ETH research | ~615 |
| multi-repo-strategy.md | Coordination patterns, GitHub Agent HQ, VS Code CE | ~420 |
| regulated-environment-patterns.md | FCA/EMI, NIST, FINRA 2026, sandbox isolation, GH audit | ~400 |
| repo-conversion-playbook.md | Step-by-step conversion with real scripts and templates | ~790 |
| team-transformation-patterns.md | AI-native vs traditional teams, shadow experiments, risk assessment | ~230 |
### Assets (Copy-Ready Templates)

| File | Install To | Purpose |
|---|---|---|
| compliance-fca-emi.md | .claude/rules/ | FCA/EMI audit trail and separation of duties |
| data-handling-gdpr-pci.md | .claude/rules/ | GDPR/PCI safe and prohibited data categories |
| ai-agent-governance.md | .claude/rules/ | AI tool restrictions and disclosure |
| pr-template-ai-disclosure.md | .github/ | PR template with AI involvement checklist |
| fca-compliance-gate.yml | .github/workflows/ | CI/CD compliance gates |
### Related Skills

| Skill | Relationship |
|---|---|
| agents-project-memory | How to write AGENTS.md (L1 foundation) |
| dev-workflow-planning | Creating implementation plans (Phase 2) |
| docs-ai-prd | Writing specs for AI agents (Phase 1) |
| agents-subagents | Creating specialized subagents (Phase 3) |
| agents-hooks | Event-driven automation (Phase 5) |
| agents-mcp | MCP server configuration |
| dev-git-workflow | Git patterns, worktrees (Phase 4-6) |
| agents-swarm-orchestration | Parallel agent execution (Phase 4) |
## Web Verification

83 curated sources in `data/sources.json` across 10 categories:

| Category | Sources | Key Items |
|---|---|---|
| Context Engineering | 10 | Anthropic CE, Fowler, CDLC, Codified Context (arxiv), Manus lessons |
| AGENTS.md Standard | 6 | agents.md spec, Linux Foundation, ETH Zurich evaluation (arxiv) |
| Paradigm Shift | 8 | OpenAI Harness, METR study, Anthropic 2026 Trends Report |
| Tool Documentation | 10 | Claude Code, Codex, GitHub Agent HQ, VS Code CE guide |
| Multi-Repo Patterns | 6 | Spine Pattern, InnerSource, Git submodules, GH Actions |
| Security Tooling | 10 | Gitleaks, Semgrep, NIST Agent Standards, sandbox patterns |
| FCA/EMI Compliance | 9 | PS21/3, SS1/23, SM&CR, PS24/16, FINRA 2026 AI agents |
| Data Protection | 4 | IAPP GDPR, PCI SSC, Anthropic DPA, OpenAI DPA |
| SDLC and DevOps | 6 | DORA metrics, GitHub Enterprise AI Controls, branch protection |
| Practitioner Insights | 14 | Stripe Minions, Block/Dorsey, HBR AI layoffs, Harvard/P&G, OpenAI guide |
Verify current facts before final answers. Priority areas:

- AGENTS.md specification changes (agents.md — 60,000+ repos, evolving rapidly)
- Claude Code and Codex feature updates (now on GitHub Agent HQ)
- GitHub Enterprise AI Controls evolution (MCP allowlists, agent governance)
- FCA regulatory updates (PS21/3, SS1/23, PS24/16 — watch for consultations)
- NIST AI Agent Standards Initiative (comments due April 2026)
- FINRA AI agent guidance evolution (annual oversight reports)
- CDLC framework evolution (community-driven, externally validated March 2026)
- Context file effectiveness research (ETH Zurich, Codified Context — ongoing)
## Fact-Checking

- Use web search/web fetch to verify current external facts, versions, pricing, deadlines, regulations, or platform behavior before final answers.
- Prefer primary sources; report source links and dates for volatile information.
- If web access is unavailable, state the limitation and mark guidance as unverified.