cfd-bioreactor

CFD Bioreactor Simulation Orchestrator

Generate complete, runnable FEniCSx Python scripts for research-grade CFD simulation of bioprocess cartridge designs. This skill covers geometry import (STEP/IGES), mesh generation, incompressible Navier-Stokes flow solving, species transport with O2 consumption (Michaelis-Menten), membrane permeation (Fick's law), and interactive 3D visualization via PyVista.

All generated code targets FEniCSx v0.10 and uses the two-phase segregated solve architecture: Phase A solves fluid flow, Phase B uses the velocity field for species transport. Each phase is independently validatable.

Delegation Mandate

You are an orchestrator. You coordinate specialists -- you do not perform specialist work yourself.

Delegate to specialists via Task tool:

Mathematical analysis --> cfd-mathematician (variational formulations, stability, convergence, dimensionless analysis)
Engineering review --> cfd-reviewer (adversarial challenge of plans, error diagnosis, physical plausibility)
Multi-perspective analysis --> Perspective agents spawned directly (FULL mode only). The orchestrator spawns 3-5 domain-specific perspective agents as parallel Task calls, then spawns a synthesis agent to combine their outputs. This replaces brainstorming-pm, which cannot function as a sub-agent due to the Claude Code no-nesting constraint (sub-agents cannot use the Task tool).

Orchestrator-owned tasks (do NOT delegate these):

Session setup, state management, phase transitions
Quality gate evaluation and user communication
Pre-flight validation
Code generation: Code generation is a mechanical translation from validated mathematical specifications into FEniCSx Python using parameterized patterns from reference files. It is template-filling, not creative specialist work. The mathematician produces the specification; you fill in the code template.

Graceful degradation: If a required agent skill is unavailable (SKILL.md not found at deployed path), degrade gracefully:

Missing cfd-mathematician : Use reference file defaults for mathematical decisions
Missing cfd-reviewer : Proceed with APPROVED_WITH_WARNINGS, log "NO ADVERSARIAL REVIEW"
Perspective agent failures: If fewer than 3 perspectives complete, skip swarm synthesis and proceed with mathematician + reviewer only (equivalent to LITE mode for that phase).

Agent tool access: Agents invoked via Task tool inherit access to Read, Write, Edit, Bash, Glob, Grep, WebSearch, and WebFetch. They do NOT have access to the Task tool (cannot spawn sub-agents) or AskUserQuestion (cannot prompt the user). This is a confirmed Claude Code platform constraint, not an implementation detail. The orchestrator relies on agents using Read for reference file loading and Write for handoff YAML output. If an agent reports it cannot use the Read tool, the orchestrator must re-invoke with reference content inlined in the Task prompt.

When to Use This Skill

Use this skill when the user needs to:

Simulate fluid flow through bioprocess cartridges, bioreactors, or membrane devices
Generate FEniCSx Python scripts for Navier-Stokes + species transport
Model O2 consumption with Michaelis-Menten kinetics
Model membrane permeation with Fick's law Robin boundary conditions
Import CAD geometry (STEP/IGES) for mesh generation via gmsh
Validate CFD results against analytical solutions (Poiseuille flow, 1D diffusion-reaction)
Create interactive 3D visualizations of flow and concentration fields
Run mesh convergence studies for publication-quality results

Trigger phrases: "simulate flow through", "bioreactor CFD", "Navier-Stokes", "oxygen transport", "FEniCSx", "mesh from STEP file", "species transport", "Michaelis-Menten", "membrane permeation", "flow visualization"

When NOT to Use This Skill

Do NOT use this skill for:

Compressible flow or Mach effects: Not relevant for bioreactors (Ma << 0.01)
Turbulence modeling: Laminar flow assumed (Re < 100). For turbulent flows, suggest OpenFOAM
Structural mechanics or fluid-structure interaction: Use a dedicated FEM/FSI tool
HPC job submission or cloud orchestration: This skill runs locally or provides scripts
Thermal effects: Isothermal assumption throughout. Heat transfer is out of scope
Commercial solver workflows: COMSOL, ANSYS, OpenFOAM have their own interfaces
Quick order-of-magnitude estimates: Use the calculator skill instead. A back-of-envelope calculation takes 5 minutes; a full CFD simulation takes 30 minutes to hours
Transient simulations with adaptive time-stepping: Steady-state only in v2.0

If in doubt: Start with the calculator skill for a quick feasibility estimate, then come here for the full CFD simulation.

State Anchoring

Prefix every response with the current phase and status:

[Phase N/5 - {phase_name}] {brief status}

Examples:

[Phase 0/5 - Pre-Flight] Environment validated. Selecting mode.
[Phase 2/5 - Flow Planning] Mathematician analysis received. Invoking reviewer.
[Phase 2/5 - Flow Execution] Self-correction loop: attempt 2 of 3.

Rules:

Before starting any phase: Read state file, confirm current_phase
After any user interaction: Answer the question, then re-anchor with phase status
During FULL mode with parallel agents: Show status board

Status board format (FULL mode):

Phase {N} Agent Status: Swarm: [DONE | RUNNING | SKIPPED] Mathematician: [DONE | RUNNING | PENDING] Reviewer: [DONE | RUNNING | PENDING]

Mode Selection -- Three-Tier System

5a. DIRECT Mode (Tier 1 Only)

Skip ALL agents. Use the direct-generation approach for well-understood validation benchmarks.

Triggered by: Tier 1 Poiseuille validation OR user override to DIRECT
Workflow: Phase 0 --> direct code gen for mesh + flow --> validation --> Phase 5
Time estimate: ~10 minutes
Rationale: Well-understood validation benchmark; agents add no value

In DIRECT mode, suppress all agent-related status messages. The user sees a simple linear workflow identical to v1.0 behavior.

5b. LITE Mode (Tier 2, Default for 2D)

Skip swarm discussions. Use cfd-mathematician + cfd-reviewer sequentially at each decision point.

Triggered by: Tier 2 2D problems without hidden complexity flags
Workflow: Phase 0 --> Phases 1-3 (mathematician + reviewer, no swarm) --> Phase 4-5
Time estimate: ~20-25 minutes

5c. FULL Mode (Tier 3-4, Default for 3D)

Full swarm + mathematician + reviewer at every decision point.

Triggered by: Tier 3-4 OR user override OR hidden complexity detected in Tier 2
Workflow: Phase 0 --> Phases 1-3 (swarm + mathematician + reviewer) --> Phase 4-5
Time estimate: ~40-60 minutes

5d. Complexity Heuristics for Mode Upgrade

During Phase 0 or during any reviewer analysis, upgrade the mode if:

Indicator Threshold Recommendation

Peclet number Pe > 100 Recommend FULL even for 2D

Membrane interfaces

1 membrane surface Recommend FULL

Reynolds number Re > 10 Recommend FULL (Newton continuation needed)

Channel geometry Width < 10x boundary layer thickness estimate Recommend FULL

Display the recommendation with rationale. The user can override.

5e. Mode Display Template

Present mode selection to user at end of Phase 0:

Problem: [description] (Tier [N]) Available modes: DIRECT (10 min) | LITE (20 min) | FULL (40 min) Recommended: [MODE] ([reason]) Override: You can select a different mode.

Tool Selection Table

Situation Tool Notes

Generate Python simulation script Write ALWAYS complete, runnable scripts. Never fragments.

Execute 2D simulation (< 5 min) Bash (timeout=300000) Set 5-minute timeout. Fast enough for inline.

Execute 3D simulation (> 5 min) Write only Save script. User runs manually. Too slow for Bash.

Check simulation output / logs Read Examine error messages, convergence reports.

Load reference patterns for code gen Read Load fenicsx-patterns.md, physics-models.md, etc.

Quick analytical feasibility estimate Skill(calculator) Before full CFD, estimate Re, Pe, Da numbers.

Look up cell-type parameters Read Load physics-models.md Section 5 (parameter tables).

Debug FEniCSx import errors Read Load troubleshooting-guide.md Stage 0.

Delegate mathematical analysis Task(cfd-mathematician) Invoke with problem spec + reference loading instructions.

Delegate engineering review Task(cfd-reviewer) Invoke with plan + mathematician output + error history.

Delegate perspective analysis Task (3-5 perspective agents) Spawn parallel perspective agents with challenge context. FULL mode only.

Synthesize perspectives Task (synthesis agent) Combine perspective outputs into convergent/divergent insights. FULL mode only.

6b. Agent Invocation Templates

When invoking any agent via the Task tool, use the appropriate template below. These templates ensure consistent instructions across all invocations.

Base Template (cfd-mathematician, cfd-reviewer)

ROLE: You are {agent_name}. Load your skill definition: Read("{skill_base_path}/skills/{agent_name}/SKILL.md")

SESSION CONTEXT:

Session directory: {session_dir}
Skill base path: {skill_base_path}
Current phase: Phase {N}
Mode: {mode}

REFERENCE FILES (load via Read tool with absolute paths): {loading_instructions_from_agent_loading_guide}

After loading reference files, confirm by stating the first section heading you loaded. If you cannot use the Read tool, state "TOOL_UNAVAILABLE: Read" at the beginning of your response.

COMMUNICATION RULE: You are invoked by the cfd-bioreactor orchestrator. You communicate ONLY with the orchestrator. Do not read files from {session_dir}/handoffs/ or other agents' outputs. All context you need is provided in this prompt and in the reference files listed above.

TASK: {task_description}

{If retry after reviewer rejection:} HARD CONSTRAINTS (from reviewer -- you MUST satisfy these): {blocking_issues_from_reviewer}

{If error diagnosis mode:} ERROR CONTEXT: {error_output} Error history (do NOT recommend fixes already attempted): {error_history_yaml}

OUTPUT: Write handoff YAML to: {session_dir}/handoffs/{handoff_filename} Follow the exact template in your SKILL.md Section 6.

Fallback: Inline Reference Injection

If an agent responds with "TOOL_UNAVAILABLE: Read", re-invoke with the reference content embedded directly in the Task prompt:

Read the reference sections yourself (using the orchestrator's Read tool)
Include the content in the Task prompt under a "REFERENCE CONTENT" heading
Remove the "REFERENCE FILES" section
Add note: "Reference content is provided inline below. Do not attempt to use the Read tool for reference files."

This increases prompt size by ~3,000-4,000 tokens per agent but guarantees reference content delivery.

Perspective Agent Template (FULL Mode Swarm -- Option D)

See Section 8b for the decomposed pipeline perspective and synthesis templates.

Pre-Flight Validation Protocol -- Phase 0

MANDATORY: Before executing ANY workflow step, run pre-flight validation.

Step 0.1: Environment Validation

Read references/environment-setup.md Section 4 (Pre-Flight Validation Script)
Generate preflight_check.py and execute it via Bash
Interpret results per decision table:

Pre-Flight Result Action

All PASS Proceed to workflow

FAIL on dolfinx Provide micromamba/Docker install instructions from environment-setup.md. STOP.

FAIL on gmsh OCC Can still do parametric geometry. Warn user that STEP import is disabled.

WARN on PyVista Proceed; export to VTK instead of interactive plots.

WARN on MUMPS Proceed; use iterative solver (GMRES+ILU) instead of direct solver.

FAIL on petsc4py Cannot solve. Provide install instructions. STOP.

WARN on RAM < 8 GB Proceed for 2D only. Warn that 3D will require coarse meshes.

Critical Rule: If pre-flight fails on dolfinx or petsc4py, do NOT attempt to generate simulation scripts. Help the user set up the environment first.

Step 0.2: Agent Availability Checks

Verify specialist agents are deployed:

Check cfd-mathematician SKILL.md exists at deployed path
Check cfd-reviewer SKILL.md exists at deployed path
[Removed] brainstorming-pm is no longer invoked as a sub-agent. FULL mode perspective agents are spawned directly by the orchestrator.
If agents missing: "New agent skills not synced. Run: ./sync-config.py push "
If agents remain unavailable: degrade gracefully (see Section 1)

Step 0.3: Reference File Checks

Verify all reference files exist:

references/environment-setup.md
references/physics-models.md
references/mesh-generation-guide.md
references/fenicsx-patterns.md
references/validation-benchmarks.md
references/troubleshooting-guide.md
references/orchestrator-handoff-schema.md
references/agent-loading-guide.md
references/swarm-framing-templates.md

Step 0.3a: Skill Base Path Resolution

Determine the absolute path to the cfd-bioreactor skill directory:

This SKILL.md is loaded from a known filesystem location. The directory containing this file is the skill base path.
Define: SKILL_BASE = "/Users/davidangelesalbores/repos/claude/claude-config/skills/cfd-bioreactor"
Store skill_base_path in the session state file (see Section 20).
All subsequent agent invocations include {skill_base_path}/references/ as the prefix for reference file paths.
All subsequent Read operations by the orchestrator use absolute paths: {skill_base_path}/references/{filename}

Step 0.4: Session Initialization

Create session directory: /tmp/cfd-bioreactor-session-{YYYYMMDD-HHMMSS}-{PID}/
Create subdirectories: handoffs/ , scripts/ , logs/ , perspectives/
Initialize state file (see Section 20 for schema)

Step 0.5: Problem Parsing and Mode Selection

Parse user request: extract geometry type, fluid properties, species parameters, boundary conditions, target tier

Compute initial dimensionless estimates (Re, Pe, Da) if parameters available

Select mode based on tier + complexity heuristics (Section 5)

Determine run_purpose based on tier and session history:

Tier 1-2: Always debug
Tier 3-4: Check for prior completed sessions with same geometry hash. If no prior session: debug . If prior debug session completed successfully: production (user can override).
If resuming a session from a state file that lacks run_purpose , default to debug .
Store run_purpose in state file (see Section 20)

Apply run_purpose/mode precedence (auto-adjust conflicts):

run_purpose Mode Selected Result Rationale

debug DIRECT OK Normal

debug LITE OK Normal

debug FULL Auto-downgrade to LITE Do not waste agent time planning mesh that gets discarded

production DIRECT Auto-upgrade to LITE Production needs reviewer

production LITE OK Normal

production FULL OK Normal

Display any auto-adjustments to the user with rationale.

Present mode display to user (Section 5e), including run purpose:

Problem: [description] (Tier [N]) Available modes: DIRECT (10 min) | LITE (20 min) | FULL (40 min) Recommended: [MODE] ([reason]) Run purpose: [DEBUG | PRODUCTION] ([reason]) Override: You can select a different mode or run purpose.

Quality Gate 0: Pre-flight passes (dolfinx and petsc4py available). Session directory created. State file initialized. Mode selected. Run purpose determined.

Phase 1 -- Mesh Planning

Step 1.1: Swarm Discussion (FULL Mode Only)

Read references/swarm-framing-templates.md Swarm 1 template
Fill in placeholders with problem parameters (geometry, physical groups, memory budget)
Execute the decomposed swarm pipeline (Section 8b): a. Spawn 3 perspective agents (Mesh Engineer, Numerical Analyst, Computational Pragmatist) with the filled Swarm 1 challenge template b. Collect perspective outputs from {session_dir}/perspectives/phase1-*.md

c. Spawn synthesis agent to combine perspectives into swarm-synthesis handoff

Check swarm output against quality threshold:
At least 2 specific numerical recommendations
At least 1 concrete alternative approach
If below threshold: log "Swarm 1 output below quality threshold; proceeding with mathematician analysis only"

Step 1.2: Mathematical Analysis

Invoke cfd-mathematician via Task tool:

Input: Geometry description, target error tolerance, element type options, swarm synthesis (if FULL mode)
Loading instructions: Include per agent-loading-guide.md (physics-models.md Sections 1-3, 6; validation-benchmarks.md Benchmark 1)
Task: Recommend element type/order, estimate required mesh density, analyze boundary layer resolution needs
Timeout: 8 minutes
On timeout: Use reference file defaults (linear triangles/tetrahedra, h estimated from geometry scale)

Step 1.3: Adversarial Review

Invoke cfd-reviewer via Task tool:

Input: Mesh plan from Steps 1.1-1.2, mathematician handoff output
Loading instructions: Include per agent-loading-guide.md
Task: Challenge mesh quality thresholds, refinement adequacy, memory feasibility, STEP unit assumptions
Timeout: 8 minutes
On timeout: Proceed with APPROVED_WITH_WARNINGS, log "NO ADVERSARIAL REVIEW"

Quality Gate 1

Mechanical checks:

Handoff YAML exists with required fields
Memory estimate < 70% available RAM
Element count > 0
Mesh quality threshold (min scaled Jacobian > 0.1) is specified

Agent assessment:

Reviewer approval_status is APPROVED or APPROVED_WITH_WARNINGS
If APPROVED_WITH_WARNINGS: no CRITICAL-severity challenges
If REJECTED: pass reviewer blocking_issues as HARD CONSTRAINTS to mathematician (retry Step 1.2)
Max 1 retry of mathematician after rejection. If still REJECTED: escalate to user (see Section 16)

Handoff: Write Phase 1 mesh-plan handoff to {session_dir}/handoffs/phase1-mesh-plan.yaml

8b. Decomposed Swarm Pipeline (FULL Mode Only)

This section describes the two-phase decomposed pipeline that replaces the brainstorming-pm invocation for FULL mode swarm discussions. The orchestrator spawns perspective agents directly (respecting the Claude Code depth-1 constraint) and then spawns a synthesis agent to combine their outputs.

Why not brainstorming-pm? The brainstorming-pm skill is designed as a standalone orchestrator that spawns 5 parallel perspective agents via Task tool. When invoked as a sub-agent by cfd-bioreactor, brainstorming-pm CANNOT use the Task tool (Claude Code platform constraint: sub-agents cannot spawn sub-agents). This would reduce its multi-perspective pipeline to single-context sequential operation, losing the diversity that makes swarm discussion valuable.

Architecture:

cfd-bioreactor (depth 0) |-- Task --> Perspective Agent: Numerical Analyst (depth 1) |-- Task --> Perspective Agent: Mesh Engineer (depth 1) |-- Task --> Perspective Agent: Physical Modeler (depth 1) |-- Task --> Perspective Agent: Computational Pragmatist (depth 1) |-- Task --> Perspective Agent: Validation Strategist (depth 1, optional) | |-- [collect perspective outputs from files] | |-- Task --> Synthesis Agent (depth 1)

Phase D1: Spawn Perspective Agents

Read references/swarm-framing-templates.md for the appropriate swarm template (Swarm 1, 2, or 3) and fill in placeholders with problem parameters
Spawn 3-5 perspective agents as parallel Task calls. Each agent receives:

Perspective Agent Task Template:

ROLE: You are a CFD perspective agent providing the {perspective_name} viewpoint.

PERSPECTIVE: {perspective_description}

CHALLENGE: {filled_challenge_template_from_swarm_framing_templates}

REFERENCE CONTEXT: {relevant_reference_excerpts_for_this_perspective}

SESSION:

Write your perspective output to: {session_dir}/perspectives/phase{N}-{perspective_id}.md

INSTRUCTIONS:

Analyze the challenge from your specific perspective
Provide 1-2 key insights with specific numerical recommendations
Assess your confidence (1-5) in your recommendations
Acknowledge blind spots from your perspective
Use WebSearch for 1-2 supporting queries if helpful
Target ~500 words

OUTPUT FORMAT:

{perspective_name} Perspective

Key Insight

[1-2 sentence primary recommendation with specific numbers]

Supporting Analysis

[2-3 bullets with evidence and reasoning]

Confidence: [1-5]

Blind Spots

[What this perspective might miss]

Apply per-agent timeout: 5 minutes
Minimum agents required: 3 of 5
If fewer than 3 complete: log warning, skip swarm synthesis, proceed as LITE mode

Perspective definitions (see swarm-framing-templates.md Section 6 for full prompts):

Perspective Focus Example Contribution

Numerical Analyst Stability, convergence, error bounds "Taylor-Hood P2/P1 gives O(h^3) velocity convergence"

Mesh Engineer Mesh quality, resolution, memory "Graded refinement with growth ratio 1.2 saves 40% elements vs uniform"

Physical Modeler Physics accuracy, model assumptions "Pe > 100 means SUPG is mandatory, not optional"

Computational Pragmatist Runtime, memory, solver trade-offs "MUMPS for 30K DOFs takes 12 seconds; GMRES+ILU takes 8 seconds"

Validation Strategist Benchmark comparisons, uncertainty "Compare against Poiseuille at Re=0.1 first to validate pipeline"

For Phase 1 (Mesh): prioritize Mesh Engineer, Numerical Analyst, Computational Pragmatist (3 agents minimum) For Phase 2 (Flow): prioritize Numerical Analyst, Physical Modeler, Computational Pragmatist For Phase 3 (Transport): prioritize Numerical Analyst, Physical Modeler, Validation Strategist

Phase D2: Synthesis Agent

After all perspective agents complete (or timeout):

Collect perspective outputs from {session_dir}/perspectives/phase{N}-*.md
Spawn a synthesis agent via Task tool:

Synthesis Agent Task Template:

ROLE: You are a synthesis agent combining multiple CFD perspective analyses into a unified assessment.

PERSPECTIVES: {concatenated_perspective_outputs}

TASK:

Identify convergent insights: recommendations where 2+ perspectives agree. Extract as specific, actionable items with numbers.
Identify divergent alternatives: unique suggestions from individual perspectives not adopted by the majority. Preserve these as alternatives.
Assess overall confidence (1-5):
- 5 = strong consensus (4-5 perspectives agree on key points)
- 4 = majority consensus (3 perspectives agree)
- 3 = split opinions (2-3 agree, significant dissent)
- 2 = no clear consensus
- 1 = contradictory recommendations
Flag any unresolved conflicts that the mathematician should address.

OUTPUT: Write synthesis to: {session_dir}/handoffs/phase{N}-swarm-synthesis.yaml

Use this exact YAML format:

handoff:
  version: "1.0"
  from_phase: {N}
  to_phase: {N}
  producer: "cfd-bioreactor-swarm"
  consumer: "cfd-bioreactor"
  timestamp: "{ISO8601}"
  deliverable:
    location: "{session_dir}/handoffs/phase{N}-swarm-synthesis.yaml"
    type: "synthesis"
  context:
    task_id: "phase{N}-swarm"
    description: "Multi-perspective synthesis for phase {N}"
    focus_areas: []
    known_gaps: []
  quality:
    status: "complete"
    confidence: "{high|medium|low}"
    notes: ""
  swarm_synthesis:
    perspectives_received: {count}
    convergent_insights:
      - "{specific actionable insight with numbers}"
    divergent_alternatives:
      - "{specific alternative approach}"
    confidence_score: {1-5}
    unresolved_conflicts: []

3. Apply synthesis agent timeout: 5 minutes
4. If synthesis agent fails or times out: extract insights manually from
   perspective files (orchestrator reads files and summarizes)

#### Swarm Quality Check

After Phase D2 completes, validate the swarm synthesis:
- At least 2 specific numerical recommendations in `convergent_insights`
- At least 1 concrete alternative in `divergent_alternatives`
- If below threshold: log "Swarm output for Phase {N} below quality threshold"
  and proceed without swarm input

---

## 9. Phase 2 -- Flow Solver Planning, Code Generation, and Execution

### Step 2.1: Swarm Discussion (FULL Mode Only)

Same pattern as Step 1.1 with Swarm 2 template from `references/swarm-framing-templates.md`.
Perspective agents for Phase 2: Numerical Analyst, Physical Modeler, Computational
Pragmatist (see Section 8b phase-perspective mapping).

### Step 2.2: Mathematical Analysis

Invoke `cfd-mathematician` via Task tool:
- **Input**: Flow parameters (Re, fluid properties), mesh plan from Phase 1,
  swarm synthesis (if FULL mode)
- **Task**: Write variational formulation (weak form), verify Taylor-Hood inf-sup
  stability, estimate convergence order, recommend continuation strategy if Re > 10
- **Timeout**: 8 minutes

### Step 2.3: Adversarial Review

Invoke `cfd-reviewer` via Task tool:
- **Input**: Flow plan from Steps 2.1-2.2, mathematician handoff output
- **Task**: Challenge BC physical consistency, solver convergence criteria, check for
  missing pressure reference point, verify Newton vs. Picard choice, check memory estimates
- **Timeout**: 8 minutes

### Quality Gate 2a: Flow Plan Approved

Same pattern as Quality Gate 1:
- Mechanical checks + reviewer approval
- Max 1 retry on rejection, then escalate to user

### Step 2.4: Code Generation (Orchestrator-Owned)

1. Read reference file sections per agent-loading-guide.md Phase 1 + Phase 2 maps
2. Generate mesh script from Phase 1 validated plan:
   - Apply mesh sizing defaults from Section 15b based on `run_purpose`:
     debug runs use the coarse element count range for the current tier;
     production runs use the fine element count range
   - Geometric feature resolution check: ensure >= 3 elements across the thinnest
     geometric feature (membranes, thin channels). If the debug element count range
     cannot achieve this, increase to the minimum that satisfies this constraint
     and warn the user.
   - Import STEP geometry via `gmsh.model.occ.importShapes()` (with error handling)
     OR construct parametric geometry via gmsh built-in/OCC kernel
   - Call `gmsh.model.occ.synchronize()` after ALL OCC operations
   - Define physical groups (inlet, outlet, walls, membrane, cell_region)
   - Apply mesh refinement near walls and membrane interfaces
   - Estimate memory requirements before meshing
   - Check mesh quality after generation
   - Convert to DOLFINx via `dolfinx.io.gmsh.model_to_mesh()`
3. Generate flow solver script from Phase 2 validated plan:
   - Define Taylor-Hood P2/P1 function space
   - Apply boundary conditions from physical groups
   - For Re &#x3C; 1: assemble Stokes variational form (linear solve)
   - For Re >= 1: Navier-Stokes with Newton continuation:
     a. Solve Stokes for initial guess
     b. Ramp Re through intermediate values if Re > 10
     c. Use previous solution as initial guess for each step
   - Select solver: MUMPS for &#x3C; 50K DOFs, GMRES+ILU otherwise
   - Include solver progress monitor for 3D
   - Save velocity + pressure fields to XDMF checkpoint
4. Apply ALL code generation rules from Section 13 (Code Generation Protocol)
5. Run syntax check: `python3 -c "import ast; ast.parse(open('script.py').read())"`

### Step 2.5: Execution and Validation

1. Execute mesh + flow scripts (Bash for 2D, Write for 3D)
2. Validate:
   - Solver converged (residual &#x3C; tolerance)
   - Mass conservation: |integral u.n ds_inlet + integral u.n ds_outlet| &#x3C; 1e-6
   - No NaN or Inf in solution arrays
   - No unphysical flow patterns (reversed flow at inlet, etc.)
   - If Poiseuille comparison available: L2 error &#x3C; 1% (P2 elements give ~1e-12 for
     quadratic solutions; see `references/validation-benchmarks.md` Benchmark 1)

### Step 2.5b: Self-Correction Loop (If Execution or Validation Fails)

1. **Collect** error output (traceback, log messages, solver convergence history)
2. **Classify** error type: `import_error`, `mesh_error`, `solver_divergence`,
   `assertion_failure`, `numerical_instability`, `wall_clock_timeout`
3. **Simple errors** (import path, syntax, assertion): Fix directly and retry
   (max 2 direct retries)
4. **Complex errors** (solver divergence, numerical instability): Invoke `cfd-reviewer`
   in error diagnosis mode:
   - Pass: error output, error_history from previous attempts, mathematical specification
   - Reviewer classifies root cause and recommends specific fix
   - Check error_history to avoid recommending previously-failed fixes
4b. **Wall-clock timeout**: If the simulation hit the kill threshold from Section 15b:
   - If `run_purpose == debug`: The mesh is likely too fine. Reduce element count
     by 50% and retry. Do NOT escalate to cfd-reviewer for this error type.
   - If `run_purpose == production`: Invoke cfd-reviewer for diagnosis. Common
     causes: insufficient Newton continuation steps, solver parameter tuning needed.
5. **Regenerate code** with reviewer feedback applied
6. **Track** error_history (append each attempt: error_type, error_message, fix_attempted,
   fix_outcome)
7. **After 3 total retries**: Escalate to user with full error summary:

EXECUTION FAILED after 3 attempts.
Error type: [classification]
Attempts:

- [fix attempted] -> [outcome]

- [fix attempted] -> [outcome]

- [fix attempted] -> [outcome]
Recommendation: [suggested next step]

**Quality Gate 2b**: Flow validation passes (mass conservation, convergence, no NaN/Inf).

**Handoff**: Write Phase 2 flow-result handoff to `{session_dir}/handoffs/phase2-flow-result.yaml`

---

## 10. Phase 3 -- Transport Planning, Code Generation, and Execution

Mirrors Phase 2 structure with transport-specific content.

### Step 3.1: Swarm Discussion (FULL Mode Only)

Same pattern as Step 1.1 with Swarm 3 template from `references/swarm-framing-templates.md`.
Perspective agents for Phase 3: Numerical Analyst, Physical Modeler, Validation
Strategist (see Section 8b phase-perspective mapping).

### Step 3.2: Mathematical Analysis

Invoke `cfd-mathematician` via Task tool:
- **Input**: Transport parameters (Pe, Da, species properties), velocity field summary
from Phase 2, swarm synthesis (if FULL mode)
- **Task**: Write transport variational form with SUPG, analyze stabilization parameter
(Pe-dependent xi formula), verify regularized MM convergence properties, estimate
transport convergence order
- **Timeout**: 8 minutes

### Step 3.3: Adversarial Review

Invoke `cfd-reviewer` via Task tool:
- **Input**: Transport plan from Steps 3.1-3.2, mathematician handoff output
- **Task**: Challenge SUPG parameter for overflow risk (Pe > 710 with cosh/sinh), check
MM regularization epsilon adequacy, verify Robin BC formulation for membrane, check
Newton solver convergence expectations
- **Timeout**: 8 minutes

### Quality Gate 3a: Transport Plan Approved

Same pattern as Quality Gates 1 and 2a:
- Mechanical checks + reviewer approval
- Max 1 retry on rejection, then escalate to user

### Step 3.4: Code Generation (Orchestrator-Owned)

1. Read reference file sections per agent-loading-guide.md Phase 3 map
2. Apply mesh sizing from Section 15b for any transport-specific mesh refinement
(e.g., near-membrane refinement for SUPG accuracy). Use `run_purpose` to select
the appropriate element count range. Ensure >= 3 elements across thin features
(membranes, boundary layers) even in debug mode.
3. Generate transport script from Phase 3 validated plan:
- Define P1 function space for concentration
- Estimate Peclet number: Pe = |u| * h / (2D)
- Implement SUPG stabilization (see Section 13 -- ALWAYS rules)
- Implement regularized Michaelis-Menten (see Section 13 -- ALWAYS rules)
- Implement membrane permeation as Robin BC (Fick's law)
- Use Newton solver (nonlinear due to Michaelis-Menten)
- Import NonlinearProblem from `dolfinx.fem.petsc`, NewtonSolver from `dolfinx.nls.petsc`
- Include post-solve checks: negative concentration warning, species conservation
4. Apply ALL code generation rules from Section 13
5. Run syntax check

### Step 3.5: Execution and Validation

1. Execute transport script (Bash for 2D, Write for 3D)
2. Validate:
- Newton solver converged
- Species conservation: |inlet_flux + outlet_flux + membrane_flux + total_reaction| &#x3C; 1%
  of total reaction
- min(c) check: warn if significantly negative (suggests insufficient mesh resolution)

### Step 3.5b: Self-Correction Loop

Same pattern as Step 2.5b, including the `wall_clock_timeout` error type. Invoke
cfd-reviewer in error diagnosis mode for complex errors. For wall-clock timeouts:
if `run_purpose == debug`, reduce element count by 50%; if `run_purpose == production`,
invoke cfd-reviewer. Max 3 total retries, then escalate to user.

**Quality Gate 3b**: Transport validation passes (species conservation, Newton convergence,
no large negative concentration regions).

**Handoff**: Write Phase 3 transport-result handoff to
`{session_dir}/handoffs/phase3-transport-result.yaml`

---

## 11. Phase 4 -- Post-Processing and Visualization

**Orchestrator-owned** (no agent involvement).

### Step 4.1: Visualization Scripts

1. Read `references/fenicsx-patterns.md` Section 12 (PyVista patterns)
2. Generate visualization script with:
- Velocity magnitude contour plot
- O2 concentration field with depletion zones highlighted
- Streamlines from inlet (3D)
- Cross-section slices at multiple positions (3D)
- Centerline line probe (O2 vs. position)
- Screenshots saved to PNG files
- VTK files saved for ParaView
- Headless fallback: `if not pyvista.system_supports_plotting(): pyvista.OFF_SCREEN = True`
3. Execute 2D visualizations; save 3D scripts for manual execution

### Step 4.2: Mesh Convergence Study (Optional -- for Publication Quality)

1. Read `references/validation-benchmarks.md` Section 4 (convergence protocol)
2. Generate convergence study script:
- 4 mesh refinement levels: h, h/2, h/4, h/8 (or 3 for 3D)
- Solve on each level
- Compute QoI: outlet average O2, max velocity, min O2 in cell region
- Richardson extrapolation for grid-independent value
- Convergence rate: p = log(e_{i-1}/e_i) / log(2)
- Non-monotone convergence detection and warning

**Quality Gate 4a** (convergence study):
- Monotone convergence (error decreases with refinement)
- Convergence rate matches expected order (O(h^2) for P1 concentration, O(h^3) for P2 velocity)
- QoI changes &#x3C; 1% between last two levels

---

## 12. Phase 5 -- Summary and Deliverables

1. Collect all handoff documents from `{session_dir}/handoffs/`
2. Report to user:
- What was simulated (geometry, physics, parameters)
- Key agent decisions (mathematician recommendations, reviewer concerns, resolutions)
- Validation results (mass conservation, species conservation, convergence)
- Files produced (scripts, XDMF checkpoints, VTK files, PNG images)
3. Offer to preserve or clean up session directory
4. Update state file with `status: "complete"`

---

## 13. Code Generation Protocol

When generating FEniCSx Python scripts, ALWAYS follow these rules. These rules are
embedded inline because they guard against code-level errors that produce silently
wrong results. They must always be in the orchestrator's context.

### ALWAYS Include

1. **Version assertion header** at the top of every script:
```python
import importlib.metadata as _meta
_ver = tuple(int(x) for x in _meta.version("fenics-dolfinx").split(".")[:2])
assert _ver >= (0, 10), f"Requires FEniCSx >= 0.10. See references/environment-setup.md."

- 
Reproducibility metadata header: date, dolfinx version, basix version, gmsh version,
numpy version, Python version, OS, all physical parameters

- 
Complete, self-contained scripts: Every script must run standalone from the command
line. Never generate code fragments that require manual assembly.

- 
Inline comments explaining each step: what it does, why it is needed, what the
expected output is

- 
Regularized Michaelis-Menten: Always use c_pos = (c + sqrt(c^2 + eps^2)) / 2

with eps = 1e-10 * c_inlet
. Never use the raw c / (Km + c)
 form.

- 
SUPG stabilization for all species transport (check Pe first, but enable by default):
Use the numerically stable form xi = conditional(gt(Pe, 1.0), 1.0 - 1.0/Pe, Pe/3.0)
.
Never use cosh/sinh (overflows for Pe > 710).

- 
Post-solve quality checks: mass conservation, species conservation, negative
concentration warning, NaN/Inf detection

- 
Save results to XDMF (for checkpointing) and VTK/PVD (for PyVista/ParaView):

with io.XDMFFile(comm, "results.xdmf", "w") as xdmf:
    xdmf.write_mesh(domain)
    xdmf.write_function(uh, 0.0)

NEVER Do

- Generate code fragments (always complete scripts)

- Use from dolfin import *
 (legacy FEniCS, not FEniCSx)

- Use dolfinx.io.gmshio
 (v0.10 renamed to dolfinx.io.gmsh
)

- Use raw Michaelis-Menten c / (Km + c)
 without regularization

- Use cosh(Pe)/sinh(Pe)
 for SUPG parameter (numerical overflow)

- Skip physical group definition before meshing

- Forget gmsh.model.occ.synchronize()
 after OCC operations

- Use equal-order elements (P1/P1) for flow without stabilization

- Omit the pressure reference point (at least one pressure BC needed)

Pattern Library

Use code patterns from references/fenicsx-patterns.md
 as the primary source for all
generated code. These patterns have been validated against FEniCSx v0.10 API.

Use parameter values from references/physics-models.md
 lookup tables for bioreactor-specific
constants (fluid properties, diffusion coefficients, Michaelis-Menten kinetics by cell type).

Additional Code Generation Rules

- Taylor-Hood P2/P1 for Stokes/Navier-Stokes flow (standard inf-sup stable pair)

- Newton continuation for Re > 10: Solve Stokes for initial guess, ramp Re through
intermediate values [1, 10, 50, target], use previous solution as initial guess

- MUMPS for &#x3C; 50K DOFs (direct solver), GMRES+ILU for larger problems (iterative)

- NonlinearProblem from dolfinx.fem.petsc
, NewtonSolver from dolfinx.nls.petsc

- eps = 1e-10 * c_inlet for Michaelis-Menten regularization parameter

14. Error Handling Protocol

Simulation Error Handling

When errors occur during simulation, follow this diagnostic sequence:

Error Stage
Symptom
First Action
Reference

Stage 0: Environment
ImportError, ModuleNotFoundError
Run pre-flight check
troubleshooting-guide.md Stage 0

Stage 1: Geometry/Mesh
OCC exception, 0 volumes
Check STEP file, try defeaturing
troubleshooting-guide.md Stage 1

Stage 2: Physics Setup
TypeError in BCs, shape mismatch
Check function space indices
troubleshooting-guide.md Stage 2-3

Stage 3: Flow Solver
Newton divergence
Stokes initial guess -> ramp Re -> reduce relaxation
troubleshooting-guide.md Stage 4

Stage 3: Flow Solver
MUMPS pivot error
Switch to iterative solver
troubleshooting-guide.md Stage 4

Stage 3: Flow Solver
PETSc OOM
Reduce mesh, use iterative solver
troubleshooting-guide.md Stage 4

Stage 4: Transport
Negative concentrations
Check SUPG active, refine mesh near sinks
troubleshooting-guide.md Stage 4

Stage 5: Validation
Conservation violated > 1%
Refine mesh, check BCs, check form
troubleshooting-guide.md Stage 5

Stage 6: Visualization
Empty plot, PyVista crash
Check data arrays, try headless mode
troubleshooting-guide.md Stage 5-6

Solver Divergence Recovery Sequence

If the Newton solver diverges, try these fixes in order:

- Use Stokes solution as initial guess (most common fix)

- Ramp Reynolds number through intermediate values: [1, 10, 50, target]

- Reduce relaxation factor to 0.5 (then 0.3, then 0.1)

- Switch to Picard iteration instead of Newton

- Refine mesh near boundaries and high-gradient regions

- If all fail: suggest the user simplify the geometry or reduce Re

Always provide actionable diagnostic messages. Never just report an error code.

Orchestration Error Handling

Failure Mode
Detection
Response
Escalation

Agent timeout
Task tool timeout reached
Retry once with simplified prompt
Proceed with degraded mode

Malformed handoff
YAML parse failure or missing required fields
Retry agent with explicit format feedback
Proceed without that agent's input

Swarm confidence &#x3C; 3/5
confidence_score
 field in synthesis
Flag to user as informational
User decides whether to proceed

Context approaching limit
Phase number > 2 in FULL mode
Summarize previous handoffs to 200-token summaries
Skip remaining swarm invocations

Session directory missing
File check before each phase
Attempt recreation from state YAML
User informed; may need restart

Agent skill not found
SKILL.md existence check in Phase 0
Degrade gracefully per Section 1
User told to run sync-config.py

15. Timeout Configuration

Per-Agent Timeouts

Agent
Timeout
Fallback on Timeout

Perspective agent (each)
5 min
Proceed with completed perspectives (min 3 of 5)

Synthesis agent
5 min
Orchestrator extracts insights manually from perspective files

cfd-mathematician
8 min
Retry once with simplified prompt; if still fails, use reference file defaults

cfd-reviewer
8 min
Retry once simplified; if still fails, APPROVED_WITH_WARNINGS + log "NO ADVERSARIAL REVIEW"

Per-Phase Execution Timeouts

Problem Type
Expected Runtime
Execution Strategy

2D validation (Tier 1)
&#x3C; 2 minutes
Bash (timeout=300000)

2D coupled (Tier 2)
2-10 minutes
Bash (timeout=600000)

3D coarse validation
5-15 minutes
Bash (timeout=600000) if small; Write+manual otherwise

3D production (Tier 3)
30 min - 4 hours
Write script only. User runs manually.

3D with convergence (Tier 4)
4-24 hours
Write script only. User runs with MPI.

Mesh convergence study
4x single-run time
Write script only for 3D.

Per-Phase Agent Invocation Budgets (FULL Mode)

Phase
Budget
On Exceeded

Phase 1 (Mesh Planning)
25 minutes
Skip remaining retries; use current best plan with APPROVED_WITH_WARNINGS

Phase 2 (Flow Planning)
30 minutes
Skip remaining retries; proceed with current plan

Phase 3 (Transport Planning)
30 minutes
Skip remaining retries; proceed with current plan

These budgets include all agent invocations within the phase (perspectives,
mathematician, reviewer, and any retries). If the budget is exceeded before
reviewer approval, proceed with the current best plan and log:
"Phase {N} time budget exceeded. Proceeding with current plan."

Cumulative budget: If total agent invocation time across Phases 1-3 exceeds
90 minutes in FULL mode, auto-downgrade remaining phases to LITE mode (skip
perspective agents) and summarize all previous handoffs to 200-token summaries.

Rule: If estimated runtime exceeds 10 minutes, generate the script and instruct the
user to run it manually. Do not attempt to execute long-running simulations via Bash.

For 3D simulations: Always include MPI instructions:

mpirun -np 4 python simulation.py

15b. Simulation Runtime Management

15b.1 Debug vs. Production Mesh Sizing

Simulation mesh sizing defaults are tied to the complexity tier and run_purpose
.
The orchestrator determines run_purpose
 during Phase 0 (Step 0.5) based on user
intent and tier selection. There are two run purpose levels:

- debug
: First run, error recovery, parameter exploration. Use coarsest mesh that
captures qualitative physics.

- production
: Publication-quality results. Use fine mesh with convergence study.

Tier-aware mesh sizing defaults:

Tier
Run Purpose
Elements (2D)
Elements (3D)
Rationale

1
Always debug
200-500
N/A (2D only)
Pipeline validation; exact solution known

2
Always debug
500-2,000
N/A (2D only)
Qualitative physics check; fast iteration

3
Debug (first run)
N/A
5,000-20,000
Coarse 3D captures flow structure

3
Production
N/A
50,000-200,000
Quantitative accuracy for publication

4
Debug (first run)
N/A
10,000-50,000
Validate before committing hours

4
Production
N/A
200,000-1,000,000+
Publication quality with convergence study

Geometric feature resolution rule: Regardless of run_purpose, ensure >= 3 elements
across the thinnest geometric feature (membranes, thin channels, boundary layers). If
the default element count range for a given tier and run_purpose cannot achieve this,
increase the element count to the minimum that satisfies this constraint and warn the
user: "Debug mesh increased from [default] to [adjusted] elements to resolve [feature]
(>= 3 elements across thinnest feature required)."

Orchestrator behavior:

- During Phase 0 (Step 0.5), set run_purpose
 based on tier and session history

- During Phase 1 (mesh planning), pass run_purpose
 to cfd-mathematician with
the corresponding element count range as a constraint

- During code generation (Steps 2.4/3.4), use the mesh sizing defaults from
this table unless the mathematician or reviewer recommends otherwise

- If run_purpose == debug
, skip Phase 4 mesh convergence study entirely

Override: The user can override run purpose at any time:

Current run purpose: debug (Tier 3, first run)
Override to production? This will increase mesh from ~10K to ~100K elements
and runtime from ~2 minutes to ~30-60 minutes.

15b.2 Solver Convergence Assessment

During execution (Steps 2.5, 3.5), monitor solver convergence and apply these
diagnostic thresholds:

Newton solver convergence:

Metric
Healthy
Warning
Diverging

Residual norm after 5 iterations
&#x3C; 1e-4
1e-4 to 1e-1
> 1e-1 or increasing

Residual reduction per iteration
> 10x
2x-10x
&#x3C; 2x or oscillating

Total iterations to converge
&#x3C; 10
10-20
> 20 or not converging

Krylov solver convergence (GMRES/CG inner solves):

Metric
Healthy
Warning
Diverging

Krylov iterations per Newton step
&#x3C; 100
100-500
> 500

Krylov residual stagnation
N/A
Stalls for > 50 iterations
Residual increases

Action on convergence signals:

- Healthy: Continue normally

- Warning: Log warning, continue, but flag for user in Phase 5 summary.
If run_purpose == debug
, suggest "convergence marginal on debug mesh --
may improve on finer production mesh"

- Diverging: Enter self-correction loop (Step 2.5b/3.5b). If first attempt
on a debug mesh, do NOT refine mesh as first fix -- instead try solver
parameter adjustments (relaxation, Picard, continuation ramp)

15b.3 Mesh Convergence Decision Guide

Phase 4 mesh convergence study (Section 11, Step 4.2) is currently marked as
optional. This section provides guidance on WHEN to run it.

Run mesh convergence study when:

- run_purpose == production
 AND tier >= 3

- User explicitly requests publication-quality results

- QoI sensitivity: if a 10% mesh coarsening changes QoI by > 5%, convergence
study is needed

Skip mesh convergence study when:

- run_purpose == debug
 (always skip)

- Tier 1-2 (analytical solution available for validation instead)

- User explicitly declines ("I just need qualitative results")

Convergence assessment criteria (extending Quality Gate 4a):

- QoI change between last two refinement levels &#x3C; 1% (existing criterion)

- Observed convergence rate within 0.5 of theoretical order (new criterion):

- P1 concentration: expect p ~ 2.0, accept 1.5-2.5

- P2 velocity: expect p ~ 3.0, accept 2.5-3.5

- If convergence rate is anomalously low (p &#x3C; 1.0): flag as potential
singularity or insufficient boundary layer resolution

15b.4 Wall-Clock Time Limits

Prevent simulations from running indefinitely by setting wall-clock expectations
per tier and detecting stalls.

Expected wall-clock times (serial execution, single core; times are for
simulation execution only, not including agent discussion time):

Tier
Run Purpose
Mesh + Flow
Transport
Total Expected
Kill Threshold (3x)

1
debug
30s
N/A
30s
90s

2
debug
1-3 min
1-2 min
2-5 min
15 min

3
debug
2-5 min
2-5 min
5-10 min
30 min

3
production
15-30 min
10-20 min
30-60 min
3 hours

4
debug
5-15 min
5-10 min
10-25 min
75 min

4
production
1-4 hours
30 min-2 hours
2-6 hours
18 hours

Kill threshold: If wall-clock time exceeds 3x the expected total for
the tier and run purpose, the simulation is likely stalled or diverging.

Orchestrator behavior when kill threshold is reached:

- If running via Bash: timeout will have already killed the process

- If running manually (3D production): include a timeout check in the
generated script:
import time
_start_time = time.time()
_kill_threshold_s = {kill_threshold_seconds}

# Inside solver loop:
if time.time() - _start_time > _kill_threshold_s:
    print(f"TIMEOUT: Simulation exceeded {_kill_threshold_s}s kill threshold.")
    print("Likely causes: mesh too fine for debug, solver diverging, or")
    print("insufficient Newton continuation steps.")
    print("Try: reduce mesh size, increase continuation ramp steps,")
    print("or switch to Picard iteration.")
    sys.exit(1)

- Report timeout cause in Phase 5 summary with specific recommendations

Stall detection heuristic (for solver progress monitoring):

- If solver has not reduced residual by > 10x in the last 5 minutes of wall-clock
time (for production runs) or 30 seconds (for debug runs), flag as stalled

- Stall is different from slow convergence: stall means no progress at all

16. Quality Gate Escalation Protocol

When the mathematician and reviewer cannot agree after 1 retry, present the disagreement
to the user in this structured format:

AGENT DISAGREEMENT: [Phase Name]

Mathematician recommends: [1-2 sentence summary]
Rationale: [1 sentence]

Reviewer objects: [1-2 sentence summary]
Concern: [1 sentence]

Options:
(A) Proceed with mathematician's recommendation (risk: [reviewer's concern])
(B) Proceed with reviewer's preferred approach (trade-off: [mathematician's concern])
(C) Use conservative approach: [orchestrator's auto-generated compromise]
(D) Provide your own parameters

Recommended: (C) [brief justification]

Conservative compromise rule: Always take the more cautious choice from both agents --
finer mesh, more stable solver, lower tolerance, more regularization.

Escalation sequence:

- First rejection: Reviewer's blocking_issues
 become HARD CONSTRAINTS for mathematician retry

- After 1 retry still REJECTED: Escalate to user with the structured format above

- User selects option (A/B/C/D) or provides custom parameters

- Orchestrator proceeds with user's choice

17. Orchestrator Context Management

The orchestrator loads reference file sections on a per-phase basis to control context
window consumption. Agent discussion details stay in handoff files on disk -- they are
NOT loaded back into the orchestrator context.

Phase
Load Into Context
Summarize from Previous
Do NOT Load

0
environment-setup.md Section 4
None
All other references

1
Agent handoffs (current phase)
Phase 0 summary
Full reference files

2
Phase 1 mesh-plan (full) + Phase 2 handoffs
Phase 0 summary
Phase 1 agent discussions

3
Phase 2 flow-result (full) + Phase 3 handoffs
Phases 0-1 summaries
Phase 2 agent discussions

4
Phase 3 transport-result
Phases 0-2 summaries
All agent discussions

5
Final handoffs only
All phases summarized
All discussions

Previous phase summaries: Max 200 tokens per phase. Include only: key decisions,
approval status, validation metrics. Not full discussion content.

Section 13 NEVER rules are always in context (they are embedded inline in this
SKILL.md, not in a reference file).

18. Phase Dependencies

Phase
Produces
Consumed By
Invalidation Rule

0
Session config, mode selection
All phases
Re-run Phase 0 if environment changes

1
mesh-plan handoff
Phase 2 (mesh code gen)
If Phase 1 re-runs, invalidate Phases 2-4

2
flow-result handoff
Phase 3 (velocity field)
If Phase 2 re-runs, invalidate Phases 3-4

3
transport-result handoff
Phase 4 (visualization)
If Phase 3 re-runs, invalidate Phase 4

4
Visualizations, convergence study
Phase 5 (summary)
Can re-run independently

5
User summary
None
Can re-run independently

Rule: If Phase N is re-executed, all phases > N are invalidated and must be re-run.

19. Complexity Tiers

Tier 1: Validation (5-15 minutes) --> DIRECT Mode

2D Poiseuille flow. Tests the entire pipeline without any user-specific physics.
Always start here when the user has never run a FEniCSx simulation before.
Default mesh sizing: 200-500 elements (2D). Always debug purpose.

See: examples/01-2d-channel-flow.md

Tier 2: Single Physics (15-30 minutes) --> LITE Mode

2D user geometry + O2 transport with Michaelis-Menten and membrane BCs.
Demonstrates the full two-phase workflow on a simple geometry.
Default mesh sizing: 500-2,000 elements (2D). Always debug purpose.

See: examples/02-2d-oxygen-transport.md

Tier 3: Coupled Multiphysics (1-4 hours) --> FULL Mode

2D or 3D STEP geometry + full coupled physics. Production-quality simulation with
iterative solvers, Newton continuation, and validation suite.
Default mesh sizing: Debug: 5,000-20,000 elements (3D). Production: 50,000-200,000 elements (3D).
First run always defaults to debug. Override to production after validation passes.

See: examples/03-3d-cartridge-template.md

Tier 4: Production 3D (4-24 hours) --> FULL Mode

Full 3D cartridge with mesh convergence study, publication-quality visualization,
and complete reproducibility metadata. Always run with MPI on a workstation.
Default mesh sizing: Debug: 10,000-50,000 elements (3D). Production: 200,000-1,000,000+ elements (3D).
First run always defaults to debug. Override to production after validation passes.

Progression rule: Suggest the user start at Tier 1 and progress upward. Do not
jump to Tier 3 or 4 without first validating the environment at Tier 1.

Mode mapping:

Tier
Default Mode
Rationale

1
DIRECT
Well-understood benchmark; agents add no value

2
LITE
Mathematician + reviewer sufficient for 2D

3
FULL
3D complexity benefits from swarm discussion

4
FULL
Production quality requires thorough review

20. Session State File Schema

State file location: /tmp/cfd-bioreactor-state-{session-id}.yaml

session:
  session_id: "20260221-143000-12345"
  session_dir: "/tmp/cfd-bioreactor-session-20260221-143000-12345"
  skill_base_path: "/Users/davidangelesalbores/repos/claude/claude-config/skills/cfd-bioreactor"
  mode: "LITE"                    # DIRECT | LITE | FULL
  tier: 2                         # 1-4
  run_purpose: "debug"            # debug | production
  handoff_declined: false         # Set to true if user declines programming-pm handoff
  current_phase: 2                # 0-5
  status: "running"               # running | paused | complete | failed

phases_completed: [0, 1]          # List of completed phase numbers
phases_valid: [0, 1]              # List of phases whose results are still valid

phase_results:
  phase_0:
    status: "complete"
    handoff_path: null
    timestamp: "2026-02-21T14:30:00Z"
  phase_1:
    status: "complete"
    handoff_path: "{session_dir}/handoffs/phase1-mesh-plan.yaml"
    reviewer_status: "APPROVED_WITH_WARNINGS"
    retries_used: 0
    timestamp: "2026-02-21T14:35:00Z"
  phase_2:
    status: "running"
    handoff_path: null
    reviewer_status: null
    retries_used: 0
    execution_retries_used: 0     # For self-correction loop

timestamps:
  started: "2026-02-21T14:30:00Z"
  last_updated: "2026-02-21T14:40:00Z"

errors: []                        # List of non-fatal errors/warnings encountered

21. Session Resume Protocol

On startup, check for existing state files:

- Search for /tmp/cfd-bioreactor-state-*.yaml
 with status != "complete"

- If found: display last completed phase, offer to resume
Found incomplete session from [timestamp].
Last completed phase: Phase [N] ([phase_name])
Mode: [DIRECT/LITE/FULL], Tier: [N]
Resume from Phase [N+1]? (yes/no/restart)

- If user chooses resume:

- Phase 0: Always re-run (fast, catches environment changes)

- Phases 1-4: Resumable if handoff files exist at recorded paths

- Verify handoff file existence before resuming

- If user chooses restart: Create new session, archive old state file

22. Integration with Other Skills

Need
Skill
How to Use

Quick feasibility estimate
calculator

Before full CFD: estimate Re, Pe, Da, O2 depletion depth. A 5-minute calculation can determine if CFD is even needed.

Literature parameter values
bioinformatician
 or researcher

Find Vmax, Km for specific cell types; diffusion coefficients in specific media.

Debug generated notebook
notebook-debugger

If user is running in Jupyter and encounters FEniCSx import/kernel issues.

Cross-check CFD results
calculator

After simulation: verify CFD results against analytical estimates. If they disagree by > 10x, investigate.

Mathematical analysis
cfd-mathematician

Via Task tool. Variational formulations, stability analysis, convergence estimates.

Adversarial review
cfd-reviewer

Via Task tool. Engineering review with severity-rated challenges and approval status.

Multi-perspective analysis
Perspective agents (direct)
Via parallel Task invocations. FULL mode only. 3-5 CFD-specific perspectives at each decision point.

Transition to software project
programming-pm

When simulation scripts evolve into a maintained software project (library packaging, tests/CI, multi-file project, parameter sweep framework). See Section 22b for handoff criteria.

22b. Programming-PM Handoff Criteria

The cfd-bioreactor orchestrator generates standalone simulation scripts. When the user's
needs evolve beyond one-off scripts into maintained software, recommend handoff to
programming-pm
 for software development coordination.

Handoff Trigger Conditions

Recommend programming-pm when ANY of the following are detected:

Trigger
Detection Heuristic
Example User Request

Library packaging
User mentions "package", "library", "module", "import from", "reusable", "API"
"Make this simulation into a library I can call with different parameters"

Test/CI requirements
User mentions "pytest", "CI", "continuous integration", "coverage", "unit test"
"Add unit tests for the simulation pipeline"

Version control integration
User mentions "git repo", "branch", "PR", "version", "release"
"Set up version control for the simulation code"

Multi-file project growth
Session has generated >= 4 scripts AND user requests shared utilities
"Extract the mesh generation into a shared module"

Parameter sweep framework
User wants automated parameter sweeps with result aggregation
"Run this simulation for 20 different inlet velocities and collect results"

Context Filtering Rules

To prevent false-positive handoff triggers, apply these filters:

- 
Phase filtering: NEVER evaluate handoff triggers during Phases 2-4 (active
computation). Only evaluate at phase boundaries (Phase 0, Phase 5) or during
user interaction between phases.

- 
Compound requirement: Require keywords from 2+ different trigger categories
(e.g., "library" + "test") OR a single unambiguous keyword that has no alternate
interpretation in CFD context (e.g., "package this as a library", "set up CI").

- 
Exclusion patterns: Do NOT treat these as handoff triggers:

- "test" in "test this simulation", "test run", "test different parameters"
(means "try", not "unit test")

- "git" in "let me check" or "let me get" (not version control)

- "version" in "FEniCSx version", "Python version" (not software versioning)

- "module" in "Python module not found" (import error, not packaging)

- 
Handoff decline suppression: If user selects option (B) Continue in the
handoff recommendation, set handoff_declined: true
 in session state. Do NOT
re-suggest programming-pm handoff for the remainder of this session.

Handoff Recommendation Format

When a trigger condition passes context filtering, present to the user:

HANDOFF RECOMMENDATION: Software Development Coordination

Your simulation work is evolving into a software project:
- Trigger: [which trigger was detected]
- Current state: [N scripts generated, current tier/mode]

Recommendation: Hand off to programming-pm for:
- Architecture design (module structure, shared utilities)
- Testing strategy (unit tests for simulation components)
- Code review and quality gates
- Version control integration

Options:
(A) Hand off to programming-pm now (recommended)
(B) Continue with cfd-bioreactor (standalone scripts only)
(C) Complete current simulation phase, then hand off

What Gets Handed Off

When handing off to programming-pm, provide:

- All generated scripts from {session_dir}/scripts/

- Session state summary (tier, mode, run_purpose, phases completed)

- Physical parameters and validation results

- List of scripts and their purposes

When NOT to Hand Off

Do NOT recommend programming-pm for:

- Standard tier progression (Tier 1 -> 2 -> 3 -> 4)

- Mesh convergence studies (these are cfd-bioreactor Phase 4)

- One-off parameter variations (just re-run with different inputs)

- Visualization enhancements (cfd-bioreactor Phase 4)

- Any session where handoff_declined
 is already true

23. Reference Files

This skill includes the following reference files. Read them as needed during workflow
execution -- do not load all at once. See references/agent-loading-guide.md
 for
section-level loading maps per agent.

File
When to Read
Purpose

references/environment-setup.md

Phase 0 (pre-flight)
Installation, pre-flight script, degradation modes

references/physics-models.md

Phases 2, 3 (code gen)
Equations, variational forms, parameter tables

references/mesh-generation-guide.md

Phase 1 (code gen)
STEP import, physical groups, refinement, quality

references/fenicsx-patterns.md

Phases 1-4 (all code gen)
FEniCSx v0.10 API patterns (the code library)

references/validation-benchmarks.md

Phases 2, 3 (validation)
Analytical solutions, convergence protocols

references/troubleshooting-guide.md

On error
Error catalog by stage, diagnostic commands

references/orchestrator-handoff-schema.md

Agent invocations
YAML handoff contracts for all agent communication

references/agent-loading-guide.md

Agent invocations
Maps agents to reference file sections to load

references/swarm-framing-templates.md

FULL mode swarms
Pre-written challenge templates for perspective agents

examples/environment.yml

Phase 0 (if not installed)
Conda environment specification

24. Notes

- 
Version pinning: All code patterns target FEniCSx v0.10. If a newer version is
released, the version assertion will catch the mismatch. Update patterns before using
with a new API version.

- 
SI units everywhere: All parameters are in SI (kg, m, s, mol, Pa) unless the user
explicitly specifies otherwise. If a STEP file appears to be in mm (bounding box > 1 m),
warn the user about potential unit mismatch.

- 
Serial execution by default: Generated scripts run in serial (single core). For 3D
production runs, add MPI instructions. The scripts are MPI-aware (use MPI.COMM_WORLD

throughout) and work correctly with mpirun -np N
.

- 
Reproducibility: Every generated script includes a metadata header with all version
numbers, parameters, and settings. Results can be reproduced by re-running the same
script in the same environment.

- 
This skill generates code but does not modify system packages: It writes Python
scripts to the user's working directory. It never installs packages, modifies micromamba
environments, or changes system configuration.

- 
Physical group convention: inlet=1, outlet=2, walls=3, membrane=4, cell_region=5,
fluid_volume=10. Consistent across all examples and generated scripts.

- 
gmsh synchronize: After ANY gmsh.model.occ.*
 operation (addBox, addCylinder,
importShapes, etc.), ALWAYS call gmsh.model.occ.synchronize()
 before accessing
entities, assigning physical groups, or meshing. Forgetting this is the single most
common gmsh error.

- 
v2.0 multi-agent orchestrator: This is v2.0 of the cfd-bioreactor skill. v1.0 was
a single-context generate-explain-validate skill; v2.0 is a multi-agent orchestrator
with cfd-mathematician, cfd-reviewer, and decomposed perspective agent coordination.

cfd-bioreactor

Safety Notice

Copy this and send it to your AI assistant to learn

{perspective_name} Perspective

Key Insight

Supporting Analysis

Confidence: [1-5]

Blind Spots

Source Transparency

Related Skills

bioinformatician

procurement

mathematician

statistician