# Autonomous Agentic Loop Skill Creator
Create task-specific skills that enable Claude Code to run autonomously for hours, persisting state across context windows and recovering from failures gracefully.
## Core Concept
The Ralph Wiggum Loop externalizes agent memory to the filesystem. A bash while loop continuously invokes Claude Code with the same prompt. Progress persists in files and git — not in the context window. When context fills, a fresh instance continues by reading filesystem state.
```
┌─────────────────────────────────────────────────────────┐
│                    OUTER LOOP (bash)                    │
│      while true; do claude --print "$PROMPT"; done      │
└─────────────────────────────────────────────────────────┘
                   │                  ▲
                   ▼                  │
┌─────────────────────────────────────────────────────────┐
│                  CLAUDE CODE SESSION                    │
│  1. Read state files (feature_list.json, progress.txt)  │
│  2. Pick ONE task                                       │
│  3. Implement → Test → Verify                           │
│  4. Update state files                                  │
│  5. Git commit                                          │
│  6. Exit (Stop hook validates completion)               │
└─────────────────────────────────────────────────────────┘
                   │                  ▲
                   ▼                  │
┌─────────────────────────────────────────────────────────┐
│                   PERSISTENT STATE                      │
│  • feature_list.json - Task tracking (pass/fail)        │
│  • progress.txt      - Session handoff notes            │
│  • .claude/hooks/    - Validation & control             │
│  • Git history       - Rollback capability              │
└─────────────────────────────────────────────────────────┘
```
## Creating a Task-Specific Loop Skill

### Step 1: Define the Goal Structure

Determine what "done" means for your autonomous task:
| Goal Type | State File Format | Example |
|---|---|---|
| Feature list | JSON with `passes: boolean` | Building an app feature-by-feature |
| Checklist | Markdown with `- [x]` items | Migration or refactoring tasks |
| Test suite | JSON with test results | Test-driven development |
| Spec compliance | JSON with requirement IDs | Implementing a specification |
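For the checklist goal type, "done" can be computed straight from the markdown. A minimal sketch, assuming the `- [x]` / `- [ ]` convention in the table (the helper name is illustrative):

```python
import re

def checklist_complete(text: str) -> bool:
    """True when the markdown has at least one task item and none are unchecked."""
    # Match lines like "- [x] migrate users" or "- [ ] migrate orders"
    items = re.findall(r"^- \[([ x])\]", text, flags=re.MULTILINE)
    return bool(items) and all(mark == "x" for mark in items)
```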
### Step 2: Design State Files

Create state files that survive context boundaries. Use JSON for structured data (it prevents accidental edits to descriptions):

`feature_list.json` - the agent may ONLY change the `passes` field:

```json
{
  "features": [
    {"id": "auth-001", "name": "User login", "passes": false},
    {"id": "auth-002", "name": "Password reset", "passes": false}
  ]
}
```
`progress.txt` - human-readable session notes:

```text
Last Session: 2025-02-04 14:30
- Completed: auth-001 (user login)
- Current: auth-002 (password reset) - blocked on email service
- Next: auth-003 (session management)
- Constraints: Using bcrypt, no plaintext passwords
```
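State files stay trustworthy when the agent's write path is narrow. A sketch of a guarded update helper, assuming the `feature_list.json` schema above (`mark_passing` is a hypothetical name, not one of the skill's scripts):

```python
import json

def mark_passing(path: str, feature_id: str) -> bool:
    """Flip only the 'passes' flag for one feature; touch nothing else."""
    with open(path) as fh:
        data = json.load(fh)
    for feat in data["features"]:
        if feat["id"] == feature_id:
            feat["passes"] = True
            with open(path, "w") as fh:
                json.dump(data, fh, indent=2)
            return True
    return False  # unknown id: refuse to write anything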
### Step 3: Configure Hooks

Hooks control agent behavior at lifecycle points. See `references/hooks-reference.md` for complete patterns.

Essential hooks for autonomous loops:

- **Stop hook** - prevents premature exit until completion criteria are met
- **PostToolUse hook** - auto-format and auto-test after edits
- **PreToolUse hook** - blocks dangerous operations

Minimal Stop hook for feature-list validation:

```json
{
  "hooks": {
    "Stop": [{
      "matcher": "",
      "hooks": [{
        "type": "command",
        "command": "python3 scripts/validate_completion.py"
      }]
    }]
  }
}
```
### Step 4: Create the Loop Script

```bash
#!/bin/bash
# run_loop.sh - The outer autonomous loop

PROMPT=$(cat <<'EOF'
Read feature_list.json and progress.txt.
Pick ONE incomplete feature (passes: false).
Implement it fully with tests.
Run tests to verify.
Only mark passes: true after ALL tests pass.
Update progress.txt with session notes.
Commit with descriptive message.
EOF
)

# Safety limits
MAX_ITERATIONS=50
ITERATION=0

while [ "$ITERATION" -lt "$MAX_ITERATIONS" ]; do
  echo "=== Iteration $ITERATION ==="

  # Check if all features are complete
  if python3 -c "import json; f=json.load(open('feature_list.json')); exit(0 if all(x['passes'] for x in f['features']) else 1)"; then
    echo "All features complete!"
    exit 0
  fi

  # Run Claude Code
  claude --print --dangerously-skip-permissions "$PROMPT"
  EXIT_CODE=$?

  if [ "$EXIT_CODE" -ne 0 ]; then
    echo "Claude exited with error: $EXIT_CODE"
  fi

  ITERATION=$((ITERATION + 1))
done

echo "Reached max iterations"
exit 1
```
### Step 5: Implement Validation Scripts

```python
#!/usr/bin/env python3
"""Stop hook validator - blocks exit until criteria met."""

import json
import subprocess
import sys


def check_tests_pass():
    """Run the test suite; return True if all tests pass."""
    result = subprocess.run(
        ["npm", "test", "--", "--passWithNoTests"],
        capture_output=True, text=True,
    )
    return result.returncode == 0


def check_feature_verified():
    """Ensure the most recently passing feature was actually tested."""
    with open("feature_list.json") as f:
        features = json.load(f)["features"]

    passing = [feat for feat in features if feat["passes"]]
    if not passing:
        return True  # No claims to verify yet

    latest = passing[-1]
    # Add your verification logic for `latest` here
    return True


def main():
    result = {"decision": "allow"}

    if not check_tests_pass():
        result = {
            "decision": "block",
            "reason": "Tests failing. Fix before continuing."
        }
    elif not check_feature_verified():
        result = {
            "decision": "block",
            "reason": "Feature marked passing but not verified."
        }

    print(json.dumps(result))
    return 0 if result["decision"] == "allow" else 2


if __name__ == "__main__":
    sys.exit(main())
```
## Skill Structure for Your Task

When creating a task-specific loop skill, organize as:

```
my-loop-skill/
├── SKILL.md                    # Task-specific instructions
├── scripts/
│   ├── run_loop.sh             # Outer loop script
│   ├── validate_completion.py
│   └── circuit_breaker.py
├── references/
│   └── task-details.md         # Domain-specific guidance
└── assets/
    └── templates/
        ├── feature_list.json
        ├── progress.txt
        └── hooks.json
```
## Critical Patterns

### Preventing Failure Modes

| Failure Mode | Symptom | Prevention |
|---|---|---|
| Amnesia Loop | Same file researched repeatedly | Track visited files in progress.txt |
| Premature Completion | "Done!" without verification | Stop hook validates all tests pass |
| Context Rot | Degraded reasoning after compaction | Clear context every 30 min or at 70% usage |
| Token Burn | $100+ spent on an impossible task | MAX_ITERATIONS limit in loop script |
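The context-rot guard can also be enforced from outside the session by bounding each session's wall-clock time. A sketch, assuming the same `claude --print` invocation as the loop script (the `argv` parameter is only there to make the helper testable, not part of the skill):

```python
import subprocess

SESSION_LIMIT_SECONDS = 30 * 60  # the 30-minute guideline from the table

def run_session(prompt, argv=("claude", "--print"), limit=SESSION_LIMIT_SECONDS):
    """Run one session; kill it and report failure if it outlives its budget."""
    try:
        return subprocess.run([*argv, prompt], timeout=limit).returncode
    except subprocess.TimeoutExpired:
        return 124  # same exit convention as GNU timeout
```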
### The Initializer Pattern

For complex projects, use a two-phase approach.

**Initializer session** (run once):

- Create `feature_list.json` with all requirements
- Write `init.sh` for dev server setup
- Create an initial `progress.txt`
- Make a baseline git commit

**Coding sessions** (run in loop):

- Read state files
- Work on ONE feature
- Verify before marking complete
- Commit and update notes
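The initializer steps above can be sketched as a small script; the file contents here are illustrative placeholders, not the skill's real templates:

```python
import json
import shutil
import subprocess
from pathlib import Path

def init_project(root="."):
    """Write baseline state files; make a baseline commit if git is available."""
    root = Path(root)
    features = {"features": [
        {"id": "example-001", "name": "Replace me", "passes": False},
    ]}
    (root / "feature_list.json").write_text(json.dumps(features, indent=2))
    (root / "progress.txt").write_text(
        "Last Session: (none yet)\n- Next: example-001\n")
    (root / "init.sh").write_text("#!/bin/bash\n# Start the dev server here\n")
    if shutil.which("git"):
        # check=False: outside a repo this quietly does nothing
        subprocess.run(["git", "add", "-A"], cwd=root, check=False)
        subprocess.run(
            ["git", "commit", "-m", "chore: baseline for autonomous loop"],
            cwd=root, check=False)
```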
### Circuit Breaker Implementation

See `references/loop-prevention.md` for complete patterns.

`circuit_breaker.py` - detect stagnation:

```python
STAGNATION_SIGNALS = {
    "no_file_changes": 3,    # consecutive loops
    "same_error": 5,         # consecutive occurrences
    "output_decrease": 0.7,  # 70% drop from previous
}
```
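A sketch of how those thresholds might be consumed, assuming the loop records one small dict per iteration (this history format is an assumption for illustration, not part of the skill):

```python
STAGNATION_SIGNALS = {
    "no_file_changes": 3,    # consecutive loops
    "same_error": 5,         # consecutive occurrences
    "output_decrease": 0.7,  # 70% drop from previous
}

def should_trip(history):
    """history: per-iteration dicts, newest last, with keys
    'files_changed' (int), 'last_error' (str | None), 'output_bytes' (int)."""
    n = STAGNATION_SIGNALS["no_file_changes"]
    if len(history) >= n and all(h["files_changed"] == 0 for h in history[-n:]):
        return True  # nothing written for n straight loops
    m = STAGNATION_SIGNALS["same_error"]
    if len(history) >= m:
        errors = {h["last_error"] for h in history[-m:]}
        if len(errors) == 1 and None not in errors:
            return True  # identical error m times running
    if len(history) >= 2:
        prev, cur = history[-2]["output_bytes"], history[-1]["output_bytes"]
        if prev > 0 and cur <= prev * (1 - STAGNATION_SIGNALS["output_decrease"]):
            return True  # output collapsed versus the previous iteration
    return False
```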
## CLAUDE.md Configuration

Keep it under 300 lines. Include only:

```markdown
# Project: [Name]

## Tech Stack
- [List technologies]

## Commands
- Build: `npm run build`
- Test: `npm test`
- Lint: `npm run lint`

## Autonomous Loop Protocol
- Read feature_list.json and progress.txt first
- Work on ONE feature per session
- Run tests before marking complete
- Update progress.txt before exiting
- Commit with format: `feat(scope): description`

## IMPORTANT
- Never mark passes: true without verified tests
- If stuck after 3 attempts, document the blocker and move on
- Check progress.txt for constraints from previous sessions
```
## Quick Reference

| Component | Purpose | Location |
|---|---|---|
| Loop script | Outer while loop | `scripts/run_loop.sh` |
| Stop hook | Prevent premature exit | `.claude/hooks.json` |
| Feature list | Track progress | `feature_list.json` |
| Progress notes | Session handoff | `progress.txt` |
| Circuit breaker | Detect stagnation | `scripts/circuit_breaker.py` |
## Additional References

- **Hooks Reference** - complete hook patterns (`references/hooks-reference.md`)
- **CLAUDE.md Patterns** - configuration best practices
- **MCP Patterns** - tool integration strategies
- **Loop Prevention** - circuit breakers and recovery (`references/loop-prevention.md`)
## Initialization

Use the initialization script to scaffold a new task-specific loop skill:

```bash
python3 scripts/init_loop_skill.py <skill-name> --goal-type feature-list
```

Options for `--goal-type`: `feature-list`, `checklist`, `test-suite`, `spec-compliance`.