Agent Builder Skill
Build and deploy Claude Agent SDK applications to your Hetzner VPS.
Metadata
-
Name: agent-builder
-
Version: 1.1.0
-
Author: KnearMe
-
Tags: claude, agent-sdk, skills, vps, deployment
When to Use
Use this skill when:
-
Creating a new AI agent application (coding or non-coding use cases)
-
Designing agent behavior, escalation paths, and decision boundaries
-
Deploying agents to the Hetzner VPS
-
Setting up multi-agent architectures
-
Troubleshooting agent deployments
Non-Dev First Principle
Always consider non-development use cases. Many high-value agents are workflow-focused, not code-focused:
-
Customer support triage
-
Sales qualification
-
Operations checklists
-
Marketing content coordination
-
Finance/QA reviews
-
Hiring or onboarding workflows
If the request is non-dev or mixed, start with discovery and workflow design before writing code.
Start Here (Design → Build)
-
Define the outcome, user, and trigger → references/agent-design-guide.md
-
Choose an architecture pattern → references/architectures.md
-
Select tools with least-privilege → references/tool-development.md
-
Define roles + escalation rules → references/role-definition.md
-
Instrument + improve → references/performance-monitoring.md
- references/improving-agents.md
-
Add UX if needed → references/ui-development.md
-
Test before shipping → references/testing-guide.md
Reference Map (Progressive Disclosure)
-
references/agent-design-guide.md — Discovery, use-case fit, tool decision trees, anti-patterns
-
references/architectures.md — Single, orchestrator+subagents, multi-agent fleets
-
references/role-definition.md — Role prompts, delegation patterns, escalation triggers
-
references/tool-development.md — Tool selection, MCP tools, guardrails
-
references/performance-monitoring.md — Metrics, budgets, alerts, logs
-
references/improving-agents.md — Feedback loops, evals, regression tests
-
references/ui-development.md — Chat UX patterns + component examples
-
references/testing-guide.md — Test strategy, fixtures, mocks, evals
-
references/react-chat-component.tsx — Ready-to-use chat UI (React)
-
references/agent-template.ts — Complete agent server template
-
references/non-dev-templates.md — Expanded non-dev templates (support, sales, ops, marketing, finance)
-
references/background-agents.md — Background/autonomous role execution patterns
-
references/self-improving-agents.md — Self-improving agent loop (AlphaEvolve patterns)
Claude Code Skill Authoring Rules
-
Required frontmatter: name (lowercase, hyphen/underscore) and description (what the skill does + when to use).
-
Optional allowed-tools : If present, it limits tools in Claude Code. If omitted, tools are not restricted by the skill file.
-
Keep it short: Aim to keep SKILL.md under ~500 lines and move long content into references/ .
-
Progressive disclosure: Link reference files at most one level deep (no nested reference chains).
Quick Start
Create New Agent
mkdir my-agent && cd my-agent npm init -y npm install @anthropic-ai/claude-agent-sdk express ws
See references/agent-template.ts for a full server template
Deploy to VPS (Short Version)
-
Run scripts/setup-vps.sh once per server
-
Use scripts/deploy-agent.sh per agent
-
See references/architectures.md for multi-agent deployments
Non-Dev Agent Templates
Use these when the goal is business workflow impact, not coding. Expanded templates: references/non-dev-templates.md
- Support Triage Agent
-
Outcome: Accurate category + next action in <2 minutes
-
Trigger: New ticket or chat created
-
Inputs: Ticket text, user plan, last 5 interactions
-
Outputs: Priority, summary, suggested response, escalation flag
-
Tools: Read, Glob, Grep, AskUserQuestion (Write for drafts)
-
Escalate when: Refunds, security, legal threats, angry users
-
Pattern: Single agent → Orchestrator+Subagents as volume grows
- Sales Qualification Agent
-
Outcome: Score lead + recommended next step
-
Trigger: Inbound form or sales inbox message
-
Inputs: Form fields, source, company size, prior contact
-
Outputs: Fit score, objections, suggested reply
-
Tools: Read, Glob, Grep, AskUserQuestion (Write for drafts)
-
Escalate when: Enterprise/legal requirements, pricing exceptions
-
Pattern: Single agent
- Ops Checklist Agent
-
Outcome: Runbook completed with clear status
-
Trigger: Daily/weekly ops cadence, incident checklist
-
Inputs: Runbook file, system status snapshots
-
Outputs: Checklist state, anomalies, recommended follow-ups
-
Tools: Read, Glob, Grep (Bash only if strictly required)
-
Escalate when: Missing signals, critical thresholds
-
Pattern: Single agent (orchestrator if many systems)
- Marketing Content Coordinator
-
Outcome: On-brand draft + distribution checklist
-
Trigger: New campaign request or content brief
-
Inputs: Brand voice docs, product updates, target persona
-
Outputs: Draft content, channel checklist, CTA options
-
Tools: Read, Glob, Grep, Write, Edit
-
Escalate when: Claims need legal review, sensitive topics
-
Pattern: Orchestrator+Subagents (researcher/writer/reviewer)
- Finance/QA Review Agent
-
Outcome: Flag anomalies + recommend follow-ups
-
Trigger: Monthly close, QA batch, or audit request
-
Inputs: Reports, threshold rules, last period baselines
-
Outputs: Findings list, confidence, next steps
-
Tools: Read, Glob, Grep (no Write unless drafting)
-
Escalate when: Large variances, policy violations
-
Pattern: Single agent with strict constraints
Non-Dev Decision Checklist
-
Is the source of truth clear and accessible?
-
What actions are advisory vs automatic?
-
What requires human approval?
-
What are the escalation triggers?
-
What does success look like in measurable terms?
-
What is the lowest-risk toolset that still works?
Background Agents (Summary)
-
Use job-based execution for discrete tasks (events, cron).
-
Use continuous execution for live routing and monitoring.
-
Map authority to an autonomy ladder and enforce escalation rules.
-
See references/background-agents.md for full patterns.
Self-Improving Agents (Summary)
-
Use a generate → evaluate → promote loop with automated evaluators.
-
Treat best prompts/workflows as a "skill library" that evolves over time.
-
Only promote when metrics improve on a fixed regression suite.
-
See references/self-improving-agents.md for AlphaEvolve-inspired patterns.
Architecture Patterns (Summary)
-
Single Agent: One domain, simple logic
-
Orchestrator + Subagents: Multi-specialty, parallel work
-
Multi-Agent Fleet: Separate products/domains
-
See references/architectures.md for the full decision tree
SDK Essentials (Summary)
const response = query({ prompt, options: { systemPrompt, allowedTools: ["Read", "Glob", "Grep", "Task", "Skill"], agents: SUBAGENTS, permissionMode: "acceptEdits", settingSources: ["user", "project"], }, });
-
Task must be in allowedTools for subagents to spawn
-
Subagents cannot spawn other subagents
-
Skills load only with settingSources and "Skill" in allowedTools
-
Include "project" to load CLAUDE.md instructions
-
allowed-tools frontmatter applies to Claude Code CLI, not SDK
-
See references/agent-template.ts for streaming/session examples
Deployment & Ops (Summary)
- Use scripts/setup-vps.sh
- scripts/deploy-agent.sh
-
Put Nginx + SSL in front of web UIs
-
Track latency, cost, and error budgets (references/performance-monitoring.md )
-
Add feedback loops and evals (references/improving-agents.md )
Testing & Quality (Summary)
-
Test behavior, not phrasing; keep regression fixtures
-
Validate tool safety boundaries and escalation paths
-
See references/testing-guide.md for full patterns
Resources
Claude Agent SDK Docs: https://docs.anthropic.com/claude-code/agent-sdk Hetzner Cloud Console: https://console.hetzner.cloud PM2 Documentation: https://pm2.keymetrics.io/docs Cloudflare Tunnel Docs: https://developers.cloudflare.com/cloudflare-one/connections/connect-apps
Scripts
-
scripts/setup-vps.sh
-
Initial VPS configuration
-
scripts/deploy-agent.sh
-
Deploy a new agent
-
scripts/new-agent.sh
-
Scaffold a new agent project