agent-workflow

Agent Workflow Designer

Overview

This skill guides the design and architecture of AI agent workflows using proven methodologies. When a user presents a problem, this skill helps structure an agent-based solution following the 9-step building process and 8-layer architecture framework validated at Meta.

Workflow Decision Tree

When a user shares a problem or requests agent design help:

Assess the problem scope

Is the problem clearly defined? → Proceed to Problem Analysis
Is the problem vague? → Ask clarifying questions about desired outcomes and constraints

Determine architecture complexity

Simple task (single action)? → Single agent with basic tools
Complex task (multiple sub-tasks)? → Consider multi-agent orchestration
Integration task (connecting systems)? → Focus on Layer 4 (Tooling) design

Follow the appropriate workflow

New agent from scratch → Apply 9-Step Building Process
Existing agent improvement → Focus on specific layers needing enhancement
Tool integration problem → Apply MCP and tooling patterns

9-Step Agent Building Process

Use this sequential workflow when designing a new agent from scratch:

Step 1: Define Purpose and Scope

Key principle: Start with job-to-be-done, not technology.

Ask the user:

What specific outcome does the end user need?
What are the constraints (budget, time, resources)?
What's the success metric?

Bad scope example: "An AI assistant for customer service"

Good scope example: "An agent that takes customer complaints, pulls order history from Shopify API, and drafts refund approvals for orders under $200"

Decision point: Narrow scope = better performance. Resist building Swiss Army knives.

Step 2: Structure Inputs and Outputs

Treat the agent as a function with structured interfaces:

Inputs:

Use JSON schemas or Pydantic models, not free text
Define required vs. optional fields
Specify data types and validation rules

Outputs:

Return data objects, not prose
Define clear error states
Include confidence scores when relevant

Example structure:

Input: { "complaint_text": "string", "customer_id": "string", "order_id": "string (optional)" }

Output: { "action": "approve_refund | escalate | request_info", "refund_amount": "number", "reasoning": "string", "confidence": "number" }

Step 3: Write System Instructions

Critical: Spend 80% of design time here.

Include in system prompt:

Role definition: "You are a sales qualification specialist..."
Behavioral guidelines: "Always ask for budget before proposing solutions"
Output format requirements: Specify JSON structure, word limits, tone
Edge case handling: What to do when data is missing or ambiguous

Testing strategy: A great system prompt can make GPT-3.5 outperform poorly prompted GPT-4.

Step 4: Enable Reasoning and External Actions

ReAct Framework Pattern:

Reason: Analyze the current state and decide next action
Act: Call an API, use a tool, or make a decision
Observe: Review the result and determine if goal is achieved

Start simple:

Begin with if/then logic before complex reasoning chains
Add tools incrementally (don't overwhelm with 50 tools at once)
Test each tool integration independently

Common tools to integrate:

Calculators for math operations
Web browsers for research
Database queries for data retrieval
API calls to external systems

Step 5: Orchestrate Multiple Agents (When Needed)

When to use multi-agent architecture:

Task has clearly separable sub-tasks
Different sub-tasks require different expertise
Parallel processing would improve speed

When NOT to use multi-agent:

Simple linear workflows
Tasks that require continuous context
When handoff complexity exceeds benefit

Common 4-agent pattern:

Research Agent: Gathers information from sources
Analysis Agent: Processes and synthesizes data
Writing Agent: Creates structured outputs
QA Agent: Reviews quality and accuracy

Keep handoffs simple: Complex orchestration = complex failures.

Step 6: Implement Memory and Context

Three types of memory to consider:

Conversation history:

What happened this session
Recent user interactions
Current task state

User context:

User preferences and settings
Past interaction patterns
Historical decisions

Knowledge retrieval:

Relevant information from knowledge base
Similar past cases
Domain-specific context

Implementation guidance:

Start with simple conversation buffers
Add vector databases only when needing semantic search across large datasets
Consider memory retrieval latency in architecture

Step 7: Add Multimedia Capabilities

Modern agents should handle:

Voice input/output for accessibility
Image understanding for visual tasks
Document processing (PDF, DOCX, spreadsheets)

Strategic approach: Add capabilities based on actual user needs, not "nice-to-haves."

Step 8: Format and Deliver Results

Output is your product's UX. Design outputs for:

Human consumption:

Clear formatting and structure
Scannable with headers and bullets
Professional appearance

System consumption:

Valid JSON/XML
Consistent field names
Error codes for handling

Quality standard: Great agent outputs look like a human created them.

Step 9: Build Interface or API

Delivery method options:

Chat interface for conversational tasks
API endpoints for system integration
Integration with existing tools (Slack, email, CRM)

Best practice: The best agents feel invisible—they just make things happen.

8-Layer Architecture Framework

When analyzing agent architecture needs, consider which layers require attention:

Layer 1: Infrastructure

Foundation: Cloud, databases, APIs, compute resources

Key considerations:

GPU/TPU requirements for inference
Data storage and retrieval speed
Load balancing for scale
Monitoring and observability

Common mistake: Underestimating compute needs—agents make more API calls than traditional apps.

Layer 2: Agent Internet

Operating system for agents: Identity, state management, inter-agent communication

Current state: Mostly custom-built, but platforms like LangChain and CrewAI are emerging.

Layer 3: Protocol

Standards for interoperability: MCP (Model Context Protocol) is becoming the standard

Key principle: Bet on open standards, not proprietary solutions. MCP allows any tool to work with any agent.

Layer 4: Tooling Enrichment

Agent superpowers: RAG systems, function calling, external integrations

Quality over quantity: 5 rock-solid tools > 50 flaky integrations

Tool categories:

Data retrieval (databases, APIs)
Computation (calculators, processors)
Communication (email, messaging)
Content creation (documents, reports)

Layer 5: Cognition Reasoning

The brain: Planning, decision-making, error handling

Critical elements:

Guardrails to prevent hallucinations
Error recovery strategies
Confidence scoring
Graceful degradation

User forgiveness: Users forgive agents that fail gracefully, not ones that spiral into nonsense.

Layer 6: Memory Personalization

Human touch: Personal context, preferences, conversation history

Start simple: Store user preferences and conversation context before building complex personalization.

Layer 7: Application

User-facing products: The actual agent functionality users interact with

Focus strategy: Nail one use case before expanding to others.

Layer 8: Ops Governance

Risk management: Monitoring, cost control, privacy, oversight

Build from day one: Retrofitting governance is expensive and painful.

Key components:

Cost tracking per agent action
Privacy enforcement and data handling
Human-in-the-loop for critical decisions
Audit logs and compliance

Problem-to-Solution Workflow

When a user presents a problem:

Step 1: Clarify the problem

What's the current manual process?
What's the desired outcome?
What are the constraints (time, cost, technical)?
What data sources are available?

Step 2: Assess agent appropriateness Not every problem needs an agent. Consider:

Is the task repetitive and rule-based?
Does it require decision-making with context?
Would automation provide significant value?
Is the problem scope clear and bounded?

Step 3: Map to architecture Using the 8 layers, identify which need focus:

Simple task → Focus on Layers 4, 5, 7 (tools, reasoning, application)
Complex integration → Add Layer 3 (protocol) emphasis
Scalability concern → Prioritize Layers 1, 8 (infrastructure, ops)

Step 4: Design workflow Apply the 9-step building process, calling out:

Critical decision points
Tool integration requirements
Multi-agent needs (if any)
Memory and context strategy

Step 5: Identify implementation path Based on user's role and resources:

For PMs: High-level architecture and tool selection
For engineers: Detailed technical implementation with code patterns
For product teams: Full stack from requirements to monitoring

Tool Integration Patterns

MCP (Model Context Protocol) Integration

When tools support MCP:

Agent discovers available tools
Agent calls tools using standardized interface
Tool returns structured response
Agent processes and continues workflow

Advantage: Write once, use with any agent.

Custom API Integration

When building custom integrations:

Define clear API contract (inputs/outputs)
Implement error handling and retries
Add rate limiting and caching
Monitor usage and costs
Document for agent consumption

Common Integration Scenarios

CRM Integration (Salesforce, HubSpot):

Read customer data
Create/update records
Search across objects
Trigger workflows

Communication Tools (Slack, Email):

Send messages/notifications
Read incoming requests
Monitor channels
Respond to mentions

Data Sources (Databases, APIs):

Query structured data
Retrieve documents
Search knowledge bases
Aggregate information

Decision Framework: Single vs. Multi-Agent

Use Single Agent When:

Task is linear and sequential
Context must be maintained throughout
Decision-making is unified
Complexity of orchestration > benefit

Use Multi-Agent When:

Clear task separation exists
Sub-tasks need different expertise
Parallel processing improves performance
Quality benefits from specialization

Example - Customer Support:

Single agent sufficient for: "Take customer complaint, pull order history, draft refund approval"

Multi-agent beneficial for: "Monitor social media, categorize issues, research solutions, generate responses, escalate critical cases, track resolution"

Common Pitfalls and Solutions

Pitfall 1: Scope Creep

Problem: Trying to build a general-purpose assistant Solution: Define narrow, specific job-to-be-done with clear success metrics

Pitfall 2: Tool Overload

Problem: Giving agent 50+ tools upfront Solution: Start with 5 essential tools, add incrementally based on actual needs

Pitfall 3: Skipping System Prompt

Problem: Generic or minimal instructions Solution: Invest 80% of time crafting detailed system prompt with examples and edge cases

Pitfall 4: No Error Handling

Problem: Agent breaks on unexpected inputs Solution: Design graceful degradation, clear error states, and fallback behaviors

Pitfall 5: Ignoring Costs

Problem: Runaway API costs from inefficient agent design Solution: Build cost monitoring from day one, implement caching, optimize prompt length

Pitfall 6: Over-Engineering Architecture

Problem: Building all 8 layers simultaneously Solution: Start with Layers 4, 5, 7 (tools, reasoning, application), add others as needed

Output Format

When providing agent workflow solutions, structure the response as:

Problem Restatement: Confirm understanding of the user's need
Agent Architecture Recommendation: Single vs. multi-agent, with rationale
Step-by-Step Workflow: Apply relevant steps from the 9-step process
Tool Integration Plan: Specific tools needed and integration approach
Layer Analysis: Which of the 8 layers need focus and why
Implementation Guidance: Prioritized next steps based on user's role
Success Metrics: How to measure if the agent is working

Agent Taxonomy Quick Reference

When users ask about existing tools:

Category 1: Consumer Agents (Built-In)

Examples: ChatGPT Agent, Claude, Gemini, Grok
Best for: Quick tasks, research, content creation
User type: Everyone, especially PMs

Category 2: No-Code Builders

Examples: Zapier Central, n8n, Make
Best for: Workflow automation without coding
User type: PMs, operations teams

Category 3: Developer-First Platforms

Examples: LangChain, CrewAI, AutoGen, Swarm
Best for: Custom agent features in products
User type: Engineering teams

Category 4: Specialized Agent Apps

Examples: Cursor (coding), Perplexity (research), Notion AI (writing)
Best for: Specific job-to-be-done with deep specialization
User type: Domain-specific professionals

agent-workflow

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

context-engineering

HITL Protocol — Workflows and Human Decisions for Autonomous Agents

agent-workflow