Agent Workflow Designer
Overview
This skill guides the design and architecture of AI agent workflows. When a user presents a problem, it helps structure an agent-based solution using the 9-step building process and the 8-layer architecture framework validated at Meta.
Workflow Decision Tree
When a user shares a problem or requests agent design help:
- •Assess the problem scope
  - •Is the problem clearly defined? → Proceed to the Problem-to-Solution Workflow
  - •Is the problem vague? → Ask clarifying questions about desired outcomes and constraints
- •Determine architecture complexity
  - •Simple task (single action)? → Single agent with basic tools
  - •Complex task (multiple sub-tasks)? → Consider multi-agent orchestration
  - •Integration task (connecting systems)? → Focus on Layer 4 (Tooling) design
- •Follow the appropriate workflow
  - •New agent from scratch → Apply 9-Step Building Process
  - •Existing agent improvement → Focus on specific layers needing enhancement
  - •Tool integration problem → Apply MCP and tooling patterns
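The first two branches of the tree can be sketched as a small lookup. This is only an illustration: `TaskKind`, `ARCHITECTURE`, and `next_step` are hypothetical names, and deciding whether a problem is "clear" still takes human or model judgment.

```python
from enum import Enum


class TaskKind(Enum):
    SIMPLE = "simple"            # single action
    COMPLEX = "complex"          # multiple sub-tasks
    INTEGRATION = "integration"  # connecting systems


# Recommendations copied from the decision tree above.
ARCHITECTURE = {
    TaskKind.SIMPLE: "single agent with basic tools",
    TaskKind.COMPLEX: "consider multi-agent orchestration",
    TaskKind.INTEGRATION: "focus on Layer 4 (Tooling) design",
}


def next_step(problem_is_clear: bool, task_kind: TaskKind) -> str:
    """Return the next design step for a problem of the given kind."""
    if not problem_is_clear:
        # A vague problem is clarified before any architecture is chosen.
        return "ask clarifying questions about desired outcomes and constraints"
    return ARCHITECTURE[task_kind]
```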
9-Step Agent Building Process
Use this sequential workflow when designing a new agent from scratch:
Step 1: Define Purpose and Scope
Key principle: Start with job-to-be-done, not technology.
Ask the user:
- •What specific outcome does the end user need?
- •What are the constraints (budget, time, resources)?
- •What's the success metric?
Bad scope example: "An AI assistant for customer service"
Good scope example: "An agent that takes customer complaints, pulls order history from Shopify API, and drafts refund approvals for orders under $200"
Decision point: Narrow scope = better performance. Resist building Swiss Army knives.
Step 2: Structure Inputs and Outputs
Treat the agent as a function with structured interfaces:
Inputs:
- •Use JSON schemas or Pydantic models, not free text
- •Define required vs. optional fields
- •Specify data types and validation rules
Outputs:
- •Return data objects, not prose
- •Define clear error states
- •Include confidence scores when relevant
Example structure:
Input: {
  "complaint_text": "string",
  "customer_id": "string",
  "order_id": "string (optional)"
}
Output: {
  "action": "approve_refund | escalate | request_info",
  "refund_amount": "number",
  "reasoning": "string",
  "confidence": "number"
}
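One way to turn the structure above into validated types is sketched below with standard-library dataclasses; Pydantic models would serve the same purpose. The class names and the validation rules (non-empty required fields, non-negative refund, confidence between 0 and 1) are assumptions added for illustration, not part of the original structure.

```python
from dataclasses import dataclass
from typing import Optional

ALLOWED_ACTIONS = {"approve_refund", "escalate", "request_info"}


@dataclass(frozen=True)
class ComplaintInput:
    complaint_text: str
    customer_id: str
    order_id: Optional[str] = None  # optional field

    def __post_init__(self) -> None:
        # Reject empty required fields before they ever reach the model.
        if not self.complaint_text.strip():
            raise ValueError("complaint_text must not be empty")
        if not self.customer_id.strip():
            raise ValueError("customer_id must not be empty")


@dataclass(frozen=True)
class RefundDecision:
    action: str
    refund_amount: float
    reasoning: str
    confidence: float

    def __post_init__(self) -> None:
        if self.action not in ALLOWED_ACTIONS:
            raise ValueError(f"unknown action: {self.action!r}")
        if self.refund_amount < 0:
            raise ValueError("refund_amount must be non-negative")
        # Assumed convention: confidence is a probability-like score in [0, 1].
        if not 0.0 <= self.confidence <= 1.0:
            raise ValueError("confidence must be between 0 and 1")
```

Constructing `RefundDecision` from the model's parsed output makes a malformed response fail at the boundary instead of deeper in the workflow.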
Step 3: Write System Instructions
Critical: Spend 80% of design time here.
Include in system prompt:
- •Role definition: "You are a sales qualification specialist..."
- •Behavioral guidelines: "Always ask for budget before proposing solutions"
- •Output format requirements: Specify JSON structure, word limits, tone
- •Edge case handling: What to do when data is missing or ambiguous
Why it matters: A well-crafted system prompt can let a weaker model such as GPT-3.5 outperform a poorly prompted stronger model such as GPT-4.
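The four ingredients listed above can be assembled from separate pieces so each one is reviewed and edited on its own. The sketch below is a minimal illustration; `build_system_prompt` and the JSON reply shape in the example are hypothetical.

```python
from typing import List


def build_system_prompt(
    role: str,
    guidelines: List[str],
    output_format: str,
    edge_cases: List[str],
) -> str:
    """Join role, guidelines, output format, and edge cases into one prompt."""

    def bullets(items: List[str]) -> str:
        return "\n".join(f"- {item}" for item in items)

    sections = [
        role,
        "Behavioral guidelines:\n" + bullets(guidelines),
        "Output format:\n" + output_format,
        "Edge cases:\n" + bullets(edge_cases),
    ]
    return "\n\n".join(sections)


prompt = build_system_prompt(
    role="You are a sales qualification specialist.",
    guidelines=["Always ask for budget before proposing solutions."],
    # Hypothetical output schema, for illustration only.
    output_format='Reply with JSON: {"qualified": true|false, "reason": "..."}',
    edge_cases=["If the budget is missing or ambiguous, ask one clarifying question."],
)
```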
Step 4: Enable Reasoning and External Actions
ReAct Framework Pattern:
- •Reason: Analyze the current state and decide next action
- •Act: Call an API, use a tool, or make a decision
- •Observe: Review the result and determine if goal is achieved
Start simple:
- •Begin with if/then logic before complex reasoning chains
- •Add tools incrementally (don't overwhelm with 50 tools at once)
- •Test each tool integration independently
Common tools to integrate:
- •Calculators for math operations
- •Web browsers for research
- •Database queries for data retrieval
- •API calls to external systems
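The Reason, Act, Observe cycle can be sketched as a short loop. In this sketch the `reason` function is a hard-coded stand-in for a model call (it computes (2 + 3) * 4 with two calculator tools); the tool names and the step limit are assumptions.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Two calculator tools; a real agent would register tools incrementally.
TOOLS: Dict[str, Callable[[float, float], float]] = {
    "add": lambda a, b: a + b,
    "multiply": lambda a, b: a * b,
}

Action = Tuple[str, Tuple[float, float]]


def reason(observations: List[float]) -> Optional[Action]:
    """Stand-in for the model: pick the next tool call, or None when done."""
    if not observations:
        return ("add", (2.0, 3.0))
    if len(observations) == 1:
        return ("multiply", (observations[0], 4.0))
    return None  # goal reached


def run_agent(max_steps: int = 5) -> List[float]:
    observations: List[float] = []
    for _ in range(max_steps):  # hard cap so a confused policy cannot loop forever
        decision = reason(observations)    # Reason
        if decision is None:
            break
        tool_name, args = decision
        result = TOOLS[tool_name](*args)   # Act
        observations.append(result)        # Observe
    return observations


print(run_agent())  # [5.0, 20.0]
```

Because each tool is a plain function in a registry, it can be tested independently before it is wired into the loop.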
Step 5: Orchestrate Multiple Agents (When Needed)
When to use multi-agent architecture:
- •Task has clearly separable sub-tasks
- •Different sub-tasks require different expertise
- •Parallel processing would improve speed
When NOT to use multi-agent:
- •Simple linear workflows
- •Tasks that require continuous context
- •When handoff complexity exceeds benefit
Common 4-agent pattern:
- •Research Agent: Gathers information from sources
- •Analysis Agent: Processes and synthesizes data
- •Writing Agent: Creates structured outputs
- •QA Agent: Reviews quality and accuracy
Keep handoffs simple: Complex orchestration = complex failures.
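A simple handoff can be as plain as passing one dictionary through the four agents in order. In the sketch below every agent is a stub function and the key names (`notes`, `summary`, `draft`, `approved`) are hypothetical; real agents would call a model at each stage.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]


def research_agent(state: State) -> State:
    topic = state["topic"]
    return {**state, "notes": [f"fact about {topic} #1", f"fact about {topic} #2"]}


def analysis_agent(state: State) -> State:
    summary = f"{len(state['notes'])} notes collected on {state['topic']}"
    return {**state, "summary": summary}


def writing_agent(state: State) -> State:
    return {**state, "draft": f"Report on {state['topic']}: {state['summary']}."}


def qa_agent(state: State) -> State:
    # Minimal check: the draft mentions the topic and is backed by notes.
    approved = state["topic"] in state["draft"] and bool(state["notes"])
    return {**state, "approved": approved}


PIPELINE: List[Callable[[State], State]] = [
    research_agent,
    analysis_agent,
    writing_agent,
    qa_agent,
]


def run_pipeline(topic: str) -> State:
    state: State = {"topic": topic}
    for agent in PIPELINE:
        state = agent(state)  # each handoff is a single dict, nothing else
    return state
```

Each agent returns a new dictionary rather than mutating shared state, which keeps a failing stage easy to isolate.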
Step 6: Implement Memory and Context
Three types of memory to consider:
Conversation history:
- •What happened this session
- •Recent user interactions
- •Current task state
User context:
- •User preferences and settings
- •Past interaction patterns
- •Historical decisions
Knowledge retrieval:
- •Relevant information from knowledge base
- •Similar past cases
- •Domain-specific context
Implementation guidance:
- •Start with simple conversation buffers
- •Add vector databases only when needing semantic search across large datasets
- •Consider memory retrieval latency in architecture
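A simple conversation buffer, the suggested starting point, might look like the sketch below. The class name and the default of 20 turns are assumptions; the role/content message shape is a common convention rather than a requirement.

```python
from collections import deque
from typing import Deque, Dict, List, Tuple


class ConversationBuffer:
    """Keep only the most recent turns of the current session."""

    def __init__(self, max_turns: int = 20) -> None:
        # deque with maxlen drops the oldest turn automatically when full.
        self._turns: Deque[Tuple[str, str]] = deque(maxlen=max_turns)

    def add(self, role: str, content: str) -> None:
        self._turns.append((role, content))

    def as_messages(self) -> List[Dict[str, str]]:
        return [{"role": role, "content": content} for role, content in self._turns]
```

A bounded buffer also keeps retrieval latency and prompt length predictable, which matters before any vector database is introduced.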
Step 7: Add Multimedia Capabilities
Modern agents should handle:
- •Voice input/output for accessibility
- •Image understanding for visual tasks
- •Document processing (PDF, DOCX, spreadsheets)
Strategic approach: Add capabilities based on actual user needs, not "nice-to-haves."
Step 8: Format and Deliver Results
Output is your product's UX. Design outputs for:
Human consumption:
- •Clear formatting and structure
- •Scannable with headers and bullets
- •Professional appearance
System consumption:
- •Valid JSON/XML
- •Consistent field names
- •Error codes for handling
Quality standard: Great agent outputs look like a human created them.
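The same result can be rendered twice, once for systems and once for people. The sketch below assumes a result dictionary shaped like the Step 2 example, plus a hypothetical `error_code` field for failures.

```python
import json
from typing import Any, Dict


def to_system_output(result: Dict[str, Any]) -> str:
    """Serialize for another program: valid JSON with a stable key order."""
    return json.dumps(result, sort_keys=True)


def to_human_output(result: Dict[str, Any]) -> str:
    """Render the same data as a short, scannable summary."""
    if result.get("error_code"):
        return f"Could not complete the request (error {result['error_code']})."
    lines = [f"Decision: {result['action']}"]
    if "refund_amount" in result:
        lines.append(f"Refund amount: ${result['refund_amount']:.2f}")
    lines.append(f"Why: {result['reasoning']}")
    return "\n".join(lines)
```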
Step 9: Build Interface or API
Delivery method options:
- •Chat interface for conversational tasks
- •API endpoints for system integration
- •Integration with existing tools (Slack, email, CRM)
Best practice: The best agents feel invisible—they just make things happen.
8-Layer Architecture Framework
When analyzing agent architecture needs, consider which layers require attention:
Layer 1: Infrastructure
Foundation: Cloud, databases, APIs, compute resources
Key considerations:
- •GPU/TPU requirements for inference
- •Data storage and retrieval speed
- •Load balancing for scale
- •Monitoring and observability
Common mistake: Underestimating compute needs—agents make more API calls than traditional apps.
Layer 2: Agent Internet
Operating system for agents: Identity, state management, inter-agent communication
Current state: Mostly custom-built, but platforms like LangChain and CrewAI are emerging.
Layer 3: Protocol
Standards for interoperability: MCP (Model Context Protocol) is becoming the standard
Key principle: Bet on open standards, not proprietary solutions. MCP lets any MCP-compatible tool work with any MCP-compatible agent.
Layer 4: Tooling & Enrichment
Agent superpowers: RAG systems, function calling, external integrations
Quality over quantity: 5 rock-solid tools > 50 flaky integrations
Tool categories:
- •Data retrieval (databases, APIs)
- •Computation (calculators, processors)
- •Communication (email, messaging)
- •Content creation (documents, reports)
Layer 5: Cognition & Reasoning
The brain: Planning, decision-making, error handling
Critical elements:
- •Guardrails to reduce hallucinations
- •Error recovery strategies
- •Confidence scoring
- •Graceful degradation
User forgiveness: Users forgive agents that fail gracefully, not ones that spiral into nonsense.
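Error recovery, confidence scoring, and graceful degradation can be combined in one wrapper, sketched below. The 0.7 threshold and the choice to escalate to a human as the fallback are assumptions; `primary` stands in for whatever model or tool call produces an answer with a confidence score.

```python
from typing import Any, Callable, Dict, Tuple


def decide_with_fallback(
    primary: Callable[[], Tuple[str, float]],
    min_confidence: float = 0.7,  # assumed threshold, tune per use case
) -> Dict[str, Any]:
    """Run the primary path; degrade to escalation instead of guessing."""
    try:
        answer, confidence = primary()
    except Exception as exc:
        # Error recovery: report the failure in a structured way.
        return {"action": "escalate", "reason": f"primary path failed: {exc}"}
    if confidence < min_confidence:
        # Graceful degradation: a low-confidence answer is not shown to the user.
        return {"action": "escalate", "reason": f"low confidence ({confidence:.2f})"}
    return {"action": "answer", "answer": answer, "confidence": confidence}
```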
Layer 6: Memory & Personalization
Human touch: Personal context, preferences, conversation history
Start simple: Store user preferences and conversation context before building complex personalization.
Layer 7: Application
User-facing products: The actual agent functionality users interact with
Focus strategy: Nail one use case before expanding to others.
Layer 8: Ops & Governance
Risk management: Monitoring, cost control, privacy, oversight
Build from day one: Retrofitting governance is expensive and painful.
Key components:
- •Cost tracking per agent action
- •Privacy enforcement and data handling
- •Human-in-the-loop for critical decisions
- •Audit logs and compliance
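Cost tracking, audit logging, and a human-in-the-loop gate can start very small. In the sketch below, `CostTracker`, `CRITICAL_ACTIONS`, and the dollar figures are hypothetical; real costs would come from the model provider's usage data.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Set

# Assumed policy: these actions always need a human sign-off.
CRITICAL_ACTIONS: Set[str] = {"approve_refund"}


@dataclass
class CostTracker:
    budget_usd: float
    spent_usd: float = 0.0
    audit_log: List[Dict[str, Any]] = field(default_factory=list)

    def record(self, action: str, cost_usd: float) -> bool:
        """Log one agent action; return False once the budget is exceeded."""
        self.spent_usd += cost_usd
        within_budget = self.spent_usd <= self.budget_usd
        self.audit_log.append(
            {"action": action, "cost_usd": cost_usd, "within_budget": within_budget}
        )
        return within_budget


def needs_human_approval(action: str) -> bool:
    return action in CRITICAL_ACTIONS
```

Calling `record` on every model or tool call gives per-action cost data and an audit trail from the first day.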
Problem-to-Solution Workflow
When a user presents a problem:
Step 1: Clarify the problem
- •What's the current manual process?
- •What's the desired outcome?
- •What are the constraints (time, cost, technical)?
- •What data sources are available?
Step 2: Assess agent appropriateness
Not every problem needs an agent. Consider:
- •Is the task repetitive and rule-based?
- •Does it require decision-making with context?
- •Would automation provide significant value?
- •Is the problem scope clear and bounded?
Step 3: Map to architecture
Using the 8 layers, identify which need focus:
- •Simple task → Focus on Layers 4, 5, 7 (tools, reasoning, application)
- •Complex integration → Add Layer 3 (protocol) emphasis
- •Scalability concern → Prioritize Layers 1, 8 (infrastructure, ops)
Step 4: Design workflow
Apply the 9-step building process, calling out:
- •Critical decision points
- •Tool integration requirements
- •Multi-agent needs (if any)
- •Memory and context strategy
Step 5: Identify implementation path
Based on user's role and resources:
- •For PMs: High-level architecture and tool selection
- •For engineers: Detailed technical implementation with code patterns
- •For product teams: Full stack from requirements to monitoring
Tool Integration Patterns
MCP (Model Context Protocol) Integration
When tools support MCP:
- •Agent discovers available tools
- •Agent calls tools using standardized interface
- •Tool returns structured response
- •Agent processes and continues workflow
Advantage: Write once, use with any agent.
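The discover, call, respond, continue flow above can be illustrated with an in-memory stand-in. `InMemoryToolServer` and its `list_tools` and `call_tool` methods are hypothetical names chosen for this sketch; this is not the MCP SDK, and a real integration would go through an MCP client library.

```python
from typing import Any, Callable, Dict, List, Tuple


class InMemoryToolServer:
    """Hypothetical stand-in for a tool server with a uniform interface."""

    def __init__(self) -> None:
        self._tools: Dict[str, Tuple[str, Callable[..., Any]]] = {}

    def register(self, name: str, description: str, handler: Callable[..., Any]) -> None:
        self._tools[name] = (description, handler)

    def list_tools(self) -> List[Dict[str, str]]:
        # Discovery: the agent learns what is available at runtime.
        return [{"name": n, "description": d} for n, (d, _) in self._tools.items()]

    def call_tool(self, name: str, arguments: Dict[str, Any]) -> Dict[str, Any]:
        # Every call returns the same structured shape, success or failure.
        if name not in self._tools:
            return {"is_error": True, "content": f"unknown tool: {name}"}
        _, handler = self._tools[name]
        try:
            return {"is_error": False, "content": handler(**arguments)}
        except Exception as exc:
            return {"is_error": True, "content": str(exc)}


server = InMemoryToolServer()
server.register("order_total", "Sum the prices of an order", lambda prices: sum(prices))

available = [tool["name"] for tool in server.list_tools()]         # discover
response = server.call_tool("order_total", {"prices": [10, 5]})    # call
next_input = None if response["is_error"] else response["content"]  # process and continue
```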
Custom API Integration
When building custom integrations:
- •Define clear API contract (inputs/outputs)
- •Implement error handling and retries
- •Add rate limiting and caching
- •Monitor usage and costs
- •Document for agent consumption
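Retries and caching from the checklist above can be sketched as follows. The names, the choice of `ConnectionError` as the retryable error, and the backoff values are assumptions; rate limiting and cost monitoring are left out to keep the example short.

```python
import time
from typing import Any, Callable, Dict


def call_with_retries(
    fn: Callable[[], Any],
    max_attempts: int = 3,      # must be at least 1
    base_delay_s: float = 0.5,
) -> Any:
    """Retry transient connection failures with exponential backoff."""
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except ConnectionError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the error to the caller
            time.sleep(base_delay_s * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")


class CachedClient:
    """Wrap a fetch function so repeated lookups do not hit the API again."""

    def __init__(self, fetch: Callable[[str], Any], base_delay_s: float = 0.5) -> None:
        self._fetch = fetch
        self._base_delay_s = base_delay_s
        self._cache: Dict[str, Any] = {}

    def get(self, key: str) -> Any:
        if key not in self._cache:
            self._cache[key] = call_with_retries(
                lambda: self._fetch(key), base_delay_s=self._base_delay_s
            )
        return self._cache[key]
```

Only successful responses are cached, so a failed lookup is retried on the next `get` rather than remembered.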
Common Integration Scenarios
CRM Integration (Salesforce, HubSpot):
- •Read customer data
- •Create/update records
- •Search across objects
- •Trigger workflows
Communication Tools (Slack, Email):
- •Send messages/notifications
- •Read incoming requests
- •Monitor channels
- •Respond to mentions
Data Sources (Databases, APIs):
- •Query structured data
- •Retrieve documents
- •Search knowledge bases
- •Aggregate information
Decision Framework: Single vs. Multi-Agent
Use Single Agent When:
- •Task is linear and sequential
- •Context must be maintained throughout
- •Decision-making is unified
- •Orchestration complexity outweighs the benefit
Use Multi-Agent When:
- •Clear task separation exists
- •Sub-tasks need different expertise
- •Parallel processing improves performance
- •Quality benefits from specialization
Example - Customer Support:
Single agent sufficient for: "Take customer complaint, pull order history, draft refund approval"
Multi-agent beneficial for: "Monitor social media, categorize issues, research solutions, generate responses, escalate critical cases, track resolution"
Common Pitfalls and Solutions
Pitfall 1: Scope Creep
Problem: Trying to build a general-purpose assistant
Solution: Define narrow, specific job-to-be-done with clear success metrics
Pitfall 2: Tool Overload
Problem: Giving agent 50+ tools upfront
Solution: Start with 5 essential tools, add incrementally based on actual needs
Pitfall 3: Skipping System Prompt
Problem: Generic or minimal instructions
Solution: Invest 80% of time crafting detailed system prompt with examples and edge cases
Pitfall 4: No Error Handling
Problem: Agent breaks on unexpected inputs
Solution: Design graceful degradation, clear error states, and fallback behaviors
Pitfall 5: Ignoring Costs
Problem: Runaway API costs from inefficient agent design
Solution: Build cost monitoring from day one, implement caching, optimize prompt length
Pitfall 6: Over-Engineering Architecture
Problem: Building all 8 layers simultaneously
Solution: Start with Layers 4, 5, 7 (tools, reasoning, application), add others as needed
Output Format
When providing agent workflow solutions, structure the response as:
- •Problem Restatement: Confirm understanding of the user's need
- •Agent Architecture Recommendation: Single vs. multi-agent, with rationale
- •Step-by-Step Workflow: Apply relevant steps from the 9-step process
- •Tool Integration Plan: Specific tools needed and integration approach
- •Layer Analysis: Which of the 8 layers need focus and why
- •Implementation Guidance: Prioritized next steps based on user's role
- •Success Metrics: How to measure if the agent is working
Agent Taxonomy Quick Reference
When users ask about existing tools:
Category 1: Consumer Agents (Built-In)
- •Examples: ChatGPT Agent, Claude, Gemini, Grok
- •Best for: Quick tasks, research, content creation
- •User type: Everyone, especially PMs
Category 2: No-Code Builders
- •Examples: Zapier Central, n8n, Make
- •Best for: Workflow automation without coding
- •User type: PMs, operations teams
Category 3: Developer-First Platforms
- •Examples: LangChain, CrewAI, AutoGen, Swarm
- •Best for: Custom agent features in products
- •User type: Engineering teams
Category 4: Specialized Agent Apps
- •Examples: Cursor (coding), Perplexity (research), Notion AI (writing)
- •Best for: Specific job-to-be-done with deep specialization
- •User type: Domain-specific professionals