Agent Development Workflow

Complete Standard Operating Procedure (SOP) for building production-ready goal-driven agents.

Overview

This workflow orchestrates specialized skills to take you from initial concept to production-ready agent:

Understand Concepts → /building-agents-core (optional)
Build Structure → /building-agents-construction
Optimize Design → /building-agents-patterns (optional)
Setup Credentials → /setup-credentials (if agent uses tools requiring API keys)
Test & Validate → /testing-agent

When to Use This Workflow

Use this meta-skill when:

Starting a new agent from scratch
Unclear which skill to use first
Need end-to-end guidance for agent development
Want consistent, repeatable agent builds

Skip this workflow if:

You only need to test an existing agent → use /testing-agent directly
You know exactly which phase you're in → use specific skill directly

Quick Decision Tree

"Need to understand agent concepts" → building-agents-core "Build a new agent" → building-agents-construction "Optimize my agent design" → building-agents-patterns "Set up API keys for my agent" → setup-credentials "Test my agent" → testing-agent "Not sure what I need" → Read phases below, then decide "Agent has structure but needs implementation" → See agent directory STATUS.md

Phase 0: Understand Concepts (Optional)

Duration: 5-10 minutes Skill: /building-agents-core

Input: Questions about agent architecture

When to Use

First time building an agent
Need to understand node types, edges, goals
Want to validate tool availability
Learning about pause/resume architecture

What This Phase Provides

Architecture overview (Python packages, not JSON)
Core concepts (Goal, Node, Edge, Pause/Resume)
Tool discovery and validation procedures
Workflow overview

Skip this phase if you already understand agent fundamentals.

Phase 1: Build Agent Structure

Duration: 15-30 minutes Skill: /building-agents-construction

Input: User requirements ("Build an agent that...")

What This Phase Does

Creates the complete agent architecture:

Package structure (exports/agent_name/ )
Goal with success criteria and constraints
Workflow graph (nodes and edges)
Node specifications
CLI interface
Documentation

Process

Create package - Directory structure with skeleton files
Define goal - Success criteria and constraints written to agent.py
Design nodes - Each node approved and written incrementally
Connect edges - Workflow graph with conditional routing
Finalize - Agent class, exports, and documentation

Outputs

✅ exports/agent_name/ package created
✅ Goal defined in agent.py
✅ 3-5 success criteria defined
✅ 1-5 constraints defined
✅ 5-10 nodes specified in nodes/init.py
✅ 8-15 edges connecting workflow
✅ Validated structure (passes python -m agent_name validate )
✅ README.md with usage instructions
✅ CLI commands (info, validate, run, shell)

Success Criteria

You're ready for Phase 2 when:

Agent structure validates without errors
All nodes and edges are defined
CLI commands work (info, validate)
You see: "Agent complete: exports/agent_name/"

Common Outputs

The building-agents-construction skill produces:

exports/agent_name/ ├── init.py (package exports) ├── main.py (CLI interface) ├── agent.py (goal, graph, agent class) ├── nodes/init.py (node specifications) ├── config.py (configuration) ├── implementations.py (may be created for Python functions) └── README.md (documentation)

Next Steps

If structure complete and validated: → Check exports/agent_name/STATUS.md or IMPLEMENTATION_GUIDE.md

→ These files explain implementation options → You may need to add Python functions or MCP tools (not covered by current skills)

If want to optimize design: → Proceed to Phase 1.5 (building-agents-patterns)

If ready to test: → Proceed to Phase 2

Phase 1.5: Optimize Design (Optional)

Duration: 10-15 minutes Skill: /building-agents-patterns

Input: Completed agent structure

When to Use

Want to add pause/resume functionality
Need error handling patterns
Want to optimize performance
Need examples of complex routing
Want best practices guidance

What This Phase Provides

Practical examples and patterns
Pause/resume architecture
Error handling strategies
Anti-patterns to avoid
Performance optimization techniques

Skip this phase if your agent design is straightforward.

Phase 2: Test & Validate

Duration: 20-40 minutes Skill: /testing-agent

Input: Working agent from Phase 1

What This Phase Does

Creates comprehensive test suite:

Constraint tests (verify hard requirements)
Success criteria tests (measure goal achievement)
Edge case tests (handle failures gracefully)
Integration tests (end-to-end workflows)

Process

Analyze agent - Read goal, constraints, success criteria
Generate tests - Create pytest files in exports/agent_name/tests/
User approval - Review and approve each test
Run evaluation - Execute tests and collect results
Debug failures - Identify and fix issues
Iterate - Repeat until all tests pass

Outputs

✅ Test files in exports/agent_name/tests/
✅ Test report with pass/fail metrics
✅ Coverage of all success criteria
✅ Coverage of all constraints
✅ Edge case handling verified

Success Criteria

You're done when:

All tests pass
All success criteria validated
All constraints verified
Agent handles edge cases
Test coverage is comprehensive

Next Steps

Agent ready for:

Production deployment
Integration into larger systems
Documentation and handoff
Continuous monitoring

Phase Transitions

From Phase 1 to Phase 2

Trigger signals:

"Agent complete: exports/..."
Structure validation passes
README indicates implementation complete

Before proceeding:

Verify agent can be imported: from exports.agent_name import default_agent
Check if implementation is needed (see STATUS.md or IMPLEMENTATION_GUIDE.md)
Confirm agent executes without import errors

Skipping Phases

When to skip Phase 1:

Agent structure already exists
Only need to add tests
Modifying existing agent

When to skip Phase 2:

Prototyping or exploring
Agent not production-bound
Manual testing sufficient

Common Patterns

Pattern 1: Complete New Build (Simple)

User: "Build an agent that monitors files" → Use /building-agents-construction → Agent structure created → Use /testing-agent → Tests created and passing → Done: Production-ready agent

Pattern 1b: Complete New Build (With Learning)

User: "Build an agent (first time)" → Use /building-agents-core (understand concepts) → Use /building-agents-construction (build structure) → Use /building-agents-patterns (optimize design) → Use /testing-agent (validate) → Done: Production-ready agent

Pattern 2: Test Existing Agent

User: "Test my agent at exports/my_agent" → Skip Phase 1 → Use /testing-agent directly → Tests created → Done: Validated agent

Pattern 3: Iterative Development

User: "Build an agent" → Use /building-agents-construction (Phase 1) → Implementation needed (see STATUS.md) → [User implements functions] → Use /testing-agent (Phase 2) → Tests reveal bugs → [Fix bugs manually] → Re-run tests → Done: Working agent

Pattern 4: Complex Agent with Patterns

User: "Build an agent with multi-turn conversations" → Use /building-agents-core (learn pause/resume) → Use /building-agents-construction (build structure) → Use /building-agents-patterns (implement pause/resume pattern) → Use /testing-agent (validate conversation flows) → Done: Complex conversational agent

Skill Dependencies

agent-workflow (meta-skill) │ ├── building-agents-core (foundational) │ ├── Architecture concepts │ ├── Node/Edge/Goal definitions │ ├── Tool discovery procedures │ └── Workflow overview │ ├── building-agents-construction (procedural) │ ├── Creates package structure │ ├── Defines goal │ ├── Adds nodes incrementally │ ├── Connects edges │ ├── Finalizes agent class │ └── Requires: building-agents-core │ ├── building-agents-patterns (reference) │ ├── Best practices │ ├── Pause/resume patterns │ ├── Error handling │ ├── Anti-patterns │ └── Performance optimization │ └── testing-agent ├── Reads agent goal ├── Generates tests ├── Runs evaluation └── Reports results

Troubleshooting

"Agent structure won't validate"

Check node IDs match between nodes/init.py and agent.py
Verify all edges reference valid node IDs
Ensure entry_node exists in nodes list
Run: PYTHONPATH=core:exports python -m agent_name validate

"Agent has structure but won't run"

Check for STATUS.md or IMPLEMENTATION_GUIDE.md in agent directory
Implementation may be needed (Python functions or MCP tools)
This is expected - building-agents-construction creates structure, not implementation
See implementation guide for completion options

"Tests are failing"

Review test output for specific failures
Check agent goal and success criteria
Verify constraints are met
Use /testing-agent to debug and iterate
Fix agent code and re-run tests

"Not sure which phase I'm in"

Run these checks:

Check if agent structure exists

ls exports/my_agent/agent.py

Check if it validates

PYTHONPATH=core:exports python -m my_agent validate

Check if tests exist

ls exports/my_agent/tests/

If structure exists and validates → Phase 2 (testing)

If structure doesn't exist → Phase 1 (building)

If tests exist but failing → Debug phase

Best Practices

For Phase 1 (Building)

Start with clear requirements - Know what the agent should do
Define success criteria early - Measurable goals drive design
Keep nodes focused - One responsibility per node
Use descriptive names - Node IDs should explain purpose
Validate incrementally - Check structure after each major addition

For Phase 2 (Testing)

Test constraints first - Hard requirements must pass
Mock external dependencies - Use mock mode for LLMs/APIs
Cover edge cases - Test failures, not just success paths
Iterate quickly - Fix one test at a time
Document test patterns - Future tests follow same structure

General Workflow

Use version control - Git commit after each phase
Document decisions - Update README with changes
Keep iterations small - Build → Test → Fix → Repeat
Preserve working states - Tag successful iterations
Learn from failures - Failed tests reveal design issues

Exit Criteria

You're done with the workflow when:

✅ Agent structure validates ✅ All tests pass ✅ Success criteria met ✅ Constraints verified ✅ Documentation complete ✅ Agent ready for deployment

Additional Resources

building-agents-core: See .claude/skills/building-agents-core/SKILL.md
building-agents-construction: See .claude/skills/building-agents-construction/SKILL.md
building-agents-patterns: See .claude/skills/building-agents-patterns/SKILL.md
testing-agent: See .claude/skills/testing-agent/SKILL.md
Agent framework docs: See core/README.md
Example agents: See exports/ directory

Summary

This workflow provides a proven path from concept to production-ready agent:

Learn with /building-agents-core → Understand fundamentals (optional)
Build with /building-agents-construction → Get validated structure
Optimize with /building-agents-patterns → Apply best practices (optional)
Test with /testing-agent → Get verified functionality

The workflow is flexible - skip phases as needed, iterate freely, and adapt to your specific requirements. The goal is production-ready agents built with consistent, repeatable processes.

Skill Selection Guide

Choose building-agents-core when:

First time building agents
Need to understand architecture
Validating tool availability
Learning about node types and edges

Choose building-agents-construction when:

Actually building an agent
Have clear requirements
Ready to write code
Want step-by-step guidance

Choose building-agents-patterns when:

Agent structure complete
Need advanced patterns
Implementing pause/resume
Optimizing performance
Want best practices

Choose testing-agent when:

Agent structure complete
Ready to validate functionality
Need comprehensive test coverage
Debugging agent behavior

agent-workflow

Safety Notice

Copy this and send it to your AI assistant to learn

Check if agent structure exists

Check if it validates

Check if tests exist

If structure exists and validates → Phase 2 (testing)

If structure doesn't exist → Phase 1 (building)

If tests exist but failing → Debug phase

Source Transparency

Related Skills

hive-test

hive-debugger

hive

hive-patterns