Screenshot Analyzer (Multi-Agent)
Extract product features from UI screenshots using a coordinated multi-agent analysis pipeline.
Core principle: Describe WHAT to build (features/interactions), NOT HOW (no tech stack).
Multi-Agent Architecture
This skill orchestrates 5 specialized agents for comprehensive analysis:
                    ┌─────────────────┐
                    │   Coordinator   │
                    │  (this skill)   │
                    └────────┬────────┘
                             │
         ┌───────────────────┼───────────────────┐
         │                   │                   │
         ▼                   ▼                   ▼
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│   UI Analyzer   │ │   Interaction   │ │    Business     │
│   (parallel)    │ │    Analyzer     │ │    Analyzer     │
│                 │ │   (parallel)    │ │   (parallel)    │
└────────┬────────┘ └────────┬────────┘ └────────┬────────┘
         │                   │                   │
         └───────────────────┼───────────────────┘
                             ▼
                    ┌─────────────────┐
                    │   Synthesizer   │
                    │  (sequential)   │
                    └────────┬────────┘
                             │
                             ▼
                    ┌─────────────────┐
                    │    Reviewer     │
                    │  (sequential)   │
                    └─────────────────┘
Process
Phase 1: Screenshot Collection
Gather all screenshots to analyze:
- Read the screenshot file(s) provided by the user
- For each screenshot, note the file path and any context provided
- If there are multiple screenshots, determine whether they are from the same product
Phase 2: Parallel Analysis
Launch THREE Task agents IN PARALLEL for each screenshot:
Agent 1: screenshot-ui-analyzer
Analyze this screenshot for UI components, layout structure, and design patterns.
Screenshot: [file path]
Return your analysis as JSON.
Agent 2: screenshot-interaction-analyzer
Analyze this screenshot for user interactions, navigation flows, and state transitions.
Screenshot: [file path]
Return your analysis as JSON.
Agent 3: screenshot-business-analyzer
Analyze this screenshot for business functions, data entities, and domain logic.
Screenshot: [file path]
Return your analysis as JSON.
IMPORTANT: Use the Task tool with THREE parallel calls in a single message to maximize efficiency.
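The fan-out described above can be pictured with a small Python sketch. This is only an illustration of the shape of Phase 2, not how the Task tool is invoked: `run_agent` is a hypothetical stand-in for a single Task call, and the dictionary it returns is made up for the example. Only the three agent names come from this skill.

```python
import asyncio

# Agent names come from the skill text; everything else here is hypothetical.
ANALYZERS = [
    "screenshot-ui-analyzer",
    "screenshot-interaction-analyzer",
    "screenshot-business-analyzer",
]


async def run_agent(agent: str, screenshot: str) -> dict:
    """Hypothetical stand-in for one Task call; returns a placeholder result."""
    await asyncio.sleep(0)  # yield control, as a real call would while waiting
    return {"agent": agent, "screenshot": screenshot}


async def analyze_screenshot(screenshot: str) -> dict:
    """Start all three analyzers together, then collect results by agent name."""
    results = await asyncio.gather(
        *(run_agent(agent, screenshot) for agent in ANALYZERS)
    )
    # gather preserves input order, so results line up with ANALYZERS
    return dict(zip(ANALYZERS, results))


if __name__ == "__main__":
    print(asyncio.run(analyze_screenshot("screens/home.png")))
```

The point of the sketch is that all three analyses are started before any result is awaited, which mirrors issuing the three Task calls in a single message rather than one after another.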
Phase 3: Synthesis
After all parallel analyses complete, launch the synthesizer agent:
Agent 4: screenshot-synthesizer
Synthesize these analysis results into a unified development task list.
UI Analysis: [paste UI analyzer result]
Interaction Analysis: [paste Interaction analyzer result]
Business Analysis: [paste Business analyzer result]
Product Name: [product name]
Output file: docs/plans/YYYY-MM-DD-<product>-features.md
Phase 4: Review
Launch the reviewer agent to validate the output:
Agent 5: screenshot-reviewer
Review this task list for completeness and quality.
Original screenshot(s): [file paths]
Task list: [synthesized output]
If issues found, provide corrections.
Phase 5: Output
- Write final task list to docs/plans/YYYY-MM-DD-<product>-features.md
- Use format from references/output-format.md
- Present summary to user
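As a rough sketch of how the output path pattern could be filled in, the helper below builds docs/plans/YYYY-MM-DD-<product>-features.md from a product name and a date. The function name `features_path` and the slug rule (lowercase, non-alphanumeric runs collapsed to a hyphen) are assumptions made for illustration; the skill itself does not specify how the product name is normalized.

```python
import re
from datetime import date
from pathlib import Path
from typing import Optional


def features_path(product: str, on: Optional[date] = None) -> Path:
    """Build docs/plans/YYYY-MM-DD-<product>-features.md for a product name."""
    day = on or date.today()
    # Assumed slug rule: lowercase, runs of non-alphanumerics collapsed to "-"
    slug = re.sub(r"[^a-z0-9]+", "-", product.lower()).strip("-")
    return Path("docs/plans") / f"{day.isoformat()}-{slug}-features.md"


if __name__ == "__main__":
    print(features_path("Acme Notes", date(2025, 1, 31)).as_posix())
```

Passing the date explicitly keeps the result reproducible; leaving it out falls back to the current day.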
Key Guidelines
- Use - [ ] checkbox format for all tasks
- Break features into small, executable subtasks
- Focus on user interactions, not implementation details
- For multiple screenshots: deduplicate features across all screens
- For competitive analysis: highlight unique features and gaps
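The checkbox and deduplication guidelines can be illustrated with a short Python sketch. It assumes features arrive as plain strings grouped by screen and treats two features as duplicates when their text matches after trimming and case folding. Both the input shape and the matching rule are assumptions for the example; in practice the synthesizer agent makes this judgment on meaning, not exact text.

```python
def to_checklist(features_by_screen: dict[str, list[str]]) -> str:
    """Merge per-screen feature lists, drop repeats, and render "- [ ]" tasks."""
    seen: set[str] = set()
    lines: list[str] = []
    for features in features_by_screen.values():
        for feature in features:
            # Assumed matching rule: same text ignoring case and outer spaces
            key = feature.strip().casefold()
            if key and key not in seen:
                seen.add(key)
                lines.append(f"- [ ] {feature.strip()}")
    return "\n".join(lines)


if __name__ == "__main__":
    screens = {
        "home": ["Search notes", "Create note"],
        "list": ["search notes ", "Archive note"],
    }
    print(to_checklist(screens))
```

The first occurrence of each feature wins, so the resulting list keeps the order in which screens were analyzed.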
Benefits of Multi-Agent Approach
- Thoroughness: three specialized perspectives catch more details
- Speed: parallel analysis reduces total time
- Quality: synthesis plus review ensures coherent, complete output
- Specialization: each agent focuses on its domain expertise