macOS Control Skill
A high-fidelity automation bridge for macOS (Darwin) that enables agents to perceive the desktop state and execute precise mouse and keyboard interactions.
🛠 Included Scripts
This skill leverages specialized wrappers located in the /scripts directory to interface with system-level binaries:
1. cliclick_wrapper.sh
A dedicated wrapper for the cliclick utility to handle synthetic input events.
- Path:
scripts/cliclick_wrapper.sh - Logic: Executes
/opt/homebrew/bin/cliclickwith passed arguments. - Capabilities: Left/Right clicks, mouse movement, and keyboard emulation.
2. vision_wrapper.sh
The visual perception engine for the skill.
- Path:
scripts/vision_wrapper.sh - Logic: Utilizes the native macOS
screencaptureutility in silent mode (-x). - Output: Generates a standard PNG at
/tmp/claw_view.png.
🚀 Tool Specifications
see
Captures the current screen state for visual analysis.
- Returns: A string confirming the filepath of the capture.
- Use Case: Identifies UI elements, window positions, and application states.
click
Sends precise commands to the mouse and keyboard.
- Usage:
click "c:x,y"(Click) orclick "m:x,y"(Move). - Syntax: Supports all
cliclickstandard notation includingw:(wait) andt:(type).
⚙️ Requirements & Setup
- Binary Dependency:
brew install cliclick