Agent Browser Skill

Two browser automation options are available, sharing the same Chrome for Testing v148 binary:

OpenClaw browser tool — built-in plugin, managed by gateway, auto-started
agent-browser CLI (v0.26.0) — standalone, extra features (batch, auth vault, visual diff, chat)

Rule of thumb: Public websites → OpenClaw browser tool. Logged-in sites → Tandem Browser profile="user". Complex automation → agent-browser CLI via exec.

Quick Reference

Status

browser doctor           # Check health
browser status           # Running state
browser start            # Launch (auto-start if not running)
browser stop             # Shutdown

Navigate & Read

browser navigate url="https://example.com"
browser snapshot              # Accessibility tree with @ref IDs
browser snapshot refs="aria"  # Playwright aria refs (more stable)
browser screenshot            # Visual capture

Act (use snapshot refs)

browser act request={kind:"click", ref:"@e1"}
browser act request={kind:"fill", ref:"@e3", text:"value"}
browser act request={kind:"type", ref:"@e3", text:"value slowly"}
browser act request={kind:"press", key:"Enter"}
browser act request={kind:"select", ref:"@e2", values:["option1"]}
browser act request={kind:"hover", ref:"@e5"}
browser act request={kind:"wait", timeMs:2000}

Tab Management

browser open url="...", label="my-tab"   # Open with named label
browser focus targetId="my-tab"           # Switch to tab
browser close targetId="t1"               # Close tab
browser tabs                              # List all open tabs

Agent-Browser CLI (via exec)

agent-browser open https://...
agent-browser snapshot -i -c
agent-browser click @e2
agent-browser fill @e3 "text"
agent-browser screenshot --annotate
agent-browser keyboard type "text"
agent-browser auth save <name> --url <url> --username <u> --password <p>
agent-browser diff snapshot
agent-browser batch ["open url" "snapshot -i" "click @e1"]

Operating Loop

Check status — browser status or browser doctor if health unknown
Navigate — browser navigate url="<target>"
Read — browser snapshot to get interactive elements with @ref IDs
Act — Use refs from snapshot for click/fill/press/select
Re-snapshot — After any navigation or UI change, snapshot again (refs become stale)
Recover — If ref fails, snapshot once more; if still broken, report as manual action
Profile pick — Omit profile for public sites; use profile="user" for logged-in Tandem sessions

Tips

Use refs="aria" for more stable element references that survive page re-renders
Label task tabs at open time: label="research"
agent-browser chat "..." for AI-driven natural language browser tasks
agent-browser --headed to show browser window for debugging
agent-browser --profile Default to reuse Chrome login state

agent-browser

Safety Notice

Copy this and send it to your AI assistant to learn

Agent Browser Skill

Quick Reference

Status

Navigate & Read

Act (use snapshot refs)

Tab Management

Agent-Browser CLI (via exec)

Operating Loop

Tips

Source Transparency

Related Skills

Mini Coder Max

Nutrition MCP

image-deck: GPT Image 2 Slide Deck

Auth0 Flask