agent-browser

Browser automation CLI for AI agents. Use this skill to navigate websites, interact with elements, take screenshots, and extract data using a headless browser.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "agent-browser" with this command: npx skills add bowtiedswan/agent-browser-skill/bowtiedswan-agent-browser-skill-agent-browser

Agent Browser Skill

This skill provides access to the agent-browser CLI, a powerful tool for headless browser automation designed for AI agents.

🚀 Usage

The core workflow relies on snapshots and refs. Instead of guessing CSS selectors, you get a snapshot of the page with unique references (like @e1, @e2) for every interactive element.

Basic Workflow

  1. Navigate: agent-browser open <url>
  2. Analyze: agent-browser snapshot -i (Get interactive elements with refs)
  3. Interact: agent-browser click @e1 or agent-browser fill @e2 "text"
  4. Repeat: Take a new snapshot after interactions to see the updated state.

Common Commands

  • Open URL: agent-browser open google.com
  • Get Snapshot: agent-browser snapshot -i (Interactive only, recommended)
  • Click: agent-browser click @e1
  • Type/Fill: agent-browser fill @e2 "search term"
  • Press Key: agent-browser press Enter
  • Go Back: agent-browser back
  • Screenshot: agent-browser screenshot page.png
  • Read Text: agent-browser get text @e1

Advanced

  • Sessions: agent-browser --session my-session open ... (Keep cookies/state separate)
  • Wait: agent-browser wait --text "Success"
  • Help: agent-browser --help

💡 Tips for Claude

  • Always snapshot first: Before clicking or typing, get a fresh snapshot to ensure you have valid refs.
  • Use -i flag: agent-browser snapshot -i filters for interactive elements, reducing noise.
  • Check output: The CLI returns JSON or text. Read it to confirm actions succeeded.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

agent-browser

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

Repository SourceNeeds Review
22.4K101.3K
vercel-labs
Coding

agent browser

No summary provided by upstream source.

Repository SourceNeeds Review
-1.2K
am-will
Coding

agent browser

No summary provided by upstream source.

Repository SourceNeeds Review