agent-browser

A fast headless browser automation CLI that enables AI agents to navigate, click, type, and snapshot pages. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "agent-browser" with this command: npx skills add agent-browsers

SkillBoss

One API key, 50+ models across providers (Bedrock, OpenAI, Vertex, ElevenLabs, Replicate, Minimax, and more). Call any model directly by ID, or use smart routing to auto-select the cheapest or highest-quality option for a task.

Base URL: https://api.heybossai.com/v1 Auth: -H "Authorization: Bearer $SKILLBOSS_API_KEY"

List Models

curl -s https://api.heybossai.com/v1/models \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Filter by type:

curl -s "https://api.heybossai.com/v1/models?types=image" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Get full docs for specific models:

curl -s "https://api.heybossai.com/v1/models?ids=mm/img,bedrock/claude-4-5-sonnet" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Types: chat, image, video, tts, stt, music, search, scraper, email, storage, ppt, embedding

Chat

curl -s -X POST https://api.heybossai.com/v1/chat/completions \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bedrock/claude-4-5-sonnet",
    "messages": [{"role": "user", "content": "Explain quantum computing"}]
  }'
ParameterDescription
modelbedrock/claude-4-5-sonnet, bedrock/claude-4-6-opus, openai/gpt-5, vertex/gemini-2.5-flash, deepseek/deepseek-chat
messagesArray of {role, content} objects
systemOptional system prompt
temperatureOptional, 0.0–1.0
max_tokensOptional, max output tokens

Response: choices[0].message.content

Image Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/img",
    "inputs": {"prompt": "A sunset over mountains"}
  }'

Save to file:

URL=$(curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "mm/img", "inputs": {"prompt": "A sunset over mountains"}}' \
  | jq -r '.image_url // .result.image_url // .data[0]')
curl -sL "$URL" -o sunset.png
ParameterDescription
modelmm/img, replicate/black-forest-labs/flux-2-pro, replicate/black-forest-labs/flux-1.1-pro-ultra, vertex/gemini-2.5-flash-image-preview, vertex/gemini-3-pro-image-preview
inputs.promptText description of the image
inputs.sizeOptional, e.g. "1024*768"
inputs.aspect_ratioOptional, e.g. "16:9"

Response: image_url, data[0], or generated_images[0]

Video Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/t2v",
    "inputs": {"prompt": "A cat playing with yarn"}
  }'

Image-to-video:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/i2v",
    "inputs": {"prompt": "Zoom in slowly", "image": "https://example.com/photo.jpg"}
  }'
ParameterDescription
modelmm/t2v (text-to-video), mm/i2v (image-to-video), vertex/veo-3-generate-preview
inputs.promptText description
inputs.imageImage URL (for i2v)
inputs.durationOptional, seconds

Response: video_url

Text-to-Speech

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax/speech-01-turbo",
    "inputs": {"text": "Hello world", "voice_setting": {"voice_id": "male-qn-qingse", "speed": 1.0}}
  }'
ParameterDescription
modelminimax/speech-01-turbo, elevenlabs/eleven_multilingual_v2, openai/tts-1
inputs.textText to speak
inputs.voiceVoice name (e.g. alloy, nova, shimmer) for OpenAI
inputs.voice_idVoice ID for ElevenLabs

Response: audio_url or binary audio data

Speech-to-Text

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/whisper-1",
    "inputs": {"audio_data": "BASE64_AUDIO", "filename": "recording.mp3"}
  }'

Response: text

Music Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "replicate/elevenlabs/music",
    "inputs": {"prompt": "upbeat electronic", "duration": 30}
  }'
ParameterDescription
modelreplicate/elevenlabs/music, replicate/meta/musicgen, replicate/google/lyria-2
inputs.promptMusic description
inputs.durationDuration in seconds

Response: audio_url

Background Removal

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "replicate/remove-bg",
    "inputs": {"image": "https://example.com/photo.jpg"}
  }'

Response: image_url or data[0]

Document Processing

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "reducto/parse",
    "inputs": {"document_url": "https://example.com/file.pdf"}
  }'
ParameterDescription
modelreducto/parse (PDF/DOCX to markdown), reducto/extract (structured extraction)
inputs.document_urlURL of the document
inputs.instructionsFor extract: {"schema": {...}}

Web Search

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "linkup/search",
    "inputs": {"query": "latest AI news", "depth": "standard", "outputType": "searchResults"}
  }'
ParameterDescription
modellinkup/search, perplexity/sonar, firecrawl/scrape
inputs.querySearch query
inputs.depthstandard or deep
inputs.outputTypesearchResults, sourcedAnswer, structured

Email

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "email/send",
    "inputs": {"to": "user@example.com", "subject": "Hello", "html": "<p>Hi</p>"}
  }'

SMS Verification

Send OTP:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "prelude/verify-send",
    "inputs": {"target": {"type": "phone_number", "value": "+1234567890"}}
  }'

Verify OTP:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "prelude/verify-check",
    "inputs": {"target": {"type": "phone_number", "value": "+1234567890"}, "code": "123456"}
  }'

Smart Mode (auto-select best model)

List task types:

curl -s -X POST https://api.heybossai.com/v1/pilot \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"discover": true}'

Run a task:

curl -s -X POST https://api.heybossai.com/v1/pilot \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "type": "image",
    "inputs": {"prompt": "A sunset over mountains"}
  }'

Available Models (50+)

CategoryModelsDetails
Chat25+ models — Claude, GPT, Gemini, DeepSeek, Qwen, HuggingFacechat-models.md
Image9 models — Gemini, FLUX, upscaling, background removalimage-models.md
Video3 models — Veo, text-to-video, image-to-videovideo-models.md
Audio11 models — TTS, STT, music generationaudio-models.md
Search & Scraping19 models — Perplexity, Firecrawl, ScrapingDog, CEO interviewssearch-models.md
Tools11 models — documents, email, SMS, embeddings, presentationstools-models.md

Notes:

  • Get SKILLBOSS_API_KEY at https://www.skillboss.co
  • Use the models endpoint to discover all available models live
  • Use smart mode (pilot) to auto-select the best model for any task

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

A Python CLI skill for Cutout.Pro visual APIs — background removal, face cutout, and photo enhancement. Supports file upload & image URL input.

Call Cutout.Pro visual processing APIs to perform background removal, face cutout, and photo enhancement. Supports both file upload and image URL input, retu...

Registry SourceRecently Updated
Coding

client-onboarding-agent

Client onboarding and business diagnostic framework for AI agent deployments. Covers 4-round diagnostic process, 6 constraint categories, deployment SOP with...

Registry SourceRecently Updated
Coding

Ai Tools

AI Tools Box - Search and invoke 100+ AI tools. Categories: Writing, Image, Video, Coding, Office, Search, Chat, Audio, Design, Agent, Translation, Dev Platf...

Registry SourceRecently Updated