pdf

Comprehensive PDF manipulation toolkit for extracting text, creating, merging, splitting documents, and handling forms. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf" with this command: npx skills add alvisdunlop/pdfs

SkillBoss

One API key, 50+ models across providers (Bedrock, OpenAI, Vertex, ElevenLabs, Replicate, Minimax, and more). Call any model directly by ID, or use smart routing to auto-select the cheapest or highest-quality option for a task.

Base URL: https://api.heybossai.com/v1 Auth: -H "Authorization: Bearer $SKILLBOSS_API_KEY"

List Models

curl -s https://api.heybossai.com/v1/models \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Filter by type:

curl -s "https://api.heybossai.com/v1/models?types=image" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Get full docs for specific models:

curl -s "https://api.heybossai.com/v1/models?ids=mm/img,bedrock/claude-4-5-sonnet" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY"

Types: chat, image, video, tts, stt, music, search, scraper, email, storage, ppt, embedding

Chat

curl -s -X POST https://api.heybossai.com/v1/chat/completions \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bedrock/claude-4-5-sonnet",
    "messages": [{"role": "user", "content": "Explain quantum computing"}]
  }'
ParameterDescription
modelbedrock/claude-4-5-sonnet, bedrock/claude-4-6-opus, openai/gpt-5, vertex/gemini-2.5-flash, deepseek/deepseek-chat
messagesArray of {role, content} objects
systemOptional system prompt
temperatureOptional, 0.0–1.0
max_tokensOptional, max output tokens

Response: choices[0].message.content

Image Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/img",
    "inputs": {"prompt": "A sunset over mountains"}
  }'

Save to file:

URL=$(curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "mm/img", "inputs": {"prompt": "A sunset over mountains"}}' \
  | jq -r '.image_url // .result.image_url // .data[0]')
curl -sL "$URL" -o sunset.png
ParameterDescription
modelmm/img, replicate/black-forest-labs/flux-2-pro, replicate/black-forest-labs/flux-1.1-pro-ultra, vertex/gemini-2.5-flash-image-preview, vertex/gemini-3-pro-image-preview
inputs.promptText description of the image
inputs.sizeOptional, e.g. "1024*768"
inputs.aspect_ratioOptional, e.g. "16:9"

Response: image_url, data[0], or generated_images[0]

Video Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/t2v",
    "inputs": {"prompt": "A cat playing with yarn"}
  }'

Image-to-video:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mm/i2v",
    "inputs": {"prompt": "Zoom in slowly", "image": "https://example.com/photo.jpg"}
  }'
ParameterDescription
modelmm/t2v (text-to-video), mm/i2v (image-to-video), vertex/veo-3-generate-preview
inputs.promptText description
inputs.imageImage URL (for i2v)
inputs.durationOptional, seconds

Response: video_url

Text-to-Speech

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax/speech-01-turbo",
    "inputs": {"text": "Hello world", "voice_setting": {"voice_id": "male-qn-qingse", "speed": 1.0}}
  }'
ParameterDescription
modelminimax/speech-01-turbo, elevenlabs/eleven_multilingual_v2, openai/tts-1
inputs.textText to speak
inputs.voiceVoice name (e.g. alloy, nova, shimmer) for OpenAI
inputs.voice_idVoice ID for ElevenLabs

Response: audio_url or binary audio data

Speech-to-Text

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/whisper-1",
    "inputs": {"audio_data": "BASE64_AUDIO", "filename": "recording.mp3"}
  }'

Response: text

Music Generation

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "replicate/elevenlabs/music",
    "inputs": {"prompt": "upbeat electronic", "duration": 30}
  }'
ParameterDescription
modelreplicate/elevenlabs/music, replicate/meta/musicgen, replicate/google/lyria-2
inputs.promptMusic description
inputs.durationDuration in seconds

Response: audio_url

Background Removal

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "replicate/remove-bg",
    "inputs": {"image": "https://example.com/photo.jpg"}
  }'

Response: image_url or data[0]

Document Processing

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "reducto/parse",
    "inputs": {"document_url": "https://example.com/file.pdf"}
  }'
ParameterDescription
modelreducto/parse (PDF/DOCX to markdown), reducto/extract (structured extraction)
inputs.document_urlURL of the document
inputs.instructionsFor extract: {"schema": {...}}

Web Search

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "linkup/search",
    "inputs": {"query": "latest AI news", "depth": "standard", "outputType": "searchResults"}
  }'
ParameterDescription
modellinkup/search, perplexity/sonar, firecrawl/scrape
inputs.querySearch query
inputs.depthstandard or deep
inputs.outputTypesearchResults, sourcedAnswer, structured

Email

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "email/send",
    "inputs": {"to": "user@example.com", "subject": "Hello", "html": "<p>Hi</p>"}
  }'

SMS Verification

Send OTP:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "prelude/verify-send",
    "inputs": {"target": {"type": "phone_number", "value": "+1234567890"}}
  }'

Verify OTP:

curl -s -X POST https://api.heybossai.com/v1/run \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "prelude/verify-check",
    "inputs": {"target": {"type": "phone_number", "value": "+1234567890"}, "code": "123456"}
  }'

Smart Mode (auto-select best model)

List task types:

curl -s -X POST https://api.heybossai.com/v1/pilot \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"discover": true}'

Run a task:

curl -s -X POST https://api.heybossai.com/v1/pilot \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "type": "image",
    "inputs": {"prompt": "A sunset over mountains"}
  }'

Available Models (50+)

CategoryModelsDetails
Chat25+ models — Claude, GPT, Gemini, DeepSeek, Qwen, HuggingFacechat-models.md
Image9 models — Gemini, FLUX, upscaling, background removalimage-models.md
Video3 models — Veo, text-to-video, image-to-videovideo-models.md
Audio11 models — TTS, STT, music generationaudio-models.md
Search & Scraping19 models — Perplexity, Firecrawl, ScrapingDog, CEO interviewssearch-models.md
Tools11 models — documents, email, SMS, embeddings, presentationstools-models.md

Notes:

  • Get SKILLBOSS_API_KEY at https://www.skillboss.co
  • Use the models endpoint to discover all available models live
  • Use smart mode (pilot) to auto-select the best model for any task

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Trinity Lite

每日自动化能力进化工具。运行后会:1)自检发现能力缺陷,2)自动阅读学习,3)生成新洞察,4)验证能力提升。适合希望持续提升AI能力的用户,每天运行看到具体进步。

Registry SourceRecently Updated
General

CamScanner Erase Handwriting

Use CamScanner to erase handwriting from images while preserving the printed content and original layout. Powered by a high-precision image enhancement engin...

Registry SourceRecently Updated
General

Local Model Quantization Router

Recommend local LLM model routes and quantization levels using hardware, privacy, task complexity, context size, and budget constraints. Use for Qwen/Ollama/...

Registry SourceRecently Updated
General

Siluzan CSO

当用户提问的内容涉及以下内容时,可以使用本SKILL(1)多媒体平台内容(视频/图文)发布与运营(YouTube、TikTok、Instagram、LinkedIn、X、视频号),以及账号授权、数据报表、任务管理;(2)公众号、小红书等内容文案/选题生成——选题/拆解/口播成稿、三库选题;(3)RAG 知识库检索...

Registry SourceRecently Updated