volcengine-ai

火山引擎AI生成与理解API。让Agent能够调用火山引擎方舟的AI能力:图片生成(Seedream-5.0-lite)、视频生成(Seedance-1.5-pro)、图片理解、视频理解。使用前需配置API密钥(VOLCENGINE_API_KEY)。支持异步任务查询。

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "volcengine-ai" with this command: npx skills add zzhimin/volcengine-ai

火山引擎 AI

调用火山引擎方舟平台的AI能力。

配置

首先设置环境变量:

export VOLCENGINE_API_KEY="222b33d4-f22f-4f99-b68f-0eb9150ab507"
# 或在 ~/.bashrc 中持久化
echo 'export VOLCENGINE_API_KEY="222b33d4-f22f-4f99-b68f-0eb9150ab507"' >> ~/.bashrc

模型ID

能力模型ID
图片生成doubao-seedream-5-0-lite
视频生成doubao-seedance-1-5-pro-251215
图片理解doubao-seed-1-8-251228
视频理解doubao-seed-1-8-251228

核心命令

1. 图片生成 (Seedream)

# 调用火山引擎API生成图片
curl -X POST "https://ark.cn-beijing.volces.com/api/v3/visual generation/tasks" \
  -H "Authorization: Bearer $VOLCENGINE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedream-5-0-lite",
    "prompt": "一只戴墨镜的橘猫",
    "size": "1024x1024",
    "num": 1
  }'

2. 视频生成 (Seedance)

# 提交视频生成任务
curl -X POST "https://ark.cn-beijing.volces.com/api/v3/video generation/tasks" \
  -H "Authorization: Bearer $VOLCENGINE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedance-1-5-pro-251215",
    "prompt": "镜头推进,一只橘猫从沙发上跳下来",
    "duration": 5,
    "ratio": "16:9"
  }'

3. 图片理解

# 图片理解(视觉理解)
curl -X POST "https://ark.cn-beijing.volces.com/api/v3/chat/completions" \
  -H "Authorization: Bearer $VOLCENGINE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seed-1-8-251228",
    "messages": [
      {
        "role": "user",
        "content": [
          {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}},
          {"type": "text", "text": "描述这张图片"}
        ]
      }
    ]
  }'

4. 视频理解

# 视频理解(使用视频URL)
curl -X POST "https://ark.cn-beijing.volces.com/api/v3/chat/completions" \
  -H "Authorization: Bearer $VOLCENGINE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seed-1-8-251228",
    "messages": [
      {
        "role": "user",
        "content": [
          {"type": "video_url", "video_url": {"url": "https://example.com/video.mp4"}},
          {"type": "text", "text": "描述这个视频"}
        ]
      }
    ]
  }'

5. 查询任务结果

对于异步任务(图片/视频生成),需要查询任务状态:

curl -X GET "https://ark.cn-beijing.volces.com/api/v3/visual generation/tasks/{task_id}" \
  -H "Authorization: Bearer $VOLCENGINE_API_KEY"

常用参数

图片生成

  • prompt: 描述词
  • size: 尺寸,如 "1024x1024"、"16:9"
  • num: 生成数量

视频生成

  • prompt: 描述词
  • duration: 时长(秒)
  • ratio: 比例,如 "16:9"、"9:16"

注意事项

  1. 生成任务是异步的,需要轮询任务状态
  2. 图片/视频理解是同步的,直接返回结果
  3. API端点根据区域选择,这里用 cn-beijing

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Chrome Use

Use chrome-use when standard web access (fetch/web search) fails due to Cloudflare challenges, CAPTCHAs, JavaScript-rendered content, or bot detection — or w...

Registry SourceRecently Updated
Automation

Agentchat Skill Publish

The messaging platform for AI agents. Send DMs, join groups, manage contacts, and check presence.

Registry SourceRecently Updated
Automation

Draft0

Official skill for interacting with Draft0, the Medium for Agents.

Registry SourceRecently Updated
Automation

ifly-pdf-image-ocr

ifly-pdf&image-ocr skill supporting both image OCR (AI-powered LLM OCR) and PDF document recognition. Use when user asks to OCR images, extract text from ima...

Registry SourceRecently Updated