volc-digital-human

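Volcengine digital human video generation skill. It triggers when a user sends a photo together with dialogue or voice-over copy and asks for a talking-head digital human video. The whole flow is automatic: image upload, avatar creation, TTS voice-over (automatic gender detection, multi-voice matching), video synthesis, and delivery back to the user. Trigger words include 数字人 (digital human), 视频合成 (video synthesis), 口播视频 (talking-head video), and 数字人视频 (digital human video).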

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "volc-digital-human" with this command: npx skills add xiaoxiaole2025/volc-digital-human

Volcengine Digital Human Video Generator

⚠️ First-Time Setup Required

This skill requires Volcengine Access Key (AK) and Secret Key (SK).

Get AK/SK

  1. Register a Volcengine account: https://console.volcengine.com/
  2. Enable the "Digital Human Video Generation" service
  3. Create an Access Key: https://console.volcengine.com/iam/keymanage/

Configuration (choose one)

Option 1: Config file (recommended)

Create config.json (in the skill directory):

{
  "ak": "your_access_key_here",
  "sk": "your_secret_key_here"
}

Option 2: Environment variables

export VOLC_AK="your_access_key_here"
export VOLC_SK="your_secret_key_here"

⚠️ Security: Never hardcode AK/SK in scripts or commit to public repos!
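
The two configuration options can be read by a small loader like the sketch below. It assumes config.json is tried first and the VOLC_AK / VOLC_SK environment variables are the fallback; that precedence and the function name load_credentials are illustrative assumptions, not confirmed behavior of the skill's script.

```python
import json
import os
from pathlib import Path


def load_credentials(config_path="config.json", env=None):
    """Return (ak, sk) from config.json if usable, else from VOLC_AK / VOLC_SK.

    The lookup order is an assumption made for this sketch.
    """
    env = os.environ if env is None else env
    path = Path(config_path)
    if path.is_file():
        data = json.loads(path.read_text(encoding="utf-8"))
        ak, sk = data.get("ak"), data.get("sk")
        if ak and sk:
            return ak, sk
    # Fall back to the environment variables from Option 2.
    ak, sk = env.get("VOLC_AK"), env.get("VOLC_SK")
    if ak and sk:
        return ak, sk
    raise RuntimeError("Volcengine AK/SK not found in config.json or VOLC_AK/VOLC_SK")
```

Failing loudly when neither source is set keeps a missing key from surfacing later as a 401 from the API.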


⚠️ Privacy Notice

This skill uploads images and generated audio/video to third-party file hosts (catbox.moe, 0x0.st) to create publicly accessible URLs required by the Volcengine API.

  • Do not use with sensitive/private images you don't want uploaded to public hosts
  • User images and generated content will be publicly accessible during the video generation process
  • Download and use videos promptly; URLs may expire

Core Flow

  1. Get image: Fetch from /root/.openclaw/media/inbound/
  2. Gender detection: OpenCV Haar cascade for eyes/nose features
  3. Upload image: Upload to catbox.moe for public URL
  4. Create avatar: Call realman_avatar_picture_create_role API
  5. TTS audio: Auto-match voice by gender + edge-tts → upload to catbox.moe
  6. Video synthesis: Call realman_avatar_picture_v2 API, poll for result
  7. Download video: Save locally, generate thumbnail preview
  8. Deliver: Send thumbnail + video via message tool
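
Step 6 polls for the synthesis result. A minimal sketch of such a polling loop follows. The fetch_status callable and the "done" / "failed" status strings are hypothetical placeholders; the real response fields are documented in references/volc_api.md and may differ.

```python
import time


def poll_until_done(fetch_status, interval=5.0, timeout=300.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status() until it reports completion, failure, or timeout.

    fetch_status is a hypothetical callable returning a dict such as
    {"status": "done", "video_url": "..."}; the status names are assumptions.
    """
    deadline = clock() + timeout
    while True:
        result = fetch_status()
        status = result.get("status")
        if status == "done":
            return result
        if status == "failed":
            raise RuntimeError(f"video synthesis failed: {result}")
        if clock() >= deadline:
            raise TimeoutError("video synthesis did not finish in time")
        sleep(interval)
```

The default timeout of 300 seconds is chosen only to sit above the 3 minute upper bound on typical generation time; sleep and clock are injectable so the loop can be exercised without waiting.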

Quick Run

cd /root/.openclaw/workspace-employee-xiaozhua
python3 skills/volc-digital-human/scripts/volc_digital_human.py "$image_path" "$dialog_text" [gender]

Parameters:

  • image_path: Path to the image; pass None to auto-fetch the latest image
  • dialog_text: Script/dialog content
  • gender: Optional; male, female, or None (auto-detect)
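
The auto-fetch behavior of image_path could look roughly like the sketch below. It assumes "latest" means the most recently modified file and that only jpg/png files are considered; both are guesses, and resolve_image_path is a hypothetical helper name rather than a function from the skill's script.

```python
from pathlib import Path

INBOUND_DIR = "/root/.openclaw/media/inbound/"
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumption: formats the API decodes


def resolve_image_path(image_path=None, inbound_dir=INBOUND_DIR):
    """Return the given path, or the newest image in inbound_dir when it is None."""
    # The CLI passes arguments as strings, so treat the literal "None" as unset.
    if image_path and image_path != "None":
        return Path(image_path)
    candidates = [
        p for p in Path(inbound_dir).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    ]
    if not candidates:
        raise FileNotFoundError(f"no images found in {inbound_dir}")
    # Assumption: "latest" is decided by modification time.
    return max(candidates, key=lambda p: p.stat().st_mtime)
```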

Voice Matching Rules

| Detected Gender | Human Voice | Cartoon Voice |
| --- | --- | --- |
| female | zh-CN-XiaoxiaoNeural (natural female) | zh-CN-XiaoyiNeural (lively female) |
| male | zh-CN-YunxiNeural (sunny male) | zh-CN-YunxiaNeural (cute male) |
| unknown | zh-CN-XiaoxiaoNeural (default female) | zh-CN-XiaoyiNeural |

Manual override:

  • Say "male"/"男生"/"男的" → force male voice
  • Say "female"/"女生"/"女的" → force female voice
  • Say "cartoon"/"卡通角色"/"动物" → use cartoon voice
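
The matching rules and manual overrides can be expressed as a small lookup. This is a sketch that uses plain substring matching on the user's message; the actual script may parse overrides differently, and select_voice is a hypothetical name.

```python
HUMAN_VOICES = {
    "female": "zh-CN-XiaoxiaoNeural",
    "male": "zh-CN-YunxiNeural",
    "unknown": "zh-CN-XiaoxiaoNeural",
}
CARTOON_VOICES = {
    "female": "zh-CN-XiaoyiNeural",
    "male": "zh-CN-YunxiaNeural",
    "unknown": "zh-CN-XiaoyiNeural",
}
FEMALE_WORDS = ("female", "女生", "女的")
MALE_WORDS = ("male", "男生", "男的")
CARTOON_WORDS = ("cartoon", "卡通角色", "动物")


def select_voice(detected_gender, user_text=""):
    """Pick an edge-tts voice name from the detected gender and override keywords."""
    text = user_text.lower()
    gender = detected_gender if detected_gender in ("male", "female") else "unknown"
    # Check female keywords first: "female" contains "male" as a substring.
    if any(word in text for word in FEMALE_WORDS):
        gender = "female"
    elif any(word in text for word in MALE_WORDS):
        gender = "male"
    voices = CARTOON_VOICES if any(word in text for word in CARTOON_WORDS) else HUMAN_VOICES
    return voices[gender]
```

For example, select_voice("male", "use a female voice") returns zh-CN-XiaoxiaoNeural, because an explicit override takes priority over the detected gender.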

Detailed API Reference

See references/volc_api.md

Key Parameters

| Parameter | Description |
| --- | --- |
| image_url | Public URL (required), uploaded to file host |
| audio_url | Public URL for audio MP3 (required) |
| resource_id | Avatar ID returned after creation, can be reused |
| req_key | create=realman_avatar_picture_create_role, synthesize=realman_avatar_picture_v2 |
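
As a rough illustration of how these parameters fit together, the sketch below builds one request body per req_key. The parameter names and req_key values come from the table above; the flat dictionary layout and the helper names are assumptions, so check references/volc_api.md for the real request schema.

```python
CREATE_REQ_KEY = "realman_avatar_picture_create_role"
SYNTH_REQ_KEY = "realman_avatar_picture_v2"


def _require_public_url(url):
    """Reject local paths; the API needs a publicly reachable URL."""
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"expected a public URL, got: {url!r}")
    return url


def build_create_payload(image_url):
    """Body for avatar creation (flat layout is an assumption)."""
    return {"req_key": CREATE_REQ_KEY, "image_url": _require_public_url(image_url)}


def build_synthesis_payload(resource_id, audio_url):
    """Body for video synthesis, reusing the avatar's resource_id."""
    return {
        "req_key": SYNTH_REQ_KEY,
        "resource_id": resource_id,
        "audio_url": _require_public_url(audio_url),
    }
```

Because resource_id can be reused, only the synthesis payload needs to be rebuilt when the same avatar is given new audio.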

Notes

  • Image tips: Closed-mouth photos work better; WeChat thumbnails also work
  • Gender detection: Heuristic based on Haar eye/nose features, not 100% accurate; confirm with user if needed
  • Cartoon/animal: Use lively female voice zh-CN-XiaoyiNeural as default
  • Video URL expiry: ~1 hour, download promptly
  • Generation time: Usually 30 seconds to 3 minutes
  • Rate limit: Volcengine enforces request frequency limits; on error 50430, wait 1-5 minutes and retry
  • TTS: edge-tts (Microsoft free), no API key needed

Error Handling

| Error Code | Meaning | Solution |
| --- | --- | --- |
| 50430 | Rate limit | Wait 1-5 min, retry |
| 50207 | Image decode error | Use jpg/png format |
| 401 | AK/SK error | Check credentials |
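
The retry advice for 50430 can be sketched as a wrapper that waits and retries only on the rate-limit code and re-raises everything else. VolcApiError and call_with_retry are hypothetical names introduced for illustration; how the real script surfaces error codes is not shown in this document.

```python
import time

RATE_LIMIT_CODE = 50430


class VolcApiError(Exception):
    """Hypothetical exception carrying the API error code."""

    def __init__(self, code, message=""):
        super().__init__(f"{code}: {message}")
        self.code = code


def call_with_retry(call, max_attempts=3, wait_seconds=60.0, sleep=time.sleep):
    """Run call(); on a 50430 rate-limit error, wait and try again."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except VolcApiError as err:
            # Image decode (50207) and credential (401) errors will not
            # succeed on retry, so only the rate limit is retried.
            if err.code != RATE_LIMIT_CODE or attempt == max_attempts:
                raise
            sleep(wait_seconds)
```

The default wait of 60 seconds sits at the low end of the suggested 1-5 minute window; raise wait_seconds if the limit keeps triggering.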

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
