avatar-video

Create AI avatar videos with precise control over avatars, voices, scripts, scenes, and backgrounds using HeyGen's v2 API. Use when: (1) Choosing a specific avatar and voice for a video, (2) Writing exact scripts for an avatar to speak, (3) Building multi-scene videos with different backgrounds per scene, (4) Creating transparent WebM videos for compositing, (5) Using talking photos as video presenters, (6) Integrating HeyGen avatars with Remotion, (7) Batch video generation with exact specs, (8) Brand-consistent production videos with precise control.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "avatar-video" with this command: npx skills add heygen-com/skills/heygen-com-skills-avatar-video

Avatar Video

Create AI avatar videos with full control over avatars, voices, scripts, scenes, and backgrounds. Build single or multi-scene videos with exact configuration using HeyGen's /v2/video/generate API.

Authentication

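All requests require an X-Api-Key header carrying your HeyGen API key. Store the key in the HEYGEN_API_KEY environment variable so that commands can read it without hard-coding the value.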

curl -X GET "https://api.heygen.com/v2/avatars" \
  -H "X-Api-Key: $HEYGEN_API_KEY"
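The same authenticated call can be sketched in Python. This is a minimal illustration using only the standard library: `build_request` is a hypothetical helper name, not part of any HeyGen SDK, and it only constructs the request so that the header handling can be inspected before anything is sent.

```python
import os
import urllib.request

API_BASE = "https://api.heygen.com"


def build_request(path, api_key=None, method="GET"):
    """Return an un-sent urllib Request that carries the X-Api-Key header.

    The key is taken from the argument, or from HEYGEN_API_KEY if omitted.
    """
    key = api_key or os.environ.get("HEYGEN_API_KEY")
    if not key:
        raise RuntimeError("HEYGEN_API_KEY is not set")
    return urllib.request.Request(
        API_BASE + path,
        headers={"X-Api-Key": key, "Accept": "application/json"},
        method=method,
    )
```

To actually send it, pass the result to `urllib.request.urlopen` and decode the body with `json.load`; the shape of the returned JSON is described in avatars.md.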

Tool Selection

If HeyGen MCP tools are available (mcp__heygen__*), prefer them over direct HTTP API calls — they handle authentication and request formatting automatically.

| Task | MCP Tool | Fallback (Direct API) |
| --- | --- | --- |
| Check video status / get URL | mcp__heygen__get_video | GET /v2/videos/{video_id} |
| List account videos | mcp__heygen__list_videos | GET /v2/videos |
| Delete a video | mcp__heygen__delete_video | DELETE /v2/videos/{video_id} |

Video generation (POST /v2/video/generate) and avatar/voice listing are done via direct API calls; see video-generation.md, avatars.md, and voices.md.
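The preference for MCP tools with a direct-API fallback could be expressed as a small lookup. This is only a sketch: `select_route` and the returned dictionary shape are hypothetical, and the assumption that the MCP tools take a `video_id` argument is mine, not something the skill states.

```python
# Task name -> (MCP tool, HTTP method, path template), taken from the table above.
ROUTES = {
    "get_video": ("mcp__heygen__get_video", "GET", "/v2/videos/{video_id}"),
    "list_videos": ("mcp__heygen__list_videos", "GET", "/v2/videos"),
    "delete_video": ("mcp__heygen__delete_video", "DELETE", "/v2/videos/{video_id}"),
}


def select_route(task, available_tools, **params):
    """Prefer the MCP tool when it is available, else describe the HTTP call."""
    tool, method, path = ROUTES[task]
    if tool in available_tools:
        # Argument names passed to the MCP tool are an assumption.
        return {"kind": "mcp", "tool": tool, "arguments": params}
    return {"kind": "http", "method": method, "path": path.format(**params)}
```

Keeping the mapping in one table means the fallback path stays in sync with the tool list if either changes.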

Default Workflow

  1. List avatars: GET /v2/avatars → pick an avatar, preview it, note avatar_id and default_voice_id. See avatars.md
  2. List voices (if needed): GET /v2/voices → pick a voice matching the avatar's gender/language. See voices.md
  3. Write the script: Structure scenes with one concept each. See scripts.md
  4. Generate the video: POST /v2/video/generate with avatar, voice, script, and background per scene. See video-generation.md
  5. Poll for completion: GET /v2/videos/{video_id} until status is completed. See video-status.md
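Step 4 can be sketched as a payload builder with one entry per scene. The field names below (`video_inputs`, `character`, `voice`, `background`, `dimension`) follow my understanding of the /v2/video/generate request body and should be confirmed against video-generation.md; `build_scene` and `build_generate_payload` are hypothetical helper names, and the top-level placement of `test` is an assumption.

```python
def build_scene(avatar_id, voice_id, text, background_color="#FFFFFF"):
    """Describe one scene: who speaks, with which voice, over which background."""
    if not text.strip():
        raise ValueError("scene script must not be empty")
    return {
        "character": {"type": "avatar", "avatar_id": avatar_id, "avatar_style": "normal"},
        "voice": {"type": "text", "input_text": text, "voice_id": voice_id},
        "background": {"type": "color", "value": background_color},
    }


def build_generate_payload(scenes, width=1280, height=720, test=False):
    """Assemble the JSON body for POST /v2/video/generate (assumed schema)."""
    scenes = list(scenes)
    if not scenes:
        raise ValueError("at least one scene is required")
    payload = {
        "video_inputs": scenes,
        "dimension": {"width": width, "height": height},
    }
    if test:
        # Assumed location of the test flag; check video-generation.md.
        payload["test"] = True
    return payload
```

A two-scene video with different backgrounds is then `build_generate_payload([build_scene(a, v, "Intro", "#FFFFFF"), build_scene(a, v, "Details", "#003366")])`, where `a` and `v` are IDs noted in steps 1 and 2. Serialize the result with `json.dumps` and send it with the X-Api-Key header.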

Quick Reference

| Task | Read |
| --- | --- |
| List and preview avatars | avatars.md |
| List and select voices | voices.md |
| Write and structure scripts | scripts.md |
| Generate video (single or multi-scene) | video-generation.md |
| Add custom backgrounds | backgrounds.md |
| Add captions / subtitles | captions.md |
| Add text overlays | text-overlays.md |
| Create transparent WebM video | video-generation.md (WebM section) |
| Use templates | templates.md |
| Create avatar from photo | photo-avatars.md |
| Check video status / download | video-status.md |
| Upload assets (images, audio) | assets.md |
| Use with Remotion | remotion-integration.md |
| Set up webhooks | webhooks.md |

When to Use This Skill vs Create Video

This skill is for precise control — you choose the avatar, write the exact script, configure each scene.

If the user just wants to describe a video idea and let AI handle the rest (script, avatar, visuals), use the create-video skill instead.

| User Says | Create Video Skill | This Skill |
| --- | --- | --- |
| "Make me a video about X" | ✓ | |
| "Create a product demo" | ✓ | |
| "I want avatar Y to say exactly Z" | | ✓ |
| "Multi-scene video with different backgrounds" | | ✓ |
| "Transparent WebM for compositing" | | ✓ |
| "Use this specific voice for my script" | | ✓ |
| "Batch generate videos with exact specs" | | ✓ |

Reference Files

Core Video Creation

Video Customization

Advanced Features

Integration

Foundation

Best Practices

  1. Preview avatars before generating — Download preview_image_url so the user can see the avatar before committing
  2. Use avatar's default voice — Most avatars have a default_voice_id pre-matched for natural results
  3. Fallback: match gender manually — If no default voice, ensure avatar and voice genders match
  4. Use test mode for development — Set test: true to avoid consuming credits (output will be watermarked)
  5. Set generous timeouts — Video generation often takes 5-15 minutes, sometimes longer
  6. Validate inputs — Check avatar and voice IDs exist before generating
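Practices 2, 3, and 5 can be sketched together: pick the avatar's default voice with a gender-matched fallback, then poll with a generous timeout. This is an illustration under stated assumptions: the `gender` and `voice_id` keys, the `failed` status value, and the idea that `fetch_status` returns the status object for a video are all assumptions to verify against avatars.md, voices.md, and video-status.md. Both function names are hypothetical.

```python
import time


def pick_voice(avatar, voices):
    """Use the avatar's default voice, else the first voice of the same gender."""
    default = avatar.get("default_voice_id")
    if default:
        return default
    gender = (avatar.get("gender") or "").lower()
    for voice in voices:
        if gender and (voice.get("gender") or "").lower() == gender:
            return voice["voice_id"]
    raise LookupError("no voice matches the avatar's gender")


def wait_for_video(fetch_status, video_id, timeout_s=1800, interval_s=15,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll until status is 'completed', raising on failure or timeout.

    fetch_status(video_id) is supplied by the caller and is expected to
    return a dict with a 'status' key (assumed response shape).
    """
    deadline = clock() + timeout_s
    while True:
        data = fetch_status(video_id)
        status = data.get("status")
        if status == "completed":
            return data
        if status == "failed":  # assumed terminal status
            raise RuntimeError(f"video {video_id} failed: {data.get('error')}")
        if clock() >= deadline:
            raise TimeoutError(f"video {video_id} not ready after {timeout_s}s")
        sleep(interval_s)
```

The 30-minute default timeout leaves headroom over the 5-15 minute range mentioned above. Injecting `fetch_status`, `sleep`, and `clock` keeps the loop independent of any HTTP client and easy to exercise without waiting.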

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
