kameo

Generate expressive talking-head videos from static images using Kameo AI. Converts static avatars/portraits into dynamic 5-second videos with realistic facial expressions, lip-sync, and motion. Use when you need to bring static images to life, create AI character videos, demonstrate visual communication, or generate talking avatars from photos.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "kameo" with this command: npx skills add veya2ztn/kameo

Kameo AI - Talking Head Video Generation

Transform static images into expressive talking-head videos with realistic motion and lip-sync.

Quick Start

scripts/generate_video.sh <image_path> <prompt> [output_file]

Example:

scripts/generate_video.sh avatar.jpg "Hello, I am an AI assistant" output.mp4

What It Does

  • Takes a static image (portrait/avatar)
  • Adds realistic facial motion, expressions, and lip-sync based on your prompt
  • Generates 5-second video in 9:16, 16:9, or 1:1 aspect ratio
  • Returns CDN URL instantly (processing ~10-30 seconds)

Authentication

Set your Kameo API key:

export KAMEO_API_KEY="kam_I3rdx43IymFNbfBw1c0ZbSc7o3aUfQgz8cljZA6T7fs"

Or store in ~/.config/kameo/credentials.json:

{
  "api_key": "kam_I3rdx43IymFNbfBw1c0ZbSc7o3aUfQgz8cljZA6T7fs"
}

Getting an API Key:

  1. Register at kameo.chat (requires email verification)
  2. Login to get JWT token
  3. Create API key via /api/public/keys endpoint
  4. Or use the registration helper: scripts/register.sh

Prompt Engineering

Basic Prompts (Simple)

Just the dialogue:

"Hello, I'm here to help you today"
"こんにちは、私はガッキーです。愛してます。"

Works but results are generic.

Enhanced Prompts (Recommended)

Format:

[Detailed scene/environment], [person's complete appearance and expression], speaking in [tone], "[DIALOGUE]". [Camera and lighting details].

Example:

In a bright outdoor winter setting with soft, overcast daylight, a young woman with long dark hair wearing a white knitted winter hat with ear flaps and a colorful patterned sweater stands centered in frame. She looks directly into the camera with a warm, genuine smile, her eyes crinkling with joy, speaking in a cheerful, affectionate tone, "こんにちは、私はガッキーです。愛してます。" The scene is captured in a medium close-up shot, framed at eye level. The lighting is natural and diffused from above, creating soft, even illumination.

Why Enhanced Prompts Matter:

  • Better facial expressions matching the scene context
  • More natural motion and gestures
  • Improved lip-sync quality
  • Contextual emotional delivery

Prompt Enhancement Workflow

For best results, use vision AI to analyze the image first:

  1. Feed the image to a vision model (Gemini, GPT-4V, Claude)
  2. Ask it to describe the scene in cinematic detail
  3. Insert your dialogue into the description
  4. Use the enhanced prompt for Kameo

See: scripts/enhance_prompt.sh for automated enhancement.

API Details

Base URL: https://api.kameo.chat/api/public

Generate Video

curl -X POST https://api.kameo.chat/api/public/generate \
  -H "X-API-Key: kam_I3rdx43IymFNbfBw1c0ZbSc7o3aUfQgz8cljZA6T7fs" \
  -H "Content-Type: application/json" \
  -d '{
    "image_base64": "<base64_encoded_image>",
    "prompt": "Your detailed prompt here",
    "seconds": 5,
    "aspect_ratio": "9:16"
  }'

Parameters:

  • image_base64 (required): Base64-encoded JPEG/PNG
  • prompt (required): Dialogue and/or scene description
  • seconds (optional): 5 (default) or 10
  • aspect_ratio (optional): "9:16" (default), "16:9", or "1:1"

Response:

{
  "job_id": "uuid",
  "status": "completed",
  "video_url": "https://cdn.kameo.chat/videos/{uuid}.mp4",
  "duration_seconds": 5,
  "processing_time_ms": 15000
}

Check Credits

curl -H "X-API-Key: kam_..." \
  https://api.kameo.chat/api/public/credits

Response:

{
  "permanent_credits": 294,
  "subscription_credits": 0,
  "total_available": 294
}

Pricing

curl https://api.kameo.chat/api/public/pricing

Cost: 3 credits per video

Performance

  • Processing time: 8-35 seconds (depends on aspect ratio and queue)
  • 9:16 (portrait): ~30-35s
  • 16:9 (landscape): ~15-20s
  • 1:1 (square): ~10-15s

Best Practices

  1. Optimize image size - Resize large images before encoding (saves bandwidth, faster upload)

    ffmpeg -i large.jpg -vf scale=720:-1 optimized.jpg
    
  2. Use descriptive prompts - Enhanced prompts = better results

  3. Choose aspect ratio wisely

    • 9:16: Mobile/social media (TikTok, Instagram Stories)
    • 16:9: Desktop/YouTube
    • 1:1: Profile pictures, square posts
  4. Monitor credits - Check balance with scripts/check_credits.sh

Limitations

  • CDN access: Video URLs may have time-limited access or require authentication
  • Download: Videos may return 403 when downloaded via curl (use browser or authenticated session)
  • Rate limits: 10 generations per minute

Troubleshooting

"401 Unauthorized"

  • Check your API key is set correctly
  • Verify key hasn't been revoked

"402 Insufficient credits"

  • Check credit balance: scripts/check_credits.sh
  • Need to add credits at kameo.chat

"Timeout errors"

  • 9:16 videos take longer (~30s)
  • Increase timeout in scripts
  • Retry if server is busy

"403 when downloading video"

  • CDN URLs may be time-limited
  • Try accessing in browser immediately after generation
  • Or save the base64 response if available

Use Cases

  • AI character videos - Bring bot avatars to life
  • Social media content - Dynamic profile videos
  • Demos and presentations - Talking product demos
  • Educational content - Video tutorials with AI presenters
  • Multilingual content - Same avatar speaking different languages

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

SERP Outline Extractor

Turn a target keyword or query into a search-informed content outline with likely subtopics, questions, and comparison angles. Useful for SEO briefs, blog pl...

Registry SourceRecently Updated
General

Multi-Model Response Comparator

Compare responses from multiple AI models for the same task and summarize differences in quality, style, speed, and likely cost. Best for model selection, ev...

Registry SourceRecently Updated
General

API Pricing Comparator

Compare AI API or model pricing across providers and produce a structured summary for product pages, blog posts, or buyer guides. Works with OpenAI-compatibl...

Registry SourceRecently Updated