p-video-avatar

Generate talking head avatar videos from a single portrait image via inference.sh CLI.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "p-video-avatar" with this command: npx skills add inference-sh/skills@infsh-cli

Pruna P-Video-Avatar

Generate talking head avatar videos from a single portrait image via inference.sh CLI.

P-Video-Avatar is the fastest and most cost-effective avatar video model available. Quality on par with Veo 3.0, 18x faster and 6x cheaper than alternatives like Fabric, OmniHuman, and HeyGen.

Quick Start

Requires inference.sh CLI (belt ). Install instructions

belt login

Generate avatar from portrait + text script

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Hello, welcome to our product demo!", "voice": "Zephyr (Female)" }'

Full Workflow: Generate Portrait + Avatar Video

Use Pruna P-Image to generate the portrait, then P-Video-Avatar to animate it:

1. Generate a portrait image with P-Image

belt app run pruna/p-image --input '{ "prompt": "professional headshot portrait of a young woman, neutral background, looking at camera, studio lighting, photorealistic", "aspect_ratio": "9:16" }'

2. Use the generated image URL to create the avatar video

belt app run pruna/p-video-avatar --input '{ "image": "<image-url-from-step-1>", "voice_script": "Hi there! Let me walk you through our latest features.", "voice": "Zephyr (Female)", "resolution": "720p" }'

Examples

Text Script with Voice Selection

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Welcome to our product walkthrough. Today I will show you three key features.", "voice": "Puck (Male)", "voice_language": "English (US)", "resolution": "720p" }'

Audio-Driven Avatar

Provide your own audio file instead of using built-in TTS:

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "audio": "https://speech.mp3" }'

When both audio and voice_script are provided, audio takes priority.

1080p with Custom Style

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "This is exciting news for our community!", "voice": "Aoede (Female)", "voice_prompt": "Enthusiastic and energetic tone, slightly faster pace", "video_prompt": "The person is presenting on stage with dramatic lighting", "resolution": "1080p" }'

Multilingual Content

Spanish

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Bienvenidos a nuestra demostración de producto.", "voice": "Kore (Female)", "voice_language": "Spanish" }'

Japanese

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "こんにちは、製品デモへようこそ。", "voice": "Leda (Female)", "voice_language": "Japanese" }'

Reproducible Generation

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Consistent results every time.", "seed": 42 }'

Available Voices

Female: Zephyr, Kore, Leda, Aoede, Callirrhoe, Autonoe, Despina, Erinome, Laomedeia, Achernar, Gacrux, Pulcherrima, Vindemiatrix, Sulafat

Male: Puck, Charon, Fenrir, Orus, Enceladus, Iapetus, Umbriel, Algenib, Algieba, Schedar, Achird, Zubenelgenubi, Sadachbia, Sadaltager, Alnilam, Rasalgethi

Supported Languages

English (US), English (UK), Spanish, French, German, Italian, Portuguese (Brazil), Japanese, Korean, Hindi

Parameters

Parameter Type Default Description

image

file required Portrait image (jpg, jpeg, png, webp)

voice_script

string

Text for the person to say

audio

file

Audio file (overrides voice_script)

voice

string "Zephyr (Female)" Voice selection

voice_language

string "English (US)" Output language

resolution

string "720p" 720p or 1080p

video_prompt

string "The person is talking." Control avatar behavior and background

voice_prompt

string "Say the following." Control tone, pacing, emotion

seed

int random Reproducible generation

disable_safety_filter

bool true Disable content filter

disable_prompt_upsampling

bool false Skip prompt enhancement

Pricing

Resolution Price

720p $0.025 per second of output video

1080p $0.045 per second of output video

Example: 30-second 720p video = $0.75

Free Launch Weekend

P-Video-Avatar is completely free from Thursday May 1, 2026 4:00 PM CET through Sunday May 4, 2026 11:59 PM CET. All costs are on us during this window — no billing, no limits on resolution.

Competitive Advantage

Feature P-Video-Avatar Fabric 1.0 OmniHuman 1.5 HeyGen Avatar 4

Speed (per sec of video) ~1.83s/s ~34s/s (18x slower) ~28s/s (15x slower) ~26s/s (14x slower)

Cost per second $0.025 $0.14 (5.6x more) $0.16 (6.4x more) $0.075 (3x more)

Built-in TTS Yes Yes No Yes

Dynamic Background Yes Yes No Yes

1080p Support Yes No No Yes

Use Cases

  • Marketing: Product demos, UGC-style ads with AI presenters

  • Education: Course videos, explainers, tutoring content

  • Localization: Dub content across 10 languages from one image

  • Social Media: Consistent virtual influencer content

  • Corporate: Training, onboarding, announcements

  • Gaming: Character avatars, NPC dialogue videos

  • Customer Support: Personalized video responses

Tips

  • Use high-quality portrait photos (front-facing, good lighting)

  • The output video aspect ratio matches the input image

  • Use video_prompt to control dynamic backgrounds and body language

  • Use voice_prompt to control speaking style, emotion, and pacing

  • Keep videos under 3 minutes for best visual consistency

  • Generate portraits with pruna/p-image using aspect ratio 9:16 for vertical avatar videos

Related Pruna Models

Generate portrait images

belt app run pruna/p-image --input '{"prompt": "professional headshot portrait"}'

General video generation

belt app run pruna/p-video --input '{"prompt": "cinematic scene"}'

Image editing

belt app run pruna/p-image-edit --input '{"prompt": "change background", "image": "https://photo.jpg"}'

Related Skills

Full platform skill (all 250+ apps)

npx skills add inference-sh/skills@infsh-cli

Pruna video generation

npx skills add inference-sh/skills@p-video

Pruna image generation

npx skills add inference-sh/skills@p-image

All video generation models

npx skills add inference-sh/skills@ai-video-generation

Image generation (for creating portraits)

npx skills add inference-sh/skills@ai-image-generation

Browse all Pruna apps: belt app list --namespace pruna

Documentation

  • Running Apps - How to run apps via CLI

  • Streaming Results - Real-time progress updates

  • Content Pipeline Example - Building media workflows

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

ai-image-generation

Generate images with 50+ AI models via inference.sh CLI.

Repository Source
Coding

ai-video-generation

Generate videos with 40+ AI models via inference.sh CLI.

Repository Source
207.6K389infsh-skills
Coding

agent-tools

Run 250+ AI apps in the cloud with a simple CLI. No GPU required.

Repository Source
207.1K389infsh-skills
Coding

infsh-cli

Run 250+ AI apps in the cloud with a simple CLI. No GPU required.

Repository Source