videoagent-video-studio

Generate short AI videos from text or images — text-to-video, image-to-video, and reference-based generation — with zero API key setup. Use when the user wants to create a video clip, animate an image, or generate video from a description.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "videoagent-video-studio" with this command: npx skills add pexoai/pexo-skills/pexoai-pexo-skills-videoagent-video-studio

🎬 VideoAgent Video Studio

Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.

Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.


Quick Reference

User IntentModeTypical Duration
"Make a video of..." (no image)text-to-video4–10 s
"Animate this image" / "Make this move"image-to-video4–6 s
"Turn this into a video with..."image-to-video4–6 s
Cinematic, story, adPrefer text-to-video with detailed prompt5–10 s

Generation Modes

ModeDescriptionModels
text-to-videoText prompt only → videominimax, kling, veo, hunyuan, grok, seedance
image-to-videoSingle image + prompt → animated clipminimax, kling, veo, pixverse, grok, seedance
reference-basedReference images/video → consistent outputminimax, kling, veo, hunyuan, grok, seedance

Models (use --model <id>)

Model IDT2VI2VReferenceNotes
minimaxSubject reference image, character consistency
klingMulti-element / character / keyframe (O3)
veoGoogle Veo 3.1, multiple reference images
hunyuanVideo-to-video style transfer
pixverseStylized image-to-video
grokVideo editing via reference video
seedanceSeedance 1.5 Pro, synchronized audio, 4–12 s

Full model details and endpoint reference: references/models.md.


How to Generate a Video

Step 1 — Choose mode and enhance the prompt

  • Text-to-video: Expand with subject, action, camera movement, lighting, and style. Be specific about motion (e.g. "camera slowly zooms in", "character walks left to right").
  • Image-to-video: Describe the motion to apply to the image (e.g. "gentle breeze in the hair", "camera pans across the scene"). See references/prompt_guide.md for patterns.

Step 2 — Run the script

Text-to-video:

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "<enhanced prompt>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Image-to-video:

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "<motion description>" \
  --image-url "<public image URL>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Parameters:

ParameterDefaultDescription
--modetext-to-videotext-to-video or image-to-video
--prompt(required)Scene or motion description
--image-urlRequired for image-to-video; public image URL
--duration5Length in seconds (typically 4–10)
--aspect-ratio16:916:9, 9:16, 1:1, 4:3, 3:4
--modelautoModel ID (e.g. kling, veo, grok, seedance); auto = proxy picks

Other commands:

CommandDescription
node tools/generate.js --list-modelsList available models from the proxy
node tools/generate.js --status --job-id <id>Check async job status

Step 3 — Return the result

The script returns JSON:

{
  "success": true,
  "mode": "text-to-video",
  "videoUrl": "https://...",
  "duration": 5,
  "aspectRatio": "16:9"
}

Send videoUrl to the user.


Example Conversations

User: "Generate a short video of a cat walking in the rain, cinematic."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "A cat walking through rain, wet streets, neon reflections, cinematic lighting, slow motion, 4K" \
  --duration 5 \
  --aspect-ratio 16:9

User: "Animate this photo" (user uploads a landscape)

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "Gentle clouds moving across the sky, subtle grass movement, cinematic atmosphere" \
  --image-url "https://..." \
  --duration 5 \
  --aspect-ratio 16:9

User: "Make a 10-second vertical video of a coffee pour, slow motion."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "Close-up of coffee pouring into a white cup, slow motion, steam rising, soft lighting, product shot" \
  --duration 10 \
  --aspect-ratio 9:16

User: "Use Google Veo for a cinematic shot."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --model veo \
  --prompt "A dragon flying through cloudy skies, cinematic lighting, 8s" \
  --duration 8 \
  --aspect-ratio 16:9

User: "Animate this portrait."

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --model grok \
  --prompt "Gentle smile, subtle head turn" \
  --image-url "https://..." \
  --duration 5

Setup

Zero API keys by default. Requests go through a hosted proxy. Set these for a custom proxy or token:

VariableRequiredDescription
VIDEO_STUDIO_PROXY_URLNoProxy base URL
VIDEO_STUDIO_TOKENNoAuth token if the proxy requires it

Knowledge Base

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Video Agent (Deprecated)

[DEPRECATED] Use `create-video` for prompt-based video generation or `avatar-video` for precise avatar/scene control. This legacy skill combines both workflo...

Registry SourceRecently Updated
64.5K
Profile unavailable
Automation

Create Video

Create videos from a text prompt using HeyGen's Video Agent. Use when: (1) Creating a video from a description or idea, (2) Generating explainer, demo, or ma...

Registry SourceRecently Updated
065
Profile unavailable
General

Hailuo Video Generator

MiniMax Hailuo 视频生成技能 - 使用 S2V-01 模型进行主体参考视频生成。可以生成视频、查询任务状态、下载视频文件。

Registry SourceRecently Updated
0153
Profile unavailable