Text and Image to Video Skill

Create a video generation task from user-provided text and an image. The task is submitted immediately; the agent then polls the task status and returns the video link once it is ready.

Usage Scenarios

✅ Recommended for these situations:

"Turn this image into a video as I describe"
"Please help me make a video from this image according to my requirements"
"Use this image to generate a video as requested"
"Generate a video based on text and an image"

Not for These Scenarios

❌ Do not use for the following cases:

The user asks for video editing, trimming, or adding special effects → Please use a video editing tool
The user requests screen recording or capture → Please use a screen recording tool
The user only wants to check the progress of an existing video task → Point them to the file or system where that task is tracked

Prerequisites

export MAGIC_API_KEY="your-key"

MAGIC_API_KEY is the required environment variable for the remote video service client.
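
Before calling the client, an agent may want to confirm that the variable is actually set. The following is a minimal sketch; the helper name `require_api_key` is hypothetical and not part of the skill's scripts.

```python
import os


def require_api_key(env=None):
    """Return MAGIC_API_KEY, or raise a clear error if it is missing or blank.

    `env` defaults to os.environ; passing a dict makes the check easy to test.
    """
    env = os.environ if env is None else env
    key = env.get("MAGIC_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            'MAGIC_API_KEY is not set; run: export MAGIC_API_KEY="your-key"'
        )
    return key
```

Failing early here gives the user an actionable message instead of an error from the remote service.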

Overall Workflow (Agent Guide)

1. Extract the full text (TEXT) and the image address or path (IMAGE) from the user's message.
2. Use the video-create subcommand to create the task, read the JSON written to stdout, and extract the task_id.
3. Tell the user the task_id in the chat by outputting "Video generation task has been created, task ID: task_id. I will keep checking the task status and inform you when the video link is ready."
4. Use the video-wait subcommand with --task-id to poll the task until it completes. A task_status of 2 means success.
5. Extract the video_url from the video-wait command's stdout.
6. Tell the user the final video link (video_url) in the chat. If the wait times out, report that instead.
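
The steps above can be sketched end to end in Python. This is an illustration under stated assumptions, not the skill's own code: `run_client`, `generate_video`, and the `notify` callback are hypothetical names, and only the subcommands and flags documented in this guide are used. The wording of the failure message is also an assumption.

```python
import json
import subprocess


def run_client(base_dir, *args):
    """Run media_gen_client.py with the given arguments and parse its stdout JSON."""
    argv = ["python3", f"{base_dir}/scripts/media_gen_client.py", *args]
    completed = subprocess.run(argv, capture_output=True, text=True)
    return json.loads(completed.stdout)


def generate_video(text, image, base_dir, notify, run=run_client):
    """Create the task, announce the task_id, wait, then announce the result."""
    created = run(base_dir, "video-create", "--text", text, "--image", image)
    task_id = created["data"]["task_id"]
    notify(
        f"Video generation task has been created, task ID: {task_id}. "
        "I will keep checking the task status and inform you when the video link is ready."
    )
    result = run(
        base_dir, "video-wait", "--task-id", task_id, "--poll", "10", "--timeout", "600"
    )
    data = result.get("data") or {}
    # Per this guide, task_status 2 means success.
    if data.get("task_status") == 2 and data.get("video_url"):
        notify(f"Task complete ✅\ntask_id: {task_id}\nVideo link: {data['video_url']}")
        return data["video_url"]
    notify(f"Task {task_id} did not finish successfully: {result.get('msg')}")
    return None
```

The `run` parameter exists so the flow can be exercised with a stub instead of the real remote service.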

Python Client (Step-by-Step Example & Chat Output)

Step 1: Create the Video Task and Show the `task_id` in Chat

Obtain the desired video text from the user and store it in TEXT; get the image address and store it in IMAGE.
- If the text contains double quotes ", be sure to escape them (e.g., replace " with \") to prevent command parsing errors.
Run the following command (invoked by the agent tool; {baseDir} will be replaced with the skill directory):

python3 {baseDir}/scripts/media_gen_client.py video-create \
  --text "TEXT" --image "IMAGE"
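
If the agent launches the client from Python rather than through a shell string, manual escaping may be unnecessary: arguments passed as a list reach the script unchanged. The sketch below assumes that setup; `build_create_command` is a hypothetical helper, and `/path/to/skill` stands in for {baseDir}.

```python
import shlex


def build_create_command(base_dir, text, image):
    """Build the video-create argv as a list, so quotes in `text` need no escaping."""
    return [
        "python3",
        f"{base_dir}/scripts/media_gen_client.py",
        "video-create",
        "--text", text,
        "--image", image,
    ]


argv = build_create_command(
    "/path/to/skill", 'A cat says "hello"', "https://example.com/cat.png"
)
# shlex.join produces a shell-safe string, useful for logging or for a shell-based tool.
print(shlex.join(argv))
```

The list can be handed directly to `subprocess.run(argv, capture_output=True, text=True)`; the `shlex.join` form is only needed when a single command string is required.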

Read the command's standard output (stdout), which is JSON, for example:

{
    "biz_code": 10000,
    "msg": "Success",
    "data": {
        "task_id": "2032443088023777280"
    },
    "trace_id": "664c6e22-1edd-11f1-bf4c-8262dce7d13f"
}

Parse the task_id from the JSON (e.g. "2032443088023777280"), and inform the user in the chat:

Output: "Video generation task has been created, task ID: task_id. I will keep checking the task status and inform you when the video link is ready."
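
As a sketch of this parsing step, the snippet below reads the task_id from the sample response shown above and builds the chat message. The function names are hypothetical, and the error handling for a missing task_id is an assumption, since this guide does not show a failed create response.

```python
import json


def parse_task_id(stdout):
    """Extract data.task_id from the video-create stdout JSON."""
    payload = json.loads(stdout)
    data = payload.get("data") or {}
    task_id = data.get("task_id")
    if not task_id:
        # Assumed failure shape: no task_id present; surface whatever msg was returned.
        raise ValueError(f"no task_id in response (msg={payload.get('msg')!r})")
    return str(task_id)


def creation_message(task_id):
    """Format the chat message that announces the new task."""
    return (
        f"Video generation task has been created, task ID: {task_id}. "
        "I will keep checking the task status and inform you when the video link is ready."
    )


sample = """
{
    "biz_code": 10000,
    "msg": "Success",
    "data": {"task_id": "2032443088023777280"},
    "trace_id": "664c6e22-1edd-11f1-bf4c-8262dce7d13f"
}
"""
print(creation_message(parse_task_id(sample)))
```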

Step 2: Poll Task Status and Output the Final `video_url` in Chat

Use the task_id obtained in the previous step.
Execute this command (poll every 10 seconds, wait up to 600 seconds; if timeout, please try again later):

python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600
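
The internals of video-wait are not shown in this guide. As a rough illustration of what "poll every 10 seconds, wait up to 600 seconds" means, here is a generic polling loop; `poll_until` and its parameters are hypothetical and are not taken from media_gen_client.py.

```python
import time


def poll_until(check, poll=10.0, timeout=600.0, sleep=time.sleep, clock=time.monotonic):
    """Call check() every `poll` seconds until it returns a non-None value.

    Raises TimeoutError if the next wait would run past `timeout` seconds.
    `sleep` and `clock` are injectable so the loop can be tested without waiting.
    """
    deadline = clock() + timeout
    while True:
        result = check()
        if result is not None:
            return result
        if clock() + poll > deadline:
            raise TimeoutError(f"task not finished after {timeout} seconds")
        sleep(poll)
```

With the values above, the status is checked at most about 60 times before the wait gives up.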

Read the standard output. On success, the JSON output looks like:

{
    "biz_code": 10000,
    "msg": "Success",
    "data": {
        "task_id": "1234567890",
        "task_status": 2,
        "video_url": "https://www.magiclight.com/examplevideo.mp4"
    },
    "trace_id": "c89aeca8-1edd-11f1-bf4c-8262dce7d13f"
}

Parse the key fields from the output:

Task status (e.g., task_status: 2), where status 2 means success
Video link (e.g., video_url: "https://example.com/path/to/video.mp4")

Recommended chat reply flow:

Summarize the key info in plain language, for example:

"Task complete ✅
task_id: abc-123
Video link: https://example.com/path/to/video.mp4"

If the result shows task failure or timeout (e.g., success is false, video_url is empty, or error is timeout):

Explain the failure reason (include error info if possible), and inform the user they can retry later or check possible issues like input or quota.
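
The reply flow above could be sketched as follows. The success check uses only fields that appear in the sample output (task_status and video_url); the shape of a failed or timed-out response is not documented here, so the failure branch is an assumption that simply reports whatever msg is present. `build_reply` is a hypothetical name.

```python
import json

SUCCESS_STATUS = 2  # per this guide, task_status 2 means success


def build_reply(stdout):
    """Turn the video-wait stdout into a plain-language chat reply."""
    try:
        payload = json.loads(stdout)
    except json.JSONDecodeError:
        return "Task failed ❌\nThe client output was not valid JSON. Please retry later."
    if not isinstance(payload, dict):
        payload = {}
    data = payload.get("data") or {}
    task_id = data.get("task_id", "unknown")
    video_url = data.get("video_url")
    if data.get("task_status") == SUCCESS_STATUS and video_url:
        return f"Task complete ✅\ntask_id: {task_id}\nVideo link: {video_url}"
    reason = payload.get("msg") or "no error message returned"
    return (
        f"Task failed ❌\ntask_id: {task_id}\nReason: {reason}\n"
        "You can retry later, or check the input text, the image, and your quota."
    )
```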

Script Output Requirements

The agent must always:
- Parse stdout JSON.
- Clearly inform the user of both the task ID and video link in the chat.
