PoYo Grok Imagine Video Generation
Use this skill for grok-imagine jobs on PoYo. It covers short text-to-video, image-to-video, and mode-based styling.
Use When
- The user explicitly asks for Grok Imagine or grok-imagine.
- The task is a 6-second or 10-second clip.
- The workflow needs text-to-video, image-to-video, or mode styling.
Core Capability
grok-imagine is a single-model video entry point. Use image_urls for image-to-video and mode for fun, normal, or spicy style control.
Key Inputs
- prompt is required.
- image_urls is for image-to-video and supports one image.
- duration supports 6 and 10.
- aspect_ratio supports 1:1, 2:3, 3:2 for text-to-video.
- mode supports fun, normal, spicy.
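The input rules above can be checked before anything is submitted. The following is a minimal sketch, not the official client: the helper name build_grok_imagine_payload, the flat JSON layout, and the "model" field name are assumptions for illustration, and it assumes aspect_ratio is only sent for text-to-video requests. Check references/api.md for the actual payload shape.

```python
# Hypothetical helper: validates inputs against the documented constraints
# and builds a payload dict. The flat layout and the "model" key are assumptions.
ALLOWED_DURATIONS = {6, 10}
ALLOWED_ASPECT_RATIOS = {"1:1", "2:3", "3:2"}
ALLOWED_MODES = {"fun", "normal", "spicy"}


def build_grok_imagine_payload(prompt, image_urls=None, duration=None,
                               aspect_ratio=None, mode=None):
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")

    payload = {"model": "grok-imagine", "prompt": prompt}

    if duration is not None:
        if duration not in ALLOWED_DURATIONS:
            raise ValueError("duration must be 6 or 10")
        payload["duration"] = duration

    if mode is not None:
        if mode not in ALLOWED_MODES:
            raise ValueError("mode must be fun, normal, or spicy")
        payload["mode"] = mode

    if image_urls:
        # Image-to-video: exactly one image is supported.
        if len(image_urls) != 1:
            raise ValueError("image_urls supports exactly one image")
        payload["image_urls"] = list(image_urls)
        # Assumption: aspect_ratio applies to text-to-video only, so it is
        # left out of image-to-video payloads.
    elif aspect_ratio is not None:
        if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
            raise ValueError("aspect_ratio must be 1:1, 2:3, or 3:2")
        payload["aspect_ratio"] = aspect_ratio

    return payload
```

Optional fields are only included when the caller sets them, so the sketch does not guess at server-side defaults.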
Execution
- Read references/api.md for endpoint details, model ids, key fields, example payloads, and polling notes.
- Use scripts/submit_grok_imagine.sh to submit a raw JSON payload from the shell.
- If the user only needs a curl example, adapt the example from references/api.md instead of rewriting from scratch.
- After submission, report the task_id clearly so follow-up polling is easy.
Output expectations
When helping with this model family, include:
- chosen model id
- final payload or a concise parameter summary
- whether reference images are involved
- returned task_id if a request was actually submitted
- next step: poll status or wait for webhook
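One way to keep these items consistent is to generate the summary from the payload itself. This is a rough sketch under stated assumptions: summarize_submission is a hypothetical helper, the payload is assumed to be a flat dict with a "model" key, and the "submit the payload" wording for unsubmitted requests is an illustrative choice, not part of the skill.

```python
import json


def summarize_submission(payload, task_id=None):
    """Return a plain-text summary covering the expected output items."""
    lines = [
        # Assumption: the model id lives under a top-level "model" key.
        f"model: {payload.get('model', 'unknown')}",
        f"payload: {json.dumps(payload, sort_keys=True)}",
        f"reference images: {'yes' if payload.get('image_urls') else 'no'}",
    ]
    if task_id is not None:
        # Only report a task_id when a request was actually submitted.
        lines.append(f"task_id: {task_id}")
        lines.append("next step: poll status or wait for webhook")
    else:
        lines.append("next step: submit the payload")
    return "\n".join(lines)
```

Calling summarize_submission(payload) before submission and summarize_submission(payload, task_id) afterwards keeps the two reports in the same shape.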