grok-imagine

Generate images and videos using xAI Grok Imagine Extended. Text-to-image, image editing, text-to-video, image-to-video. Use when: user asks to generate, create, or draw an image, or create/animate a video. NOT for: image analysis/understanding (use the image tool instead). Triggers: generate image, create image, draw, grok imagine, make a picture, text to image, generate video, animate, text to video.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "grok-imagine" with this command: npx skills add ademczuk/grok-imagine-extended

Grok Imagine Extended (xAI Image & Video Generation)

Generate images and videos from text prompts using the xAI API.

Image Generation

python3 {baseDir}/scripts/generate_image.py --prompt "your image description" --filename "output.png"

With options:

python3 {baseDir}/scripts/generate_image.py --prompt "a cyberpunk city at night" --filename "city.png" --resolution 2k --aspect-ratio 16:9

Image Editing

Single source image:

python3 {baseDir}/scripts/generate_image.py --prompt "make it a watercolor painting" --filename "edited.png" -i "/path/to/source.jpg"

Multiple source images (up to 3):

python3 {baseDir}/scripts/generate_image.py --prompt "combine into one scene" --filename "combined.png" -i img1.png -i img2.png

Video Generation

Text-to-video:

python3 {baseDir}/scripts/generate_image.py --prompt "a cat walking through flowers" --filename "cat.mp4" --video --duration 5

Image-to-video (animate a still):

python3 {baseDir}/scripts/generate_image.py --prompt "add gentle camera zoom and wind" --filename "animated.mp4" --video -i photo.jpg --duration 5

Models

ModelTypeCost
grok-imagine-imageImage (default)$0.02/img
grok-imagine-image-proImage (high quality)$0.07/img
grok-imagine-videoVideo (auto for --video)$0.05/sec

Select model with --model grok-imagine-image-pro. Video mode always uses grok-imagine-video.

All Options

FlagDescription
--prompt, -pText description (required)
--filename, -fOutput path (required)
-iInput image for editing/animation (repeatable, max 3 for images, 1 for video)
--model, -mImage model (default: grok-imagine-image)
--aspect-ratio, -a1:1, 16:9, 9:16, 4:3, 3:4, etc.
--resolution, -rImage: 1k/2k. Video: 480p/720p
--nNumber of images 1-10 (default 1)
--videoGenerate video instead of image
--duration, -dVideo duration 1-15 seconds (default 5)
--api-key, -kOverride XAI_API_KEY

API Key

  • XAI_API_KEY env var
  • Or set skills."grok-imagine".apiKey / skills."grok-imagine".env.XAI_API_KEY in ~/.openclaw/openclaw.json
  • Or auto-read from ~/keys.txt

Notes

  • Use timestamps in filenames: 2026-03-01-cyberpunk-city.png
  • The script prints a MEDIA: line for OpenClaw to auto-attach on supported chat providers
  • Do not read the image back; report the saved path only
  • Image URLs from xAI are temporary; the script downloads them immediately
  • Video generation is async and polls until done (can take 1-5 minutes)
  • 2k resolution returns PNG; 1k returns JPEG

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

M3U8 Downloader

Download encrypted m3u8/HLS videos using parallel downloads. Use when given an m3u8 URL to download a video, especially encrypted HLS streams with AES-128.

Registry SourceRecently Updated
General

Data Analyst Cn

数据分析助手 - 数据清洗、统计分析、可视化建议。适合:数据分析师、产品经理、运营。

Registry SourceRecently Updated
General

QuantumOS

Install and manage QuantumOS, an AI command center dashboard for OpenClaw. Use when the user wants to set up QuantumOS, start/stop the dashboard, troubleshoo...

Registry SourceRecently Updated