Image Generation Skill

This skill enables AI-powered image generation, editing, and asset creation using Google Gemini (Gemini) and OpenAI GPT-Image.

When to Use

Activate this skill when the user wants to:

Generate images from text descriptions
Edit or modify existing images
Create project assets (icons, favicons, social images)
Generate design inspiration (moodboards)
Create consistent character designs
Compare different AI image providers

Available Commands

Command Use For

/imagegen:generate

Generate images from prompts

/imagegen:edit

Edit existing images

/imagegen:iterate

Refine images through multiple steps

/imagegen:compare

Compare Google vs OpenAI

/imagegen:assets

Generate project assets

/imagegen:moodboard

Create design inspiration sets

/imagegen:character

Create consistent character sheets

/imagegen:config

Configure defaults

Delegation

For complex image generation tasks, delegate to the image-generator subagent which has access to all generation scripts and can handle multi-step workflows.

Quick Reference

Providers

Google Gemini (Gemini)

Models: gemini-2.5-flash-image , gemini-3-pro-image-preview
Best for: Character consistency, multi-turn iteration, style variety
API Key: GEMINI_API_KEY or GOOGLE_API_KEY

OpenAI GPT-Image

Models: gpt-image-1.5 , gpt-image-1 , gpt-image-1-mini
Best for: Text in images, transparent backgrounds, precise edits
API Key: OPENAI_API_KEY

Common Sizes/Aspect Ratios

Format Google OpenAI

Square 1:1 1024x1024

Landscape 16:9 1536x1024

Portrait 9:16 1024x1536

Wide 21:9

Example Interactions

User: "Generate an image of a sunset over mountains" Action: Use /imagegen:generate --prompt "A sunset over mountains"

User: "Create app icons for my project" Action: Use /imagegen:assets --type icons --prompt "[ask for description]"

User: "Edit this image to add rain" Action: Use /imagegen:edit --image [path] --prompt "Add rain falling"

User: "I want to iterate on this design" Action: Use /imagegen:iterate --image [path] --prompt "[refinement]"

User: "Which provider would be better for logos?" Action: Explain Google is better for style variety, OpenAI for text, and suggest /imagegen:compare to test both.

Prerequisites Check

Before generating, verify:

Required Python packages: google-genai , openai , Pillow (for resizing)
API keys set in environment
Output directory accessible

Install packages

pip install google-genai openai Pillow

Set API keys (user's responsibility)

export GEMINI_API_KEY=your_key export OPENAI_API_KEY=your_key

Prompt Tips

Help users craft effective prompts:

Be descriptive but concise
Specify style (photorealistic, watercolor, minimalist)
Include lighting (golden hour, dramatic, soft)
Mention composition (close-up, wide shot, centered)
For characters, include distinctive features
For logos, specify simplicity level

imagegen

Safety Notice

Copy this and send it to your AI assistant to learn

Wide 21:9

Install packages

Set API keys (user's responsibility)

Source Transparency

Related Skills

ui-ux-pro-max

filament-resource

svg-icon-generator