fal-model-selector

Helps choose the right fal.ai model before making API calls. Provides a quick decision matrix for video generation (text-to-video, image-to-video), image editing (object removal, relighting, style transfer, virtual try-on, outpaint), image generation, background removal, upscaling, TTS, and 3D. Use when the user wants to generate video, generate images, edit images, remove a background, upscale, or use any fal.ai model. Triggers on "which fal model", "fal.ai", "generate video", "image to video", "text to video", "video generation", "edit image", "remove background", "upscale", "style transfer", "which model should I use".

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "fal-model-selector" with this command: npx skills add phy-fal-model-selector

FAL Model Selector

Quick decision guide for choosing fal.ai models. Read references/ for full parameters.

Key Concepts

Image-to-Video vs Reference-to-Video

|  | Image-to-Video | Reference-to-Video |
|---|---|---|
| Input | 1 image (starting frame) | Multiple images (1-5 references) |
| Purpose | Animate that exact image | Maintain subject consistency in new scenes |
| The image becomes... | The first frame of the video | A "template" for what the subject looks like |
| Camera/scene | Starts from that exact view | Can be completely different scenes |

Image-to-Video = "animate this exact image"

  • Product photo → animate it rotating
  • Portrait → make the person smile/blink

Reference-to-Video = "generate video featuring this subject"

  • 3 photos of mascot → video of mascot in new scene
  • Product from different angles → consistent commercial
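
The distinction above can be sketched as a small decision helper. This is a minimal illustration, not part of the skill itself: the function name `choose_video_mode` and its parameters are hypothetical, and the 1-5 image limit is taken from the comparison above.

```python
def choose_video_mode(num_images: int, animate_exact_image: bool = False) -> str:
    """Pick a video generation mode from the inputs available.

    Hypothetical helper: encodes the Image-to-Video vs Reference-to-Video
    rule of thumb described above.
    """
    if num_images < 0:
        raise ValueError("num_images must be non-negative")

    if animate_exact_image:
        # Image-to-Video uses exactly one image as the first frame.
        if num_images != 1:
            raise ValueError("image-to-video needs exactly 1 starting image")
        return "image-to-video"

    if num_images == 0:
        # Nothing to animate or reference: prompt-only generation.
        return "text-to-video"

    if num_images > 5:
        raise ValueError("reference-to-video takes 1-5 reference images")

    # Images act as a template for the subject, not as the first frame.
    return "reference-to-video"
```

For example, `choose_video_mode(1, animate_exact_image=True)` selects image-to-video for a product photo that should start rotating, while `choose_video_mode(3)` selects reference-to-video for three mascot photos.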

Quick Decision Matrix

Video: Text → Video

| Need | Model | Price/5s | Why |
|---|---|---|---|
| Best quality | fal-ai/kling-video/v2.6/pro/text-to-video | $0.35-0.70 | Cinematic, native audio |
| Google (Veo) | fal-ai/veo3.1 | $1-2 | Google's best, 4K, natural sound |
| OpenAI (Sora) | fal-ai/sora-2/text-to-video | $0.50 | Detailed narratives, lip-sync |
| Budget | fal-ai/ltx-2/text-to-video | $0.30 | Good quality, includes audio |
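
To compare the Price/5s figures for a clip of a different length, a rough estimate can be sketched as below. The helper `estimate_video_cost` is hypothetical, and it assumes cost scales linearly with duration; actual fal.ai billing (fixed duration steps, audio or resolution surcharges) may differ, so treat the result as a ballpark only.

```python
def estimate_video_cost(price_per_5s: tuple[float, float], duration_s: float) -> tuple[float, float]:
    """Scale a (low, high) price-per-5-seconds range to a given duration.

    Assumption: price is proportional to duration. This is an
    illustrative simplification, not fal.ai's documented billing rule.
    """
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    low, high = price_per_5s
    scale = duration_s / 5
    return (round(low * scale, 2), round(high * scale, 2))


# Price ranges copied from the Text -> Video table above (USD per 5 s).
KLING_V26_PRO = (0.35, 0.70)
LTX_2 = (0.30, 0.30)
```

Under that assumption, `estimate_video_cost(KLING_V26_PRO, 10)` gives a range of roughly $0.70 to $1.40 for a 10-second clip.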

Video: Image → Video

| Need | Model | Price/5s | Why |
|---|---|---|---|
| Best quality | fal-ai/kling-video/v2.1/pro/image-to-video | $0.45 | Cinematic, precise motion |
| Google (Veo) | fal-ai/veo3.1/image-to-video | $1-2 | 4K resolution, Google quality |
| OpenAI (Sora) | fal-ai/sora-2/image-to-video | $0.50 | OpenAI quality, realistic |
| Multi-scene | wan/v2.6/image-to-video | $0.50-0.75 | Scene segmentation |
| Budget | fal-ai/pixverse/v5.5/image-to-video | $0.15-0.40 | Multiple styles |
| Start+End frame | fal-ai/bytedance/seedance/v1.5/pro/image-to-video | $0.26 | Control both ends |

Video: Reference → Video (Multiple Images)

| Need | Model | Price/5s | Why |
|---|---|---|---|
| Google (Veo) | fal-ai/veo3.1/reference-to-video | $0.50-1.75 | Up to 5 refs, best consistency |
| Multi-ref | fal-ai/vidu/q2/reference-to-video | $0.50 | Multiple reference images |
| First+Last frame | fal-ai/veo3.1/first-last-frame-to-video | $1-3 | Controlled transitions |

Video: Extend/Remix

| Need | Model | Price/5s | Why |
|---|---|---|---|
| Extend (Google) | fal-ai/veo3.1/extend-video | $1-2 | Up to 30s total |
| Remix (OpenAI) | fal-ai/sora-2/video-to-video/remix | $0.50 | Restyle existing videos |

Image: Generate (Text → Image)

| Need | Model | Price | Why |
|---|---|---|---|
| Best quality | fal-ai/flux-2-pro | $0.03/MP | Zero-config, production-ready |
| Google (Gemini) | fal-ai/gemini-3-pro-image-preview | $0.15-0.30 | Up to 4K, web search |
| OpenAI (GPT) | fal-ai/gpt-image-1.5 | varies | DALL-E quality, editing |
| Text/typography | fal-ai/ideogram/v3 | $0.03-0.09 | Best for logos, posters |
| Budget | fal-ai/flux-2 | $0.012/MP | Good quality, cheaper |
| Custom style | fal-ai/flux-2/lora | $0.021/MP | Up to 3 LoRAs |
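
Several prices above are quoted per megapixel (MP), so the cost of one image depends on its resolution. The sketch below shows the arithmetic under two stated assumptions: 1 MP = 1,000,000 pixels, and billing is proportional to pixel count with no rounding. The helper name `image_cost` is hypothetical, and fal.ai's actual rounding rules may differ.

```python
def image_cost(width_px: int, height_px: int, price_per_mp: float) -> float:
    """Estimate the cost of one image from a per-megapixel price.

    Assumptions (not confirmed by the source): 1 MP = 1,000,000 pixels
    and cost is strictly proportional to the pixel count.
    """
    if width_px <= 0 or height_px <= 0:
        raise ValueError("image dimensions must be positive")
    megapixels = (width_px * height_px) / 1_000_000
    return megapixels * price_per_mp


# Per-MP prices copied from the Text -> Image table above (USD).
FLUX_2_PRO_PER_MP = 0.03
FLUX_2_PER_MP = 0.012
```

Under these assumptions, a 1024x1024 image is about 1.05 MP, so it comes to roughly $0.031 on fal-ai/flux-2-pro and $0.013 on fal-ai/flux-2.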

Image: Edit

| Need | Model | Price | Why |
|---|---|---|---|
| General edit | fal-ai/flux-2/edit | $0.012/MP | Multi-image, natural language |
| OpenAI (GPT) | fal-ai/gpt-image-1.5/edit | varies | DALL-E editing quality |
| Google (Gemini) | fal-ai/gemini-3-pro-image-preview/edit | $0.15-0.30 | 4K editing capability |
| Text in image | fal-ai/qwen-image-edit-plus | - | Superior text editing |
| Fast iteration | fal-ai/reve/edit | $0.04/img | Quick, remix support |
| Style transfer | fal-ai/flux/dev/image-to-image | $0.03/MP | Preserve composition |

Image: Specialized Apps

| Need | Model | Why |
|---|---|---|
| Virtual try-on | fal-ai/image-apps-v2/virtual-try-on | Clothing on person |
| Product photo | fal-ai/image-apps-v2/product-photography | Studio lighting |
| Relighting | fal-ai/image-apps-v2/relighting | 17 lighting presets |
| Remove object | fal-ai/image-apps-v2/object-removal | Text-based targeting |
| Style transfer | fal-ai/image-apps-v2/style-transfer | 26 artistic presets |
| Outpaint | fal-ai/image-apps-v2/outpaint | Extend boundaries |

Image: Enhancement

| Need | Model | Price | Why |
|---|---|---|---|
| Upscale (portrait) | clarityai/crystal-upscaler | $0.016/MP | Facial detail focus |
| Remove background | fal-ai/birefnet/v2 | - | Multiple models, up to 2304px |
| Segmentation | fal-ai/sam-3/image | $0.005 | Text/point/box prompts |

Image: Character/Identity

| Need | Model | Why |
|---|---|---|
| Consistent character | fal-ai/photomaker | Identity-preserving portraits |
| Brand style | fal-ai/flux-2/lora | Custom trained LoRAs |

Video: Tools

| Need | Model | Price | Why |
|---|---|---|---|
| Upscale | clarityai/crystal-video-upscaler | $0.10/MP-sec | Up to 5K |
| Lipsync | fal-ai/sync-lipsync/v2/pro | ~1.67x base | Natural features |

Other

| Need | Model | Price | Why |
|---|---|---|---|
| TTS | fal-ai/maya | $0.002/sec | Emotions, accents |
| Image → 3D | fal-ai/hunyuan3d-v3/image-to-3d | $0.375 | GLB/OBJ/FBX |
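
The decision matrix can also be kept as a lookup table in code, so a model ID is resolved from a task and a need rather than typed by hand. The sketch below is illustrative only: `MODEL_MATRIX` and `pick_model` are hypothetical names, the task keys are my own labels, and only a subset of the rows above is included. The model IDs themselves are copied from the tables.

```python
# Subset of the decision matrix: task -> need -> model ID.
MODEL_MATRIX = {
    "text-to-video": {
        "best quality": "fal-ai/kling-video/v2.6/pro/text-to-video",
        "budget": "fal-ai/ltx-2/text-to-video",
    },
    "image-to-video": {
        "best quality": "fal-ai/kling-video/v2.1/pro/image-to-video",
        "budget": "fal-ai/pixverse/v5.5/image-to-video",
        "start+end frame": "fal-ai/bytedance/seedance/v1.5/pro/image-to-video",
    },
    "text-to-image": {
        "best quality": "fal-ai/flux-2-pro",
        "text/typography": "fal-ai/ideogram/v3",
        "budget": "fal-ai/flux-2",
    },
    "enhancement": {
        "remove background": "fal-ai/birefnet/v2",
        "segmentation": "fal-ai/sam-3/image",
    },
}


def pick_model(task: str, need: str = "best quality") -> str:
    """Return the model ID for a task/need pair, ignoring case and padding."""
    task_key = task.strip().lower()
    need_key = need.strip().lower()
    try:
        return MODEL_MATRIX[task_key][need_key]
    except KeyError:
        raise KeyError(f"no entry for task={task!r}, need={need!r}") from None
```

For example, `pick_model("image-to-video", "budget")` resolves to `fal-ai/pixverse/v5.5/image-to-video`. Keeping the matrix in one place means a version bump touches a single entry.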

Detailed References

  • references/video-generation.md - 16+ video models (incl. all Veo/Sora endpoints)
  • references/image-tools.md - 20+ image models (incl. Gemini/GPT Image)
  • references/other-tools.md - TTS, 3D, upscaling, lipsync

Common Patterns

import fal_client

# Product demo video
result = fal_client.subscribe(
    "fal-ai/kling-video/v2.1/pro/image-to-video",
    arguments={
        "prompt": "Product rotates smoothly, camera zooms in",
        "image_url": "https://example.com/product.jpg",
        "duration": "5"
    }
)

# Remove background
result = fal_client.subscribe(
    "fal-ai/birefnet/v2",
    arguments={
        "image_url": "https://example.com/photo.jpg",
        "model": "General"
    }
)

# Studio product photo
result = fal_client.subscribe(
    "fal-ai/image-apps-v2/product-photography",
    arguments={
        "product_image_url": "https://example.com/product.jpg",
        "aspect_ratio": "1:1"
    }
)

# Virtual try-on
result = fal_client.subscribe(
    "fal-ai/image-apps-v2/virtual-try-on",
    arguments={
        "person_image_url": "https://example.com/person.jpg",
        "clothing_image_url": "https://example.com/shirt.jpg"
    }
)
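
The patterns above all share the same call shape, so they can be wrapped in one thin helper. This is a sketch under assumptions, not part of the skill: `run_fal` is a hypothetical name, and the `subscribe` parameter is there only so the helper can be exercised without network access. By default it falls back to `fal_client.subscribe`, which expects fal.ai credentials (typically the `FAL_KEY` environment variable) to be configured.

```python
from typing import Any, Callable, Optional


def run_fal(
    model_id: str,
    arguments: dict[str, Any],
    subscribe: Optional[Callable[..., Any]] = None,
) -> Any:
    """Call a fal.ai model, dropping arguments whose value is None.

    `subscribe` defaults to fal_client.subscribe; pass a stand-in
    callable to dry-run the call without hitting the API.
    """
    if subscribe is None:
        # Imported lazily so the helper can be loaded without fal_client installed.
        import fal_client

        subscribe = fal_client.subscribe

    # Optional parameters left as None are omitted rather than sent as null.
    cleaned = {key: value for key, value in arguments.items() if value is not None}
    return subscribe(model_id, arguments=cleaned)


def dry_run(model_id: str, arguments: dict[str, Any]) -> dict[str, Any]:
    """Stand-in for subscribe that echoes the request instead of sending it."""
    return {"model": model_id, "arguments": arguments}
```

Calling `run_fal("fal-ai/birefnet/v2", {"image_url": "https://example.com/photo.jpg", "model": None}, subscribe=dry_run)` returns the request that would be sent, which helps when checking arguments against references/ before spending credits.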
