fmt-highdef-dynamic

FMT肠菌移植科普高清动态图生成器(插画版+4K静态图+1080P动态视频+批量模式+风格一致)。生成配套数字手绘插画静态PNG(4K超清),并转化为高清晰度1080P动态视频。风格活泼生动,适合科普教育。触发词:生成FMT动态图 / 生成高清动态 / 插画转高清视频。

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "fmt-highdef-dynamic" with this command: npx skills add zmy1006-sudo/fmt-highdef-dynamic

FMT高清动态图生成器 v6.0

专为《探秘人体微宇宙 · FMT肠菌移植健康科普(插画版).docx》文档设计 v6.0(2026-05-12):新增风格参考图+批量生成+质量核查+降级策略


⚠️ 四大核心问题修复(必读)

问题1:图片清晰度不足 → 4K分辨率

  • 必须在 image_synthesize 时指定 resolution: "4K",输出4096px宽超清图像
  • 4K分辨率下文字边缘更锐利,AI渲染更精准

问题2:中文字清晰准确 → 双轨策略

  • 策略A(首选):Prompt中正确引导,文字放在色块背景上,白字蓝底对比
  • 策略B(保底):AI生成无文字版 → Python PIL叠加精准中文字层

问题3:动态视频文字消失 → 文字保留强制指令

  • 每条 gen_videos Prompt必含文字保留模块(见第四章)

问题4:内容单薄 → 三层场景结构强制要求

  • 前景(装饰粒子)+ 中景(多角色互动)+ 背景(场景延伸)

一、工具与输出规格

工具用途输出规格
image_synthesize静态插画PNG(首选)4K(4096px),16:9
nano-banana-pro静态插画PNG(备选,中文更好)1K/2K/4K
gen_videos(I2V模式)动态视频(首选)1080P,6秒,MP4
batch_image_to_video动态视频(备选)768P,6秒,MP4

⚠️ gen_videos 默认选6秒1080P;10秒1080P会自动降级为768P。


二、风格一致性保障(新增·改善1)

首次使用本技能时,必须先生成"风格参考图",作为后续所有图片的风格锚点。

步骤0:生成风格参考图(每批次只做一次)

目的:锁定标准角色外观、线条风格、配色体系,确保20张图风格一致

生成命令:

image_synthesize({
  requests: [{
    aspect_ratio: "16:9",
    output_file: "/workspace/fmt_style_guide.png",
    prompt: "[风格参考图Prompt,见下方标准模板]",
    resolution: "4K"
  }]
})

风格参考图Prompt(固定不变,每次套用):

[4K ULTRA HIGH DEFINITION - 4096px width, 16:9]

[STYLE REFERENCE SHEET - All future images must follow this exact style]

[TITLE AREA] "FMT肠菌移植 · 风格标准" in bold white Chinese on #B5DDE5 blue banner at top

[CHARACTER REFERENCE ROW - 5 chibi characters side by side]:
Character 1: Doctor - blue medical coat, stethoscope, round glasses, kind smile, #B5DDE5 outfit
Character 2: Nurse - light cyan uniform, medical cap, rosy cheeks, waving hand, #A8D8C8 outfit
Character 3: Patient (adult) - casual clothes, slightly worried expression turning hopeful, #FDE3D1 blush
Character 4: Child patient - big head, small body, bright curious eyes, brave smile, colorful clothes
Character 5: Gut bacteria (good) - small round colorful cute characters, different shapes/colors representing diverse flora

[STYLE RULES BOX]:
- Line width: 6-8px rounded thick strokes, soft and friendly
- Eye style: Bean eyes with white highlight dot, black pupil center
- Cheek: Soft peach blush oval on both cheeks
- Body proportion: Big head (50% of body height), small body, tiny limbs
- Shadow: NONE - flat design only
- Border: None, pure shapes

[COLOR PALETTE REFERENCE]:
Primary blue: #B5DDE5 (light lake blue)
Accent cyan: #A8D8C8 (soft teal)
Warm gold: #E8C98A (soft earth yellow)
Peach blush: #FDE3D1 (warm peach)
Deep text: #4A5E6F (gray-blue for text/icons)
White: #FFFFFF (background and text on dark)
Background: #F7FBFC (very light blue-cream)

[DECORATION REFERENCE]:
- Bubbles: Semi-transparent light blue circles, sizes 15px-60px
- Particles: Tiny white dots with soft glow
- Medical icons: Shield, cross, heart - #4A5E6F color, rounded style
- No shadows, no gradients, flat only

[LINE STYLE SAMPLE]:
Draw a sample line style card showing: thick rounded lines, no sharp corners,
all elements connected smoothly, children's illustration aesthetic

[BRAND AREA] Small: hospital icon + "苏州市立医院" in small Chinese text bottom left
[VERSION] "v6.0 Style Guide" bottom right in small grey text

保存路径/workspace/fmt_style_guide.png

后续所有图片Prompt开头必须附上风格锚定指令

[FOLLOW STYLE GUIDE: /workspace/fmt_style_guide.png]
Style rules: chibi characters, thick rounded lines, flat design, pastel palette as defined.
Use the same character designs, line width, eye style, and color palette as the reference.

三、批量生成模式(新增·改善2)

批量指令格式

用户可用以下任一格式一次性提交多页需求:

格式A:范围型

批量生成第3-7页,主题分别是:肠道菌群失衡、临床表现、诊断标准、治疗方案、康复管理

格式B:列表型

批量生成:p1=肠菌移植概述,p2=适应证,p3=禁忌证,p4=供体筛选,p5=制备流程

格式C:全文型

为《FMT肠菌移植科普.docx》全文生成插画,共20页,每页独立生成

批量处理流程(自动执行)

Step 1: 解析需求 → 生成页面列表 [p1,p2,...pN]
Step 2: 检查是否已有风格参考图
         - 无 → 先生成风格参考图(仅1次)
         - 有 → 跳过
Step 3: 逐页生成(每页独立):
         3a. 构建Prompt(锚定风格参考图)
         3b. image_synthesize → 4K PNG
         3c. 质量核查(见第四章)
         3d. 若通过 → gen_videos → 1080P视频
         3e. 若中文不准 → 策略B(PIL叠加)
         3f. 整理到 /workspace/fmt_pages/p[N]_主题/
Step 4: 完成后汇总报告

页面输出目录结构

/workspace/fmt_pages/
├── style_guide.png                    ← 风格参考图(只生成1次)
├── p01_肠菌移植概述/
│   ├── static.png                     ← 4K静态插画
│   ├── HD.mp4                         ← 1080P动态视频
│   └── metadata.json                  ← 页面元数据
├── p02_适应证/
│   └── ...
└── ...

metadata.json格式

{
  "page": 1,
  "title": "肠菌移植概述",
  "status": "completed",
  "static_generated": "2026-05-12T12:00:00",
  "video_generated": "2026-05-12T12:02:00",
  "chinese_quality": "pass",
  "notes": "中文清晰准确"
}

四、质量核查清单(新增·改善3)

每张图生成后、发送给用户前,必须执行以下核查:

核查A:图片清晰度

  • 图片尺寸 ≥ 4096px 宽(4K)
  • 无明显模糊区域
  • 线条边缘锐利,无锯齿

核查B:中文文字(策略A使用时)

  • 标题文字清晰可读,无乱码
  • 文字与背景对比度高(白字蓝底 / 深字白底)
  • 字号足够大(标题≥72px,正文≥48px)
  • 无文字错误或缺字

核查C:三层构图

  • 前景层:存在装饰粒子/气泡(5-8个)
  • 中景层:存在多角色互动(≥2人)+ 核心场景
  • 背景层:存在场景延伸,非空白

核查D:风格一致性

  • 与风格参考图(fmt_style_guide.png)风格一致
  • 角色外观符合参考图规范
  • 配色在标准色板范围内

核查E:动态视频(生成后)

  • 视频时长6秒
  • 分辨率1080P
  • 文字/图标/编号全程可见,未消失
  • 动效柔和,无剧烈抖动

核查结果处理:

  • 全部通过 → 发送给用户
  • 任一项失败 → 记录失败项 → 重新生成或切换策略B → 重新核查

五、降级策略(新增·改善4)

降级路径总览

image_synthesize 失败 → nano-banana-pro 备选
gen_videos 失败 → batch_image_to_video 降级768P
中文不准 → 策略B:PIL叠加字层
风格漂移 → 重新锚定风格参考图

详细降级规则

降级1:静态图生成失败

触发条件:image_synthesize 返回错误或图片质量极差(无法识别内容)
降级动作:切换到 nano-banana-pro
  uv run ~/.codex/skills/nano-banana-pro/scripts/generate_image.py \
    --prompt "[原Prompt精简版,不含resolution参数]" \
    --filename "/workspace/fmt_page_[N]_[主题].png" \
    --resolution 4K
修复后:继续原流程

降级2:动态视频生成失败

触发条件:gen_videos 返回错误或输出文件损坏
降级动作:切换 batch_image_to_video
  batch_image_to_video({
    count: 1,
    image_file_list: ["/workspace/fmt_page_[N]_[主题].png"],
    output_file_list: ["/workspace/fmt_page_[N]_[主题]_HD.mp4"],
    prompt_list: ["[动态Prompt精简版,不含分辨率指令]"],
    duration_list: [6],
    resolution_list: ["768P"]
  })
修复后:告知用户视频为768P备选版本

降级3:中文不准(PIL叠加字层)

触发条件:image_synthesize 生成的中文出现错误/乱码/模糊
修复动作:
  Step A: 生成无文字版本(在原Prompt后追加 "NO TEXT, NO CHINESE CHARACTERS")
  Step B: 使用Python PIL叠加中文字层
    python3 -c "
    from PIL import Image, ImageDraw, ImageFont
    img = Image.open('/workspace/fmt_page_[N]_[主题].png').convert('RGBA')
    w, h = img.size
    scale = w / 4096  # 4K基准
    # 标题栏
    bar = Image.new('RGBA', (w, int(h*0.10)), (181,205,214,245))
    img.paste(bar, (0,0), bar)
    draw = ImageDraw.Draw(img)
    try:
        font_title = ImageFont.truetype('/usr/share/fonts/opentype/noto/NotoSansCJK-Regular.ttc', int(64*scale))
        font_sub = ImageFont.truetype('/usr/share/fonts/opentype/noto/NotoSansCJK-Regular.ttc', int(36*scale))
    except:
        font_title = ImageFont.load_default()
        font_sub = ImageFont.load_default()
    draw.text((w//2, int(h*0.05)), '[标题文字]', fill=(255,255,255,255), font=font_title, anchor='mm')
    draw.text((w//2, int(h*0.08)), '[副标题文字]', fill=(200,220,240,255), font=font_sub, anchor='mm')
    img.convert('RGB').save('/workspace/fmt_page_[N]_[主题]_cn.png')
    "
  Step C: 用 _cn.png 作为基础重新调用 gen_videos

降级4:风格漂移

触发条件:多张图风格不一致(角色外观、线条粗细、配色偏差)
修复动作:
  重新调用 image_synthesize,在Prompt开头强制附上:
  "[MUST FOLLOW STYLE GUIDE: /workspace/fmt_style_guide.png]"
  "Exact same line width, eye style, body proportion, color palette"

六、中文字解决策略(核心)

策略A:AI直出中文(首选)

关键技巧:

  1. 用英文括号包裹中文:"肠菌移植" in bold white Chinese
  2. 强化指令:"render Chinese text accurately, crisp and clear"
  3. 指定字体风格:"Chinese text in clean sans-serif font, thick strokes"
  4. 文字放在色块背景上,白字蓝底,对比度最高
  5. 字号占图高3-5%(4K图上约120-200px)

配色对比强化中文清晰度:

背景色文字色效果
#B5DDE5(浅湖蓝)#FFFFFF(白色)✅ 最清晰
#4A5E6F(深灰蓝)#FFFFFF(白色)✅ 清晰
#E8C98A(金色)#4A5E6F(深色)✅ 清晰
#FFFFFF(白色)#4A5E6F(深色)⚠️ 可用
透明/渐变任意颜色❌ 模糊,禁止

七、静态插画生成规范(4K超清)

构图层次要求

[前景层 - FOREGROUND] 5-8个元素
- 彩色气泡(半透明浅蓝色,多种尺寸)
- 光点粒子(白色微光点)
- 小装饰图标(医疗符号、爱心、星标)

[中景层 - MIDDLE GROUND] 核心内容层
- 主要角色(3-5个Q版卡通人物互动)
- 核心场景主体(器官/设备/流程道具)
- 信息色块(带中文标题的色块卡片)
- 编号圆圈①②③标注流程步骤

[背景层 - BACKGROUND]
- 场景延伸(肠道内壁纹理/医院走廊/自然环境)
- 渐变色调(从浅到深,暗示景深)

视觉风格规范

属性要求
风格数字手绘插画(Digital Hand-drawn Illustration)
人物Q版大头,豆豆眼(深色+白色高光),淡淡腮红,粗圆润线条
线条柔和圆润,6-8px粗,无锐角,无阴影
色彩低饱和马卡龙色系,纯扁平无阴影
背景极浅纯色(#F7FBFC),充足留白
画布16:9,4K(4096px宽),横版

配色方案

颜色色值用途
浅湖蓝#8ECCD6 / #B5DDE5主色调/标题栏
浅土黄#D4A96A / #E8C98A强调、点缀
沙色#F2E4C9 / #F5E8D3背景辅助
淡青#A8D8C8 / #B8E4D4辅助色调
雾蓝#D6EAF0 / #E8F4FA装饰、留白
深灰蓝#4A5E6F中文字、图标底色
白色#FFFFFF背景/白字底色

八、动态视频生成规范(1080P+文字保留)

动效Prompt结构

[FOLLOW STYLE GUIDE: /workspace/fmt_style_guide.png]
The chibi cartoon medical illustration comes gently to life with rich layered motion.

[LAYER 1 - BACKGROUND ANIMATION]:
- Background texture gently breathes with soft color waves, subtle parallax drift
- Organic gut-wall pattern slowly shifts, very subtle

[LAYER 2 - MIDDLE GROUND ANIMATION]:
- [核心角色] gently bob up and down, arms wave slowly, smile widens gradually
- [主要场景] twinkle with warm glow, pulse gently, never disappear
- [流程步骤] ①②③ numbered circles pulse in gentle sequence
- [中文字区域] stable with soft breathing glow, text always visible

[LAYER 3 - FOREGROUND ANIMATION]:
- Bubbles float upward slowly with slight wobble
- Light particles twinkle and sparkle gently
- Small medical icons pulse with warm glow

[CRITICAL - TEXT STABILITY]:
- Every text element, Chinese label, number badge remains FIXED in position
- NO text fades, disappears, or becomes blurry at any point
- Text may have gentle shimmer/glow but must always remain legible
- Chinese characters stay crisp and stable throughout entire animation
- All ①②③ circles stay clearly visible with steady light pulse

[ANIMATION QUALITY]:
- High definition 1080P, ultra crisp and clear
- Gentle looping animation, dreamy and warm, never frantic
- Dreamy atmosphere, educational and friendly tone
Duration: 6 seconds. All characters and text in place throughout.

九、中文优化Prompt片段库

标题型

[TOP BANNER] Full-width #B5DDE5 light blue banner at top:
"肠菌移植" in large bold white Chinese, very large font, crisp and clear
"重建肠道健康" in smaller white Chinese below
Chinese font: bold rounded sans-serif, thick strokes, white-on-blue high contrast

步骤标注型

[STEP INDICATORS] Three numbered circles:
Circle 1: "① 评估" white text on #4A5E6F dark blue circle
Circle 2: "② 供体" white text on #B5DDE5 light blue circle
Circle 3: "③ 移植" white text on #E8C98A gold circle
All: bold font, crisp Chinese, high contrast, evenly spaced

对话气泡型

[SPEECH BUBBLE] White rounded rectangle, soft #B5DDE5 border:
"医生:您好,我来为您详细说明。"
in #4A5E6F dark blue Chinese text, 48px on 4K image, clean rounded font

信息卡片型

[INFO CARDS] Three cards with colored top strips:
Card top strip: #B5DDE5, body: white
Title "适用人群" in #4A5E6F, large number "3" bold, small Chinese description
All Chinese: bold, high contrast, minimum 48px

十、构图模板(4K版)

模板A:中央主体+三层场景

[前景] 光点粒子 + 气泡漂浮(5-8个)
[中景] 大型核心主体(器官/设备)+ 2-3个卡通人物围绕
       顶部大标题:「肠菌移植」(白字蓝底,≥72px)
[背景] 渐变肠道纹理/医院场景延伸
[底部] 医院图标(无中文)

模板B:流程叙事型(三步骤)

[顶部] 标题栏:「肠菌移植流程」(白字蓝底色块)
[左侧] 步骤①:角色+①②③圆形标注
[中间] 步骤②:核心互动场景(多角色)
[右侧] 步骤③:结果展示(图标+色块)
[背景] 肠道壁纹理连续延伸
[前景] 气泡/粒子装饰层

模板C:多角色互动型

[顶部] 标题:「肠菌移植」(白字蓝底大字)
[中央] 3-5人卡通人物互动
[四周] 设备/图标/流程框辅助
[背景] 丰富场景(肠道壁+装饰)
[前景] 气泡+光效
[底部] 医院图标

十一、输出规范

类型路径规格
风格参考图/workspace/fmt_style_guide.png16:9,4K
批量输出根目录/workspace/fmt_pages/子目录结构
静态插画fmt_pages/p[NN]_[主题]/static.png16:9,4K,PNG
高清动态视频fmt_pages/p[NN]_[主题]/HD.mp416:9,1080P,6秒,MP4
页面元数据fmt_pages/p[NN]_[主题]/metadata.jsonJSON

更新于:2026-05-12 v6.0 — 新增:风格参考图锚定+批量生成+质量核查+降级策略

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

FMT视频制作工具 v1.0

FMT肠菌移植科普视频制作工具(视频合成+字幕+SRT+配音TTS+片头片尾)。将静态图片/动态视频素材合成为完整科普视频,支持中文字幕、旁白配音、片头片尾合成。触发词:合成视频/添加字幕/视频配音/制作FMT视频。

Registry SourceRecently Updated
00Profile unavailable
General

PPT to Video Generator

Convert PowerPoint presentations into narrated videos with Chinese voiceover, synchronized subtitles, and page-by-page audio sync. Use this skill when the us...

Registry SourceRecently Updated
00Profile unavailable
General

URL to Video Generator

Convert any website into a promotional video using Remotion + React. Use when the user wants to generate a video from a website URL, create a promo video, ma...

Registry SourceRecently Updated
00Profile unavailable
General

Health Guide

의료/건강 상담 스킬. 10개 인텐트 라우팅(Symptom Check~Beginner Guide), hira-hospital 병원 연계 + 국가건강정보포털 API + 식약처 의약품 API 기반 Flash+Deep-Dive 2겹 리포트. 트리거: "증상", "어디 아파", "진료과"...

Registry SourceRecently Updated
9160Profile unavailable