vision-action-evolution-loop

视觉-动作-进化闭环框架 —— 将感知、规划、执行、评估、进化五阶段融合为自迭代认知循环

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "vision-action-evolution-loop" with this command: npx skills add kingofzhao/vision-action-evolution-loop

Vision-Action-Evolution Loop Skill

元数据

字段
名称vision-action-evolution-loop
版本1.0.0
作者KingOfZhao
发布日期2026-03-31
置信度96%

核心哲学

认知世界的本质是无穷层级的框架节点。本Skill是两个框架碰撞后的涌现节点:

diepre-vision-cognition (视觉感知)
        ⊗
self-evolution-cognition (自进化)
        ↓
vision-action-evolution-loop (视觉-动作-进化闭环)

五阶段闭环(映射 SOUL 五律)

阶段SOUL 五律技术实现输出
1. Perceive 感知已知 vs 未知2D视觉检测(OpenCV管道)→ 3D空间理解特征图 + 置信度
2. Plan 规划四向碰撞多路径碰撞(VLA/2D→3D/工具增强)→ 选最优动作序列
3. Execute 执行执行不表演机器人臂抓取/折叠/装配物理动作
4. Evaluate 评估人机闭环质检视觉复检 + 人类确认通过/退回 + 反馈
5. Evolve 进化文件即记忆更新世界模型 + 调整参数新认知节点

三阶段桥接架构(非端到端)

Stage 1: 2D Detection (已实现)
  手机照片 → 透视矫正 → 二值化 → 线条检测 → SVG/DXF
  [diepre-vision-cognition]

Stage 2: 3D Spatial Understanding (桥接层)
  2D线条 → 参数化3D → 空间坐标映射 → 折叠顺序推理
  [参考文献: arXiv:2412.11892]

Stage 3: Action Planning (动作规划)
  3D模型 → 抓取点计算 → 力控参数 → 动作序列生成
  [参考文献: arXiv:2510.11027, arXiv:2510.17111]

工具增强策略

不是用VLA端到端替换现有管道,而是将OpenCV管道封装为可调用工具:

# 现有管道封装为工具
tools = {
    "detect_dieline": diepre_vision.analyze,       # 2D检测
    "correct_perspective": opencv.correct_perspective, # 透视矫正
    "generate_dxf": vectorizer.to_dxf,               # 矢量化
    "estimate_3d": spatial_estimator.from_2d,         # 3D估算
    "plan_grasp": grasp_planner.calculate,            # 抓取规划
}
# VLA模型调用这些工具,而非自己做所有事

学术参考文献

  1. Vlaser: Synergistic Embodied Reasoning — 具身推理VLA模型,动作规划的理论基础
  2. Efficient VLA Models for Embodied Manipulation — VLA高效优化,适合本地部署
  3. From 2D CAD to 3D Parametric via VLM — 2D→3D桥接层的核心技术
  4. SAGE: Multi-Agent Self-Evolution — 四Agent闭环=闭环迭代的学术对应
  5. Tool-Augmented VLLMs for CAD (ICCV 2025) — 工具增强策略的理论支撑
  6. Self-evolving Embodied AI — 记忆自更新+任务自切换+模型自进化

安装命令

clawhub install vision-action-evolution-loop
# 或手动安装
cp -r skills/vision-action-evolution-loop ~/.openclaw/skills/

调用方式

from skills.vision_action_evolution_loop import VisionActionEvolutionLoop

loop = VisionActionEvolutionLoop(workspace=".")

# 单次闭环
result = loop.run_cycle(
    image_path="path/to/box_photo.jpg",
    known=["2D检测已验证6/6", "Bobst±0.15mm精度"],
    unknown=["3D折叠顺序", "力控参数优化"]
)

# result 包含五个阶段输出
print(result.perception.confidence)   # 感知置信度
print(result.plan.action_sequence)     # 动作序列
print(result.evolution.new_knowledge)  # 新增认知

# 持续进化(多次迭代)
for i in range(10):
    result = loop.run_cycle(...)
    loop.inject_feedback(result.evolution.feedback)
    # 每次迭代都会更新内部世界模型

与其他 Skill 的关系

self-evolution-cognition (父节点: 自进化框架)
    ├── vision-action-evolution-loop (本Skill: 视觉-动作-进化)
    │       └── diepre-vision-cognition (子节点: 2D视觉检测)
    └── human-ai-closed-loop (兄弟节点: 人机闭环)

arxiv-collision-cognition (交叉引用: 论文碰撞输入)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

Embodied Ai News

Aggregates publicly available Embodied AI and Robotics news from curated sources (robotics media, arXiv, company blogs). Delivers structured briefings on hum...

Registry SourceRecently Updated
6212Profile unavailable
General

DiePre Vision Cognition

DiePre 视觉认知 Skill —— 将包装/模切机器视觉感知与 SOUL 推理融合的认知框架

Registry SourceRecently Updated
910Profile unavailable
Automation

Solo Hub Guide

Interactive step-by-step tutor for Solo Hub — guides a human through account setup, model browsing, team management, credits, and fine-tuning (LLM and VLA) u...

Registry SourceRecently Updated
1281Profile unavailable
Automation

具身智能前沿追踪系统

具身智能领域前沿动态追踪与视频素材采集系统;覆盖顶会论文(ICRA/IROS/CoRL/CVPR/NeurIPS)、开源项目、实验室动态;优先采集有Demo视频的爆款工作;生成含发布时间/主页/代码/视频链接的结构化日报,支持视频号内容创作

Registry SourceRecently Updated
770Profile unavailable