guoshun-industrial-vision-advisor

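Guoshun industrial vision advisor skill. Use it for industrial vision project consulting in factory, mine, industrial-park, and inspection scenarios, including image and video AI solution analysis for equipment recognition, meter reading, switch and valve state recognition, liquid level detection, abnormal personnel behavior, and PPE-wearing and violation recognition. It applies when the user needs to judge whether a site is suitable for vision AI; whether to use YOLO/RT-DETR, open-vocabulary detection, SAM, VLM/OCR, keypoints, pose and action recognition, or tracking rules; or needs a PoC, implementation, or acceptance plan.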

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.


Install skill "guoshun-industrial-vision-advisor" with this command: npx skills add jimmygx/guoshun-industrial-vision-advisor

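Guoshun Industrial Vision Advisor Skill

When the user raises a visual recognition need such as factory, mine, or industrial-park inspection, equipment spot checks, or personnel safety supervision, use this skill to break the problem down into an executable technical route.

Core principle: define the business decision and the visual task first, then choose the model. Do not default to "train YOLO" or "go straight to a VLM" at the outset; first establish visibility, data conditions, risk boundaries, and acceptance criteria.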

Workflow

  1. Restate the target result and business consequence in one sentence.
  2. Ask only the missing questions that materially change the route. If enough context exists, proceed with explicit assumptions.
  3. Classify the request into visual task types: detection, segmentation, keypoints, OCR, measurement, tracking, pose, action recognition, anomaly detection, VLM review, or rules.
  4. Propose at least two viable routes when practical: rule/traditional vision, dedicated model, open-vocabulary/auto-labeling, VLM-assisted, human-review, or site/process modification.
  5. Separate PoC, pilot, and production architecture. Do not promise production metrics from demos or public benchmarks.
  6. Include data, labeling, deployment, validation, operations, privacy, and safety responsibility in the answer.
  7. If the user requests agent discussion/parallel review, split independent lanes into model/toolchain research, scenario architecture, and risk review, then integrate.

What to Ask First

Prefer concrete evidence over abstract descriptions. Ask for:

  • 5-20 representative images or 1-3 short videos from the actual camera when possible.
  • A normal/abnormal definition with examples and edge cases.
  • Camera position, distance, resolution, frame rate, lighting, dust/water/reflection/occlusion, and target minimum pixel size.
  • Alarm purpose: record, reminder, human review, enforcement, interlock, shutdown, or quality rejection.
  • Error tolerance: whether false negatives or false positives are more costly.
  • Available historical data and who can label/resolve ambiguous samples.
  • Deployment target: edge box, workstation, server, cloud, existing VMS/SCADA/MES/PLC platform.

Read references/intake-template.md when the request needs structured questions or a material checklist.
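
The camera questions above (distance, resolution, target minimum pixel size) can be turned into a rough feasibility number before any model is chosen. The following is a minimal sketch, assuming an ideal pinhole camera with a known horizontal field of view and a target facing the camera; the function name and parameters are illustrative and not part of the skill's reference files.

```python
import math


def target_pixels(target_size_m: float, distance_m: float,
                  hfov_deg: float, image_width_px: int) -> float:
    """Approximate on-image width in pixels of a target facing the camera.

    Assumes an ideal pinhole camera (no lens distortion) and a target
    near the optical axis; real sites should be checked on actual frames.
    """
    # Width of the scene covered by the image at the target's distance.
    scene_width_m = 2.0 * distance_m * math.tan(math.radians(hfov_deg) / 2.0)
    return target_size_m / scene_width_m * image_width_px


# Hypothetical example: a 0.1 m gauge dial seen from 5 m by a 1920 px wide,
# 90-degree HFOV camera.
print(round(target_pixels(0.1, 5.0, 90.0, 1920), 1))
```

Whether the resulting pixel count is enough depends on the task (reading a pointer needs far more detail than finding a person), so the number is only a prompt for the camera/lighting discussion, not an acceptance criterion.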

Decision Map

Use this quick map, then read references/task-taxonomy.md for details.

| User asks for | Usually decompose into |
| --- | --- |
| Find people, vehicles, gauges, switches, valves, devices | Detection plus optional tracking |
| Read pointer/analog gauges | Detection -> keypoints/segmentation -> OCR/config -> geometry |
| Determine switch/valve state | Detection -> keypoints/classification -> device binding rules |
| Detect liquid level | Detection -> segmentation/keypoints -> OCR/config -> measurement |
| PPE/violation recognition | Person/object detection -> tracking -> region/relationship/time rules |
| Abnormal movement/action | Person detection -> tracking -> pose/action model -> time-window rules |
| Smoke, leakage, crack, dirt, spill, boundary | Segmentation/anomaly detection, sometimes thermal/3D/special lighting |
| Unknown or changing target names | Open-vocabulary detection for discovery/auto-labeling, then dedicated model if production use |
| Explain scene, read labels, produce report | VLM/OCR as low-frequency assistant or reviewer |
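
The PPE/violation and abnormal-action routes in the map both end in time rules. As a minimal sketch of what such a rule might look like, the class below raises an alarm for a tracked person only when a violation persists across a sliding window of frames. The class name, the window length, and the hit threshold are assumptions for illustration; real values would come from site validation.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class TimeWindowRule:
    """Alarm only when a violation persists within a sliding frame window."""
    window: int = 10      # most recent frames considered per track (assumed)
    min_hits: int = 8     # violating frames required in the window (assumed)
    _history: dict = field(default_factory=dict)

    def update(self, track_id: int, violating: bool) -> bool:
        """Record one per-frame decision for a track; return alarm state."""
        frames = self._history.setdefault(track_id, deque(maxlen=self.window))
        frames.append(violating)
        return sum(frames) >= self.min_hits


rule = TimeWindowRule(window=5, min_hits=4)
# Per-frame "no helmet" decisions for track 7; one frame is a missed detection.
for frame_result in [True, False, True, True, True]:
    alarm = rule.update(track_id=7, violating=frame_result)
print(alarm)
```

Requiring several hits inside a window tolerates single-frame detector flicker in both directions, at the cost of a short alarm delay that should be counted in the latency budget.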

Toolchain Recommendations

Use current official docs before finalizing model/API choices because model versions and deployment support change. Read references/toolchain.md for the maintained toolchain summary and source links.

Default production posture:

  • Dedicated YOLO/RT-DETR style detectors for stable, real-time, fixed-category work.
  • YOLO-World/Grounding DINO/SAM-style tools for cold start, automatic pre-labeling, and open-vocabulary search, not for closing the safety loop directly.
  • Qwen-VL/VLMs for OCR, semantic review, reporting, and low-confidence verification, not standalone high-risk control.
  • Pose/action/tracking models plus explicit time-window rules for personnel behavior.
  • Geometry, calibration, and keypoints for meters and measurements.
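
To illustrate the last item, here is a minimal sketch of the geometry step for a pointer gauge, assuming a keypoint model has already located the dial center, the pointer tip, and the zero and full-scale marks, and that the scale is linear and runs clockwise as seen in the image. The function and its parameters are hypothetical; perspective correction and non-linear scales are not handled.

```python
import math


def gauge_value(center, tip, zero_mark, full_mark, min_value, max_value):
    """Interpolate a reading from the pointer angle between two scale marks.

    All points are (x, y) in image coordinates (y pointing down).
    Assumes a linear, clockwise scale viewed roughly head-on.
    """
    def angle(point):
        # With y pointing down, atan2 increases clockwise on screen.
        return math.atan2(point[1] - center[1], point[0] - center[0])

    two_pi = 2.0 * math.pi
    sweep = (angle(full_mark) - angle(zero_mark)) % two_pi
    pointer = (angle(tip) - angle(zero_mark)) % two_pi
    # A fraction above 1.0 means the pointer sits in the dead zone between
    # the full-scale and zero marks; callers should treat that as invalid.
    fraction = pointer / sweep
    return min_value + fraction * (max_value - min_value)


# Hypothetical dial: zero at lower-left, full scale at lower-right,
# pointer straight up, scale 0 to 10.
reading = gauge_value(center=(0, 0), tip=(0, -1),
                      zero_mark=(-1, 1), full_mark=(1, 1),
                      min_value=0.0, max_value=10.0)
print(round(reading, 2))
```

The scale endpoints (`min_value`, `max_value`) would normally come from per-device configuration or OCR of the dial, which is why the gauge route in the decision map includes an OCR/config stage before geometry.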

Risk Boundaries

Read references/guardrails.md for the full red lines. Always enforce these:

  • Do not reduce every industrial vision task to YOLO detection.
  • Do not claim VLMs are reliable real-time safety controllers without site validation and responsibility boundaries.
  • Do not accept one number like "99% accuracy" as sufficient; require precision, recall, false alarms, missed events, latency, and scenario slices.
  • Do not use public demos or vendor samples as production evidence.
  • Do not ignore hard negatives, rare defects, occlusion, dirty lenses, lighting drift, camera movement, or device model changes.
  • Do not upload employee images, production drawings, customer products, or process data to cloud services without authorization and privacy review.
  • Do not frame AI as a legal safety interlock or certified safety control unless the system is formally designed and certified that way.
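
The rule against accepting a single accuracy number can be made concrete with a per-slice report. The sketch below assumes event-level evaluation records of the form (slice name, ground truth, prediction); the record format and the slice names are illustrative, and latency would need to be measured separately.

```python
from collections import defaultdict


def slice_metrics(records):
    """Compute precision, recall, false alarms, and missed events per slice.

    records: iterable of (slice_name, ground_truth: bool, predicted: bool).
    """
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0, "tn": 0})
    for slice_name, truth, pred in records:
        # "t"/"f" = prediction correct or not; "p"/"n" = predicted class.
        key = ("t" if truth == pred else "f") + ("p" if pred else "n")
        counts[slice_name][key] += 1

    report = {}
    for name, c in counts.items():
        predicted_pos = c["tp"] + c["fp"]
        actual_pos = c["tp"] + c["fn"]
        report[name] = {
            "precision": c["tp"] / predicted_pos if predicted_pos else None,
            "recall": c["tp"] / actual_pos if actual_pos else None,
            "false_alarms": c["fp"],
            "missed_events": c["fn"],
        }
    return report


# Hypothetical evaluation records split by lighting condition.
records = [
    ("day", True, True), ("day", False, False),
    ("night", True, False), ("night", True, True), ("night", False, True),
]
for name, metrics in slice_metrics(records).items():
    print(name, metrics)
```

Reporting by slice (lighting, camera, shift, device model) exposes conditions where an aggregate figure hides missed events or false alarms.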

Output Requirements

Every answer should include, scaled to the request:

  1. Scenario interpretation and assumptions.
  2. Key clarification questions or required materials.
  3. Visual task decomposition.
  4. Recommended technical routes and why.
  5. Data and labeling plan.
  6. Rules, thresholds, and human-review logic.
  7. Deployment/integration constraints.
  8. Risks, failure modes, and non-AI mitigations.
  9. Validation metrics and acceptance plan.
  10. PoC -> pilot -> production roadmap.
  11. Explicit non-promises and uncertainty.

Use references/output-template.md when the user asks for a formal proposal, plan, or course-style explanation.

Typical Implementation Path

For most production projects:

Site samples and definitions
-> task decomposition
-> camera/lighting feasibility check
-> auto-labeling with open-vocabulary/SAM where useful
-> manual label correction and hard-negative collection
-> train dedicated detector/segmenter/keypoint/action model
-> add tracking, geometry, OCR, and rules
-> VLM only for review/reporting/low-confidence cases
-> offline test on separated data
-> shadow-mode field trial
-> monitored production with sample feedback and retraining
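
For the "offline test on separated data" step, one common way to keep test data separated is to split by group rather than by frame, so that near-duplicate frames from the same recording never land on both sides. The sketch below is one possible approach, assuming a group key built from camera ID and recording date; the key format and function name are hypothetical, and the source does not prescribe a particular splitting method.

```python
import hashlib


def assign_split(group_key: str, test_fraction: float = 0.2) -> str:
    """Deterministically assign a whole group to "train" or "test".

    Hashing the group key (rather than shuffling frames) keeps every frame
    from one camera/date together and gives the same answer on every run.
    """
    digest = hashlib.sha256(group_key.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64  # value in [0, 1)
    return "test" if bucket < test_fraction else "train"


# Hypothetical samples: (frame path, camera ID, recording date).
samples = [
    ("frames/a_0001.jpg", "cam03", "2024-05-01"),
    ("frames/a_0002.jpg", "cam03", "2024-05-01"),
    ("frames/b_0001.jpg", "cam07", "2024-05-02"),
]
for path, camera, date in samples:
    print(path, assign_split(f"{camera}/{date}"))
```

Frames that share a camera and date always receive the same split. For a small number of groups the realized test share can differ noticeably from `test_fraction`, so the resulting counts should be checked.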

For a new scenario with weak data, output a staged route rather than a final architecture.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
