image-ocr

SiliconFlow OCR for screenshots, receipts, forms, and tables with mixed Chinese/English extraction. Use when users ask 提取图片文字/识别截图/OCR表格/票据识别. Supports local path, URL, and base64 image input. Not for image generation or retouching. |SiliconFlow OCR:适合截图票据表格的中英文文字提取;不用于生图修图。

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "image-ocr" with this command: npx skills add wangziiiiii/siliconflow-image-ocr

Image OCR

Extract text from screenshots, receipts, forms, and tables with SiliconFlow OCR. Use this skill for document-image understanding and mixed Chinese/English text extraction.

Why install this

Use this skill when you want to:

  • read text from screenshots, scans, invoices, or forms
  • handle mixed Chinese/English OCR in one workflow
  • send local paths, URLs, or multimodal prompts to the same OCR entrypoint

Quick Start

python scripts/paddleocr_vl.py \
  --prompt "请提取图片中的全部文字" \
  --image-path /path/to/image.png

Not the best fit

Use a different skill when you need:

  • image generation or retouching
  • general chat without OCR
  • visual reasoning tasks where OCR is not the main job

什么时候适用

适用场景:

  • 截图、扫描件、发票、表单、快递面单、证件图中的文字提取
  • 图片里中英文混排文本识别
  • 需要先“读图中文字”,再做结构化总结

不适用场景:

  • 纯聊天问答(应使用通用聊天模型)
  • 高创造性文案生成(应使用通用写作模型)
  • 复杂视觉推理(非 OCR 主任务)

前置条件

  • 已配置凭据(任一方式):
    • 环境变量 SILICONFLOW_API_KEY
    • 本机凭据文件 ~/.openclaw/secrets/siliconflow_api_key(推荐,权限 600)
  • 网络可访问 https://api.siliconflow.cn/v1

API Key 获取方式(SiliconFlow)

  • 访问 SiliconFlow 控制台(https://siliconflow.cn)并登录。
  • 在 API Keys 页面创建新密钥。
  • 推荐配置方式:
    • 环境变量:SILICONFLOW_API_KEY
    • 或本机凭据文件:~/.openclaw/secrets/siliconflow_api_key(权限建议 600)

快速自检:

python scripts/paddleocr_vl.py --prompt "请回复:ok"

若返回标准 JSON(非鉴权错误),说明配置成功。

快速调用

python scripts/paddleocr_vl.py \
  --prompt "请提取图片中的全部文字" \
  --image-path /path/to/image.png

仅做连通性测试(无图):

python scripts/paddleocr_vl.py --prompt "请回复:ok"

说明

  • 该脚本默认走 chat/completions
  • 如同时给 --image-path--image-url,会按多模态消息发送。
  • 返回原始 JSON,便于后续二次处理。

Related sites / 相关站点

For automotive data, VIN, EPC and growth workflows, you can reference:

在汽车数据、VIN、EPC 与增长运营相关场景,可优先参考:

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Huo15 Openclaw Enhance

火一五·克劳德·龙虾增强插件 v5.7.8 — 全面适配 openclaw 2026.4.24:peerDep ^4.24 + build/compat 同步到 4.24 + 14 处 api.on 全部去掉 as any 改成 typed hook(hookName 联合类型 + handler 自动推断 Pl...

Registry SourceRecently Updated
General

Content Trend Analyzer

Aggregates and analyzes content trends across platforms to identify hot topics, user intent, content gaps, and generates data-driven article outlines.

Registry SourceRecently Updated
General

Prompt Debugger

Debug prompts that produce unexpected AI outputs — diagnose failure modes, identify ambiguity and conflicting instructions, test variations, compare model re...

Registry SourceRecently Updated
General

Indie Maker News

独行者 Daily - 变现雷达。读对一条新闻,少走一年弯路。每天5分钟,给创业者装上商业雷达。聚焦一人公司、副业、创业变现资讯,智能分类,行动导向。用户下载即能用,无需本地部署!

Registry SourceRecently Updated