rapidocr

Extract text from local image files with RapidOCR. Use when the user wants OCR on a JPG, PNG, WEBP, BMP, or TIFF image and may want plain text or JSON output.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "rapidocr" with this command: npx skills add rapidai/rapidocr

RapidOCR

ClawHub skill for local image OCR with RapidOCR.

When to use

  • The user wants text extracted from a local image file.
  • The input is a local png, jpg, jpeg, webp, bmp, tif, or tiff file.
  • The user wants either plain text output or structured JSON output.

Do not use

  • Remote image URLs.
  • PDF OCR.
  • Relative paths when an absolute path is available.

Execution

  1. Pass the user's original request directly to the wrapper script.
  2. Run node "{baseDir}/run_rapidocr.js" "{{input}}".
  3. If the host does not substitute {baseDir}, resolve the directory containing this SKILL.md and run the sibling file run_rapidocr.js from there.
  4. For local testing, RAPIDOCR_PYTHON=/path/to/python can be used to force a specific interpreter.
  5. Otherwise the wrapper auto-discovers python3, python, or py.
  6. If the user asks for JSON, preserve the script's JSON output exactly.
  7. If dependencies are missing, tell the user to run <python> -m pip install rapidocr onnxruntime with the interpreter they intend to use.
  8. Publish this folder to ClawHub with slug rapidocr.

Behavior

  • The wrapper extracts a local image path from natural language, JSON-like input, or a direct path.
  • The wrapper only reads existing local image files.
  • Default output is plain text, one recognized line per line.
  • JSON mode returns text, lines, boxes, scores, and source.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

GigaChat (Sber AI) Proxy

Integrate GigaChat (Sber AI) with OpenClaw via gpt2giga proxy

Registry SourceRecently Updated
3600smvlx
General

TencentCloud Video Face Fusion

通过提取两张人脸核心特征并实现自然融合,支持多种风格适配,提升创意互动性和内容传播力,广泛应用于创意营销、娱乐互动和社交分享场景。

Registry SourceRecently Updated
General

TencentCloud Image Face Fusion

图片人脸融合(专业版)为同步接口,支持自定义美颜、人脸增强、牙齿增强、拉脸等参数,最高支持8K分辨率,有多个模型类型供选择。

Registry SourceRecently Updated
General

YoudaoNote News

有道云笔记资讯推送:基于收藏笔记分析关注话题,推送最新相关资讯。支持对话触发与每日定时推送(如早上9点)。触发词:资讯推送、设置资讯推送、生成资讯推送。

Registry SourceRecently Updated
1.5K1lephix