image-ocr-parse

Extract text from images via the PDFAPIHub cloud OCR API. Images are uploaded to pdfapihub.com for Tesseract OCR processing. Supports preprocessing (grayscale, sharpen, threshold, resize) and word-level bounding boxes. Requires a CLIENT-API-KEY header.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "image-ocr-parse" with this command: npx skills add rishabhdugar/image-ocr-parse

Image OCR Parse

What It Does

Extracts text from images via the PDFAPIHub hosted OCR API. Your image is uploaded to PDFAPIHub servers where Tesseract OCR processes it and returns the extracted text. Supports optional image preprocessing to improve OCR quality on low-resolution or noisy inputs.

When to Use

  • Extract text from photos of receipts, signs, or documents
  • OCR business cards, ID cards, or labels
  • Process low-quality images with preprocessing

Required Inputs

Provide one of:

  • image_url — URL to an image
  • base64_image — base64-encoded image
  • Multipart upload with the image in a form field named file
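
As a rough sketch of the base64_image option above, the helpers below encode a local file and wrap it in a JSON body. The function names are hypothetical, and the sketch assumes the API accepts raw base64 without a data: URI prefix, which the listing does not confirm. The commented curl lines also assume the multipart upload goes to the same endpoint as the JSON example.

```shell
#!/bin/sh
# Hypothetical helper: encode a file as single-line base64.
# Newlines are stripped so the result can sit inside a JSON string.
encode_image() {
  base64 < "$1" | tr -d '\n'
}

# Hypothetical helper: emit a JSON body carrying the encoded image.
# The base64 alphabet contains no characters that need JSON escaping.
build_base64_payload() {
  printf '{"base64_image": "%s", "lang": "eng"}' "$(encode_image "$1")"
}

# Example (not run here): pipe the body to curl, which reads it from stdin.
# build_base64_payload receipt.png | curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
#   -H "CLIENT-API-KEY: your_api_key" -H "Content-Type: application/json" -d @-
#
# Multipart alternative using the "file" field (endpoint assumed to be the same):
# curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
#   -H "CLIENT-API-KEY: your_api_key" -F "file=@receipt.png"
```

Piping the body through stdin keeps large base64 strings off the command line, where they could exceed argument length limits.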

Authentication

This skill calls the PDFAPIHub hosted API at https://pdfapihub.com/api.

Send your API key in the CLIENT-API-KEY header.

Get your free API key at https://pdfapihub.com. Full API documentation is available at https://pdfapihub.com/docs.

Privacy note: Images you process are uploaded to PDFAPIHub's cloud service and the extracted text is returned. Do not send sensitive images unless you trust the service. Files are auto-deleted after 30 days.
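
One way to keep the key out of scripts and shell history is to read it from an environment variable and fail early when it is missing. This is a minimal sketch: the variable name PDFAPIHUB_API_KEY and both function names are assumptions for illustration, not something the service defines.

```shell
#!/bin/sh
# Hypothetical guard: succeed only if the key variable is set and non-empty.
require_api_key() {
  if [ -z "${PDFAPIHUB_API_KEY:-}" ]; then
    echo "PDFAPIHUB_API_KEY is not set" >&2
    return 1
  fi
}

# Hypothetical wrapper: OCR an image by URL, sending the key in the
# CLIENT-API-KEY header. Assumes the URL contains no quotes or backslashes.
ocr_image_url() {
  require_api_key || return 1
  curl -sS -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
    -H "CLIENT-API-KEY: ${PDFAPIHUB_API_KEY}" \
    -H "Content-Type: application/json" \
    -d "{\"image_url\": \"$1\"}"
}

# Example (not run here):
# export PDFAPIHUB_API_KEY=your_api_key
# ocr_image_url "https://pdfapihub.com/sample-invoicepage.png"
```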

Use Cases

  • Receipt Scanning — Extract text from receipt photos for expense tracking
  • Business Card Reader — OCR business card images to extract name, phone, email
  • License Plate Recognition — Extract plate numbers from photos (with char_whitelist)
  • Meter Reading — Extract digits from utility meter photos for automated logging
  • Whiteboard Capture — OCR whiteboard or handwritten note photos into text
  • Product Label Scanning — Extract ingredient lists or nutrition info from product label photos
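
For the license plate and meter reading cases, a request body with char_whitelist might look like the sketch below. The listing names the parameter but not its format, so passing a plain string of allowed characters is an assumption, and build_whitelist_payload is a hypothetical helper.

```shell
#!/bin/sh
# Hypothetical helper: build a JSON body that limits recognition to a set of
# characters. Assumption: char_whitelist takes a plain string of allowed
# characters. $1 = image URL, $2 = allowed characters (no quotes or backslashes).
build_whitelist_payload() {
  printf '{"image_url": "%s", "lang": "eng", "char_whitelist": "%s"}' "$1" "$2"
}

# Example (not run here): digits only, for a utility meter photo.
# build_whitelist_payload "https://example.com/meter.jpg" "0123456789" | \
#   curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
#     -H "CLIENT-API-KEY: your_api_key" -H "Content-Type: application/json" -d @-
```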

Image Preprocessing Options

Param      Default  Description
grayscale  false    Convert to grayscale
sharpen    false    Apply sharpening
threshold  0        Binarization threshold (1–255)
resize     0        Scale factor (max 4x)
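
Checking option values before sending a request can catch mistakes locally. The sketch below validates threshold against the documented range and builds a body that enables grayscale and binarization. Treating 0 as "binarization off" is an assumption drawn from the default value, and both function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical check: accept integers 0-255 only.
# Assumption: 0 (the default) leaves binarization disabled; 1-255 is the documented range.
valid_threshold() {
  case "$1" in
    ''|*[!0-9]*) return 1 ;;   # reject empty or non-numeric input
  esac
  [ "$1" -ge 0 ] && [ "$1" -le 255 ]
}

# Hypothetical helper: $1 = image URL (no quotes or backslashes), $2 = threshold.
build_preprocess_payload() {
  valid_threshold "$2" || { echo "threshold must be an integer 0-255" >&2; return 1; }
  printf '{"image_url": "%s", "grayscale": true, "threshold": %s}' "$1" "$2"
}

# Example (not run here):
# build_preprocess_payload "https://pdfapihub.com/sample-invoicepage.png" 128
```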

Example Usage

curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
  -H "CLIENT-API-KEY: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "image_url": "https://pdfapihub.com/sample-invoicepage.png",
    "lang": "eng",
    "grayscale": true,
    "sharpen": true,
    "detail": "words"
  }'

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
