ocr

OCR skill for extracting text from images and PDFs. Use when you need to read text from screenshots, photos, scanned documents, or any image file. Supports Chinese, English, and 100+ languages.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "ocr" with this command: npx skills add mr-shaper/opencode-skills-paddle-ocr/mr-shaper-opencode-skills-paddle-ocr-ocr

OCR Skill

Usage

To extract text from an image or PDF, run:

python3 "/Users/mrshaper/Library/Application Support/com.differentai.openwork/workspaces/starter/.opencode/skills/paddle-ocr/scripts/ocr.py" "/path/to/image.png"

Options

OptionDescription
--prompt "text"Custom prompt (e.g., "Extract table as markdown")
--fastUse faster PaddleOCR instead of DeepSeek-OCR
--jsonOutput as JSON format

Examples

# Basic OCR
python3 scripts/ocr.py image.png

# Extract table as markdown
python3 scripts/ocr.py table.png --prompt "Extract this table as markdown"

# Fast mode
python3 scripts/ocr.py image.png --fast

# PDF OCR
python3 scripts/ocr.py document.pdf

Supported Formats

Images: PNG, JPG, JPEG, BMP, GIF, WEBP, TIFF Documents: PDF

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

ocr

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

ocr

No summary provided by upstream source.

Repository SourceNeeds Review
General

ocr

No summary provided by upstream source.

Repository SourceNeeds Review