deepseek-ocr

OCR text recognition using DeepSeek-OCR model. Use when user asks for OCR, text recognition, image text extraction, screenshot recognition, or converting images to text/markdown.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "deepseek-ocr" with this command: npx skills add qianjunye/ucloud-deepseek-ocr

DeepSeek OCR

Recognize text in images using the DeepSeek-OCR model.

Quick start

{baseDir}/scripts/ocr.sh /path/to/image.jpg

Usage

{baseDir}/scripts/ocr.sh <image_path> [output_format]

Parameters:

  • <image_path>: Local image file (jpg, png, webp, gif, bmp)
  • [output_format]: Optional, defaults to markdown. Can be text, json, etc.

Examples

# Convert to markdown (default)
{baseDir}/scripts/ocr.sh /path/to/image.jpg

# Convert to plain text
{baseDir}/scripts/ocr.sh /path/to/image.png text

# Extract table as JSON
{baseDir}/scripts/ocr.sh /path/to/table.jpg "extract table as json"

Remote URL images

The model only supports base64-encoded images. For remote URLs, download first:

curl -s -o /tmp/image.jpg "https://example.com/image.jpg"
{baseDir}/scripts/ocr.sh /tmp/image.jpg

API key

Set DEEPSEEK_OCR_API_KEY, or configure in ~/.openclaw/openclaw.json:

{
  skills: {
    "deepseek-ocr": {
      apiKey: "YOUR_KEY_HERE",
    },
  },
}

Default API URL: https://api.modelverse.cn/v1/chat/completions Override with DEEPSEEK_OCR_API_URL if needed.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Zoom

Zoom API integration with managed OAuth. Manage meetings, webinars, recordings, and user profiles. Use this skill when users want to schedule meetings, manag...

Registry SourceRecently Updated
General

Kleinanzeigen.de Helper

Erstelle und verwalte Verkaufsanzeigen speziell auf kleinanzeigen.de. Verwende diesen Skill wenn der Human sagt, er will etwas auf kleinanzeigen.de verkaufen...

Registry SourceRecently Updated
General

Poku

Sends and receives phone calls and messages (like SMS, WhatsApp, Slack), and reserves dedicated phone numbers using the Poku API. Example use cases: calling...

Registry SourceRecently Updated
General

IMAP/SMTP Email - Maddy Fix

Read and send email via IMAP/SMTP. Check for new/unread messages, fetch content, search mailboxes, mark as read/unread, and send emails with attachments. Sup...

Registry SourceRecently Updated