pic2md

Image to Markdown - extract text from images (PNG, JPG, WebP) to Markdown with OCR. Use when reading text from screenshots, photos, scanned pages, or any image file.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pic2md" with this command: npx skills add tanis90/pic2md

Pic2MD - Picture to Markdown Parser

Extract text from images to Markdown using MinerU Open API. No API key required.

Quick Start

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract screenshot.png

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract https://example.com/image.png

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract photo.jpg -o ./output/

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract scan.jpg --language en

Language Rule

You MUST reply to the user in the SAME language they use. This is non-negotiable.

Capabilities

OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
Supports both local files and URLs directly
Language hint with --language (default: ch, use en for English)
No API key, no signup, no authentication
Max 10MB per image

When to Use

User asks to "read", "extract", or "OCR" an image
User shares a screenshot and asks what it says
User wants text from a photo of a document or whiteboard
User needs image content converted to Markdown

CLI Reference

Run mineru-open-api flash-extract --help for all available options.

Data Privacy

flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.
Images are processed in real-time and are not stored after extraction.
For details, see https://mineru.net

Notes

Output is Markdown text extracted via OCR
For higher precision or batch processing, use mineru-open-api extract (requires auth via mineru-open-api auth)
If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Gigo Lobster Resume

🦞 GIGO · gigo-lobster-resume: 续跑入口：v2 stable 当前会清理旧 checkpoint 并从头重跑；保留此 slug 作为旧 checkpoint 兼容入口。 Triggers: 继续试吃 / 恢复评测 / resume tasting / continue lobster...

Registry SourceRecently Updated

3160gigolab

General

YiHui CONTEXT MODE

context-mode is an MCP server that saves 98% of your context window by sandboxing tool outputs. It routes large file reads, shell outputs, and web fetches th...

Registry SourceRecently Updated

001yihui

General

xinyi-drink

Use when users ask about 新一好喝/新一咖啡 drinks, stores, menu, activities, Skill用户大礼包, today drink recommendations, afternoon tea, feeling sleepy, or personalized...

Registry SourceRecently Updated

2710domilin

General

vedic-destiny

吠陀命盘分析中文入口。用于完整命盘研判、命主盘 Rashi chart 与九分盘 Navamsha chart 联读、既往事件回看、出生时间稳定度判断、事业主题、婚姻主题、时空盘专题，以及基于 Jagannatha Hora PDF、星盘截图或文本命盘数据的系统拆盘。当用户提到完整星盘、事业方向、婚姻问题、关系窗...

Registry SourceRecently Updated

00seanding1998