Pic2MD - Picture to Markdown Parser
Extract text from images to Markdown using MinerU Open API. No API key required.
Quick Start
# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract screenshot.png
# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract https://example.com/image.png
# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract photo.jpg -o ./output/
# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract scan.jpg --language en
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
- Supports both local files and URLs directly
- Language hint with
--language(default:ch, useenfor English) - No API key, no signup, no authentication
- Max 10MB per image
When to Use
- User asks to "read", "extract", or "OCR" an image
- User shares a screenshot and asks what it says
- User wants text from a photo of a document or whiteboard
- User needs image content converted to Markdown
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Privacy
flash-extractuploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.- Images are processed in real-time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- Output is Markdown text extracted via OCR
- For higher precision or batch processing, use
mineru-open-api extract(requires auth viamineru-open-api auth) - If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli