mineru

Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extracting text/tables/formulas from documents, (3) Batch processing multiple files, (4) Saving parsed content to Obsidian or knowledge bases. Supports LaTeX formulas, tables, images, multilingual OCR, and async parallel processing.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "mineru" with this command: npx skills add tsekaluk/mineru-skill

MinerU Document Parser

Convert PDF, Word, PPT, and images to clean Markdown using MinerU's VLM engine — LaTeX formulas, tables, and images all preserved.

Setup

  1. Get free API token at https://mineru.net/user-center/api-token
export MINERU_TOKEN="your-token-here"

Limits: 2000 pages/day · 200 MB per file · 600 pages per file

Supported File Types

TypeFormats
📕 PDF.pdf — papers, textbooks, scanned docs
📝 Word.docx — reports, manuscripts
📊 PPT.pptx — slides, presentations
🖼️ Image.jpg, .jpeg, .png — OCR extraction

Commands

Single File

python3 scripts/mineru_v2.py --file ./document.pdf --output ./output/

Batch Directory with Resume

python3 scripts/mineru_v2.py \
  --dir ./docs/ \
  --output ./output/ \
  --workers 10 \
  --resume

Direct to Obsidian

python3 scripts/mineru_v2.py \
  --dir ./pdfs/ \
  --output "~/Library/Mobile Documents/com~apple~CloudDocs/Obsidian/VaultName/" \
  --resume

Chinese Documents

python3 scripts/mineru_v2.py --dir ./papers/ --output ./output/ --language ch

Complex Layouts (Slow but Most Accurate)

python3 scripts/mineru_v2.py --file ./paper.pdf --output ./output/ --model vlm

CLI Options

--dir PATH          Input directory (PDF/Word/PPT/images)
--file PATH         Single file
--output PATH       Output directory (default: ./output/)
--workers N         Concurrent workers (default: 5, max: 15)
--resume            Skip already processed files
--model MODEL       Model version: pipeline | vlm | MinerU-HTML (default: vlm)
--language LANG     Document language: auto | en | ch (default: auto)
--no-formula        Disable formula recognition
--no-table          Disable table extraction
--token TOKEN       API token (overrides MINERU_TOKEN env var)

Model Version Guide

ModelSpeedAccuracyBest For
pipeline⚡ FastHighStandard docs, most use cases
vlm🐢 SlowHighestComplex layouts, multi-column, mixed text+figures
MinerU-HTML⚡ FastHighWeb-style output, HTML-ready content

Script Selection

ScriptUse When
mineru_v2.pyDefault — async parallel (up to 15 workers)
mineru_async.pyFast network, need maximum throughput
mineru_stable.pyUnstable network — sequential, max retry

Output Structure

output/
├── document-name/
│   ├── document-name.md    # Main Markdown
│   ├── images/             # Extracted images
│   └── content.json        # Metadata

Performance

WorkersSpeed
1 (sequential)1.2 files/min
53.1 files/min
155.6 files/min

Error Handling

  • 5x auto-retry with exponential backoff
  • Use --resume to continue interrupted batches
  • Failed files listed at end of run

API Reference

For detailed API documentation, see references/api_reference.md.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Tsconfig Validator

Validate and lint tsconfig.json files for common mistakes, conflicting compiler options, strictness gaps, and best practices. Use when asked to lint, validat...

Registry SourceRecently Updated
General

API Diff

Compare two OpenAPI 3.x or Swagger 2.0 specs and generate a changelog of breaking and non-breaking changes. Detect removed endpoints, new required parameters...

Registry SourceRecently Updated
General

Eslint Flat Config Validator

Validate ESLint v9+ flat config files (JSON-exported) for structural correctness, language options, rules configuration, plugin hygiene, file patterns, and b...

Registry SourceRecently Updated
General

Migration Safety Checker

Check database migrations for safety — detect data loss risks, locking operations, backward compatibility issues, and deployment ordering problems across SQL...

Registry SourceRecently Updated