mineru

Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extracting text/tables/formulas from documents, (3) Batch processing multiple files, (4) Saving parsed content to Obsidian or knowledge bases. Supports LaTeX formulas, tables, images, multilingual OCR, and async parallel processing.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "mineru" with this command: npx skills add nebutra/mineru-skill/nebutra-mineru-skill-mineru

MinerU Document Parser

Convert PDF, Word, PPT, and images to clean Markdown using MinerU's VLM engine — LaTeX formulas, tables, and images all preserved.

Setup

  1. Get free API token at https://mineru.net/user-center/api-token
export MINERU_TOKEN="your-token-here"

Limits: 2000 pages/day · 200 MB per file · 600 pages per file

Supported File Types

TypeFormats
📕 PDF.pdf — papers, textbooks, scanned docs
📝 Word.docx — reports, manuscripts
📊 PPT.pptx — slides, presentations
🖼️ Image.jpg, .jpeg, .png — OCR extraction

Commands

Single File

python3 scripts/mineru_v2.py --file ./document.pdf --output ./output/

Batch Directory with Resume

python3 scripts/mineru_v2.py \
  --dir ./docs/ \
  --output ./output/ \
  --workers 10 \
  --resume

Direct to Obsidian

python3 scripts/mineru_v2.py \
  --dir ./pdfs/ \
  --output "~/Library/Mobile Documents/com~apple~CloudDocs/Obsidian/VaultName/" \
  --resume

Chinese Documents

python3 scripts/mineru_v2.py --dir ./papers/ --output ./output/ --language ch

Complex Layouts (Slow but Most Accurate)

python3 scripts/mineru_v2.py --file ./paper.pdf --output ./output/ --model vlm

CLI Options

--dir PATH          Input directory (PDF/Word/PPT/images)
--file PATH         Single file
--output PATH       Output directory (default: ./output/)
--workers N         Concurrent workers (default: 5, max: 15)
--resume            Skip already processed files
--model MODEL       Model version: pipeline | vlm | MinerU-HTML (default: vlm)
--language LANG     Document language: auto | en | ch (default: auto)
--no-formula        Disable formula recognition
--no-table          Disable table extraction
--token TOKEN       API token (overrides MINERU_TOKEN env var)

Model Version Guide

ModelSpeedAccuracyBest For
pipeline⚡ FastHighStandard docs, most use cases
vlm🐢 SlowHighestComplex layouts, multi-column, mixed text+figures
MinerU-HTML⚡ FastHighWeb-style output, HTML-ready content

Script Selection

ScriptUse When
mineru_v2.pyDefault — async parallel (up to 15 workers)
mineru_async.pyFast network, need maximum throughput
mineru_stable.pyUnstable network — sequential, max retry

Output Structure

output/
├── document-name/
│   ├── document-name.md    # Main Markdown
│   ├── images/             # Extracted images
│   └── content.json        # Metadata

Performance

WorkersSpeed
1 (sequential)1.2 files/min
53.1 files/min
155.6 files/min

Error Handling

  • 5x auto-retry with exponential backoff
  • Use --resume to continue interrupted batches
  • Failed files listed at end of run

API Reference

For detailed API documentation, see references/api_reference.md.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

openclaw-async-telegram-bot

No summary provided by upstream source.

Repository SourceNeeds Review
Research

learn-anything-in-one-hour

Teach users any new skill/knowledge X in ~1 hour using a fixed 4-step workflow optimized for complete beginners, focusing on 80/20 rule for maximum value in minimum time. Triggers when user asks to learn something new quickly, or mentions "learn X in one hour".

Archived SourceRecently Updated
Research

X/Twitter Research

# X/Twitter Research Skill

Archived SourceRecently Updated
Research

council

Convene the Council of High Intelligence — multi-persona deliberation with historical thinkers for deeper analysis of complex problems.

Archived SourceRecently Updated