pdf-markdown-converter

Convert PDF documents to clean, well-formatted Markdown using the MinerU API. This skill uses mineru-open-api CLI to transform PDFs into Markdown with preserved structure, headings, lists, tables, formulas, and images. Supports token-free flash-extract for instant conversion and precision extract for complex academic papers and technical documents. Use when asked to 'convert PDF to Markdown', 'PDF to md', 'turn my PDF into Markdown', 'PDF转Markdown', 'PDF转md格式', 'how to get Markdown from PDF', 'export PDF as Markdown', 'PDF to GitHub readme', 'extract PDF content as Markdown', 'can you convert this PDF to text format', 'make this PDF editable'. Handles academic papers, research reports, technical manuals, books, and multi-column layouts. Ideal for knowledge base building, technical documentation, blog publishing, and converting research papers for version control.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf-markdown-converter" with this command: npx skills add veeicwgy/pdf-markdown-converter

PDF to Markdown Converter with mineru-open-api

You are a PDF-to-Markdown conversion specialist. Convert PDFs to clean Markdown using mineru-open-api.

Installation

npm install -g mineru-open-api

Conversion Workflow

  1. Quick conversion (no token):

    mineru-open-api flash-extract report.pdf -o ./output/
    
  2. Precision conversion with tables:

    mineru-open-api extract report.pdf -f md -o ./output/
    
  3. Academic paper (complex layout):

    mineru-open-api extract paper.pdf -f md --model vlm -o ./output/
    
  4. Batch conversion:

    mineru-open-api extract *.pdf -f md -o ./results/
    

Key Rules

  • flash-extract outputs Markdown by default
  • Default to flash-extract for PDFs under 10MB/20 pages
  • Use extract for table-heavy docs, formulas, or files >10MB
  • --model vlm for academic papers with complex layouts
  • Generate default output dir: ~/MinerU-Skill/<name>_<hash>/

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Word Converter

Universal Word document converter powered by MinerU API. Convert .docx and .doc files to Markdown, HTML, LaTeX, DOCX, or JSON using mineru-open-api CLI. Supp...

Registry SourceRecently Updated
1080Profile unavailable
Research

LLM Wiki Karpathy

Manage and maintain a Markdown wiki with LLM Wiki Karpathy: inspect, repair, compile sources, add pages, answer queries, and lint for quality.

Registry SourceRecently Updated
890Profile unavailable
Research

LLM Knowledge Bases

Inspired by a public workflow shared by Andrej Karpathy (@karpathy). From raw research to a living Markdown knowledge base that compounds with every question...

Registry SourceRecently Updated
2340Profile unavailable
General

Markitdown Converter

使用微软 markitdown 库将多种文档格式(PDF、DOC、DOCX、PPT、HTML等)转换为 Markdown。支持批量转换、保留格式、图片提取等功能。使用场景:(1) "把这个 PDF 转成 Markdown",(2) "批量转换这个文件夹里的文档",(3) "提取文档中的图片"。

Registry Source
3071Profile unavailable