pdf-to-markdown

Convert a PDF to clean Markdown, with image content described as text. Use when the user wants to convert a PDF to Markdown, extract content from a PDF, or prepare PDF content for AI tools.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "pdf-to-markdown" with this command: npx skills add krishagel/geoffrey/krishagel-geoffrey-pdf-to-markdown

PDF to Markdown Converter

Convert PDF files to clean, well-structured Markdown. Tables become Markdown tables. Images and graphics are described as text (no image files are generated).

Quick Start

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py input.pdf

Output: ~/Desktop/{filename}.md
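
As a rough illustration of the default destination above, here is a minimal sketch of how that path could be derived. It assumes {filename} means the input file's name without its .pdf extension; the function name default_output_path and its home parameter are hypothetical and are not taken from the script itself.

```python
from pathlib import Path
from typing import Optional


def default_output_path(pdf_path: str, home: Optional[Path] = None) -> Path:
    """Return <home>/Desktop/<input stem>.md.

    Assumption: {filename} is the input file's stem (name without ".pdf").
    This mirrors the documented default; it is not the script's own code.
    """
    base = home if home is not None else Path.home()
    return base / "Desktop" / (Path(pdf_path).stem + ".md")
```

Passing home explicitly keeps the sketch easy to check without touching the real home directory.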

Options

Flag                     Description
--no-llm                 Skip LLM processing (faster; images become [Image] placeholders)
--force-ocr              Force OCR on all pages (for scanned PDFs)
--page-range "0,5-10"    Process specific pages only
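
To make the --page-range syntax concrete, the sketch below expands a specification such as "0,5-10" into individual page indices. It assumes pages are numbered from 0 and that ranges include both endpoints, which the examples suggest but the listing does not state outright. The helper parse_page_range is hypothetical and only illustrates the format; it is not how the script parses the flag.

```python
from typing import List


def parse_page_range(spec: str) -> List[int]:
    """Expand a spec like "0,5-10" into a sorted list of page indices.

    Assumptions: 0-based page numbers, inclusive ranges.
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start > end:
                raise ValueError(f"invalid range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            pages.add(int(part))
    return sorted(pages)
```

Under these assumptions, "0,5-10" selects the first page plus pages 5 through 10, seven pages in total.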

Common Use Cases

Convert a PDF with default settings

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py ~/Documents/report.pdf

Specify output location

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py report.pdf ~/Documents/report.md

Fast conversion (no image descriptions)

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py --no-llm report.pdf

Scanned PDF (force OCR)

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py --force-ocr scanned_doc.pdf

Extract specific pages

uv run skills/pdf-to-markdown/scripts/convert_to_markdown.py --page-range "0-5" large_report.pdf

Output

  • Pure Markdown text (no embedded images)
  • Tables converted to Markdown table format
  • Images and charts described as text by an LLM
  • Clean formatting suitable for AI processing

Requirements

  • GEMINI_API_KEY: Required for LLM image descriptions (loaded from 1Password)
  • Use --no-llm flag if you don't have Gemini API access
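
The fallback described above can be sketched as a small wrapper that adds --no-llm whenever GEMINI_API_KEY is not set. This is only an illustration: build_command and the SCRIPT constant are hypothetical names, and the sketch reads a plain environment mapping rather than 1Password.

```python
import os
from typing import List, Mapping, Optional

# Script path as shown in the Quick Start section.
SCRIPT = "skills/pdf-to-markdown/scripts/convert_to_markdown.py"


def build_command(pdf_path: str, env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Build the `uv run` argument list, adding --no-llm if no API key is set."""
    env = os.environ if env is None else env
    cmd = ["uv", "run", SCRIPT]
    if not env.get("GEMINI_API_KEY"):
        # No key available: skip LLM image descriptions.
        cmd.append("--no-llm")
    cmd.append(pdf_path)
    return cmd
```

The resulting list can be handed to subprocess.run(cmd, check=True) to launch the conversion.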

First Run Note

The first run downloads ML models (~1-2 GB), which are cached at ~/.cache/marker/. Subsequent runs reuse the cache and are faster.
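
To anticipate the first-run download, one option is to check how much data the cache directory already holds. The sketch below simply sums file sizes under a directory; cache_size_bytes is a hypothetical helper, and nothing is assumed about how Marker lays out files inside ~/.cache/marker/.

```python
from pathlib import Path


def cache_size_bytes(cache_dir: Path) -> int:
    """Return the total size of all files under cache_dir (0 if it is missing)."""
    if not cache_dir.is_dir():
        return 0
    return sum(p.stat().st_size for p in cache_dir.rglob("*") if p.is_file())
```

Calling cache_size_bytes(Path.home() / ".cache" / "marker") and getting 0 would suggest that the models have not been downloaded yet.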

Technical Details

Uses the Marker library:

  • 31k+ GitHub stars
  • Best-in-class PDF conversion accuracy
  • Surya OCR for 90+ languages
  • Gemini LLM integration for image understanding
