PDF to Markdown Converter
Convert PDF documents to clean Markdown format using local processing only.
Quick Start
python scripts/pdf2md.py input.pdf output.md
Features
- Local Processing: No API calls, no network requests
- Smart Formatting: Auto-detect headers and lists
- Clean Output: Properly formatted Markdown
Requirements
pip install pdfplumber
Usage
# Basic usage
python scripts/pdf2md.py document.pdf
# Specify output file
python scripts/pdf2md.py document.pdf output.md
How It Works
- Extracts text from PDF using pdfplumber
- Cleans up formatting issues
- Detects headers and lists
- Outputs clean Markdown
Metadata
metadata:
openclaw:
requires:
bins: ["python3"]
pypi: ["pdfplumber"]
permissions:
- "file:read"
- "file:write"
behavior:
network: none
telemetry: none
description: "Converts PDF to Markdown using local pdfplumber. No network requests."