pdf-conversion

Converts PDF files to clean, readable Markdown format with proper formatting, image extraction, and table preservation.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf-conversion" with this command: npx skills add rangerrick337/operator-os/rangerrick337-operator-os-pdf-conversion

PDF Conversion Skill

Overview

Converts PDF files to clean, readable Markdown format with proper formatting, image extraction, and table preservation.

When to Use

  • "Convert this PDF to markdown"

  • "Extract text from PDF"

  • "Transform PDF to MD format"

  • "Process PDFs in folder X"

Related Resources

  • SOP: Operator Team OS/1. SOPs/convert_pdfs_to_markdown.md

Scripts

Script Purpose

pdf_to_md.py

Main PDF to Markdown converter

Usage

python3 scripts/pdf_to_md.py /path/to/input.pdf /path/to/output.md

Features

  • Text extraction with OCR fallback

  • Image extraction and embedding

  • Table detection and formatting

  • Heading structure preservation

  • Automatic cleanup of formatting artifacts

Edge Cases

  • Scanned PDFs: Falls back to OCR

  • Complex tables: Best-effort conversion

  • Encrypted PDFs: Will fail (requires password)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

docx

No summary provided by upstream source.

Repository SourceNeeds Review
General

xlsx

No summary provided by upstream source.

Repository SourceNeeds Review
General

skill-creator

No summary provided by upstream source.

Repository SourceNeeds Review