markdown-tools

Convert documents to markdown with image extraction and Windows/WSL path handling.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "markdown-tools" with this command: npx skills add nguyendinhquocx/code-ai/nguyendinhquocx-code-ai-markdown-tools

Markdown Tools

Convert documents to markdown with image extraction and Windows/WSL path handling.

Quick Start

Install markitdown with PDF Support

IMPORTANT: Use [pdf] extra for PDF support

uv tool install "markitdown[pdf]"

Or via pip

pip install "markitdown[pdf]"

Basic Conversion

markitdown "document.pdf" -o output.md

Or redirect: markitdown "document.pdf" > output.md

PDF Conversion with Images

markitdown extracts text only. For PDFs with images, use this workflow:

Step 1: Convert Text

markitdown "document.pdf" -o output.md

Step 2: Extract Images

Create assets directory alongside the markdown

mkdir -p assets

Extract images using PyMuPDF

uv run --with pymupdf python scripts/extract_pdf_images.py "document.pdf" ./assets

Step 3: Add Image References

Insert image references in the markdown where needed:

Description

Step 4: Format Cleanup

markitdown output often needs manual fixes:

  • Add proper heading levels (# , ## , ### )

  • Reconstruct tables in markdown format

  • Fix broken line breaks

  • Restore indentation structure

Path Conversion (Windows/WSL)

Windows → WSL conversion

C:\Users\name\file.pdf → /mnt/c/Users/name/file.pdf

Use helper script

python scripts/convert_path.py "C:\Users\name\Documents\file.pdf"

Common Issues

"dependencies needed to read .pdf files"

Install with PDF support

uv tool install "markitdown[pdf]" --force

FontBBox warnings during PDF conversion

  • These are harmless font parsing warnings, output is still correct

Images missing from output

  • Use scripts/extract_pdf_images.py to extract images separately

Resources

  • scripts/extract_pdf_images.py

  • Extract images from PDF using PyMuPDF

  • scripts/convert_path.py

  • Windows to WSL path converter

  • references/conversion-examples.md

  • Detailed examples for batch operations

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

ppt-creator

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

pitch-deck

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

pdf

No summary provided by upstream source.

Repository SourceNeeds Review