markitdown

Convert any document or file to Markdown using Microsoft's MarkItDown library. Use this skill when the user asks to convert a file to markdown, extract text from a document, turn a PDF/Word/Excel/PowerPoint/HTML/image/audio file into markdown, or parse document content. Supports PDF, DOCX, PPTX, XLSX, HTML, images (with OCR), audio, CSV, JSON, XML, ZIP, EPub, and more.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "markitdown" with this command: npx skills add dalehurley/phpbot/dalehurley-phpbot-markitdown

Skill: markitdown

When to Use

Use this skill when the user asks to:

  • Convert a document or file to Markdown
  • Extract text/content from a PDF, Word, Excel, PowerPoint, or other document
  • Parse a file into readable text
  • Turn a file into Markdown format
  • Read/extract content from images (OCR)
  • Transcribe audio files to text
  • Convert HTML pages to Markdown
  • Process CSV, JSON, or XML files as Markdown
  • Extract content from ZIP archives or EPub files

Supported Formats

| Format | Extensions | Notes |
| --- | --- | --- |
| PDF | .pdf | Text extraction with structure preservation |
| Word | .docx | Headings, lists, tables preserved |
| PowerPoint | .pptx | Slide content and notes |
| Excel | .xlsx | Tables converted to Markdown tables |
| Images | .jpg, .png, .gif, .webp, .bmp, .tiff | EXIF metadata; OCR if available |
| Audio | .mp3, .wav | EXIF metadata; transcription if available |
| HTML | .html, .htm | Full HTML-to-Markdown conversion |
| CSV | .csv | Converted to Markdown tables |
| JSON | .json | Structured as Markdown |
| XML | .xml | Structured as Markdown |
| ZIP | .zip | Iterates over archive contents |
| EPub | .epub | Book content extraction |
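
Before calling the converter, it can help to check whether a file's extension appears in the table above. The following is a minimal sketch of such a check; the `SUPPORTED_FORMATS` mapping and the `detect_format` helper are hypothetical names introduced here for illustration, and the mapping is copied from this table rather than queried from MarkItDown itself.

```python
from pathlib import Path
from typing import Optional

# Extension-to-format map transcribed from the Supported Formats table.
# This is an illustrative copy, not something MarkItDown exposes.
SUPPORTED_FORMATS = {
    ".pdf": "PDF",
    ".docx": "Word",
    ".pptx": "PowerPoint",
    ".xlsx": "Excel",
    ".jpg": "Images",
    ".png": "Images",
    ".gif": "Images",
    ".webp": "Images",
    ".bmp": "Images",
    ".tiff": "Images",
    ".mp3": "Audio",
    ".wav": "Audio",
    ".html": "HTML",
    ".htm": "HTML",
    ".csv": "CSV",
    ".json": "JSON",
    ".xml": "XML",
    ".zip": "ZIP",
    ".epub": "EPub",
}


def detect_format(file_path: str) -> Optional[str]:
    """Return the table's format label for a path, or None if the extension is not listed."""
    # Only the final suffix is considered, compared case-insensitively.
    return SUPPORTED_FORMATS.get(Path(file_path).suffix.lower())
```

A `None` result only means the extension is missing from this table; the skill description says "and more", so the library may still handle the file.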

Input Parameters

| Parameter | Required | Description | Example |
| --- | --- | --- | --- |
| file_path | Yes | Path to the file to convert | /Users/me/Documents/report.pdf |
| output_path | No | Path to save the Markdown output. If omitted, outputs to stdout. | /output.md |

Procedure

  1. Determine the file path from the user's request. If not provided, ask via ask_user.
  2. Verify the file exists using bash or read_file.
  3. Run the conversion script:
    python3 skills/markitdown/scripts/convert.py "{{file_path}}"
    
    Or save to a file:
    python3 skills/markitdown/scripts/convert.py "{{file_path}}" -o "{{output_path}}"
    
  4. The script will auto-install markitdown[all] if not already available.
  5. Present the Markdown output to the user, or confirm the output file was written.
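
Steps 2 and 3 of the procedure can be sketched in Python as follows. This is an illustrative wrapper, not part of the skill: `build_command` and `convert` are hypothetical helper names, and the script path is assumed to be relative to the current working directory, as in the commands above.

```python
import subprocess
from pathlib import Path
from typing import List, Optional

# Script path as written in the procedure; assumed relative to the working directory.
SCRIPT = "skills/markitdown/scripts/convert.py"


def build_command(file_path: str, output_path: Optional[str] = None) -> List[str]:
    """Build the argument list for the conversion script (step 3)."""
    cmd = ["python3", SCRIPT, file_path]
    if output_path:
        cmd += ["-o", output_path]
    return cmd


def convert(file_path: str, output_path: Optional[str] = None) -> str:
    """Verify the input exists (step 2), run the script, and return its stdout."""
    if not Path(file_path).is_file():
        raise FileNotFoundError(f"No such file: {file_path}")
    result = subprocess.run(
        build_command(file_path, output_path),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the script exits non-zero
    )
    return result.stdout
```

Passing the command as a list rather than a shell string means paths containing spaces or quotes need no extra escaping.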

Bundled Scripts

| Script | Type | Description |
| --- | --- | --- |
| scripts/convert.py | Python | Convert any supported file to Markdown using markitdown |

Script Usage

# Convert and print to stdout
python3 skills/markitdown/scripts/convert.py "/path/to/document.pdf"

# Convert and save to file
python3 skills/markitdown/scripts/convert.py "/path/to/document.pdf" -o output.md

# Convert with verbose output (shows metadata)
python3 skills/markitdown/scripts/convert.py "/path/to/document.pdf" -v
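
The source of `convert.py` is not shown on this page, so the following is only a guess at how a script with the same command-line interface could be written. It uses MarkItDown's Python API (`MarkItDown().convert(...)` and the result's `text_content` and `title` attributes). The function names and the choice of what `-v` prints are assumptions, and the auto-install step is omitted.

```python
import argparse
import sys


def parse_args(argv=None):
    """Parse the same positional argument and -o / -v flags shown in the usage above."""
    parser = argparse.ArgumentParser(description="Convert a file to Markdown")
    parser.add_argument("file_path", help="Path to the file to convert")
    parser.add_argument("-o", dest="output", help="Write Markdown to this path instead of stdout")
    parser.add_argument("-v", dest="verbose", action="store_true", help="Print metadata to stderr")
    return parser.parse_args(argv)


def main(argv=None) -> int:
    args = parse_args(argv)

    # Imported lazily so that argument parsing works even if markitdown is not installed.
    from markitdown import MarkItDown

    result = MarkItDown().convert(args.file_path)

    if args.verbose:
        # Assumption: the real script's "metadata" may differ from these two fields.
        print(f"title: {result.title}", file=sys.stderr)
        print(f"characters: {len(result.text_content)}", file=sys.stderr)

    if args.output:
        with open(args.output, "w", encoding="utf-8") as fh:
            fh.write(result.text_content)
    else:
        print(result.text_content)
    return 0
```

To use the sketch as a script, call `sys.exit(main())` at the bottom of the file. Writing metadata to stderr keeps stdout clean, so the Markdown can still be piped or redirected.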

Example

Example requests that trigger this skill:

convert this PDF to markdown
extract the text from report.docx
turn this spreadsheet into markdown
parse the content of presentation.pptx
convert ~/Downloads/invoice.pdf to markdown

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
