doc2markdown

Lightweight document utility designed to convert files to Markdown (MD), built specifically for intelligent agents (e.g., OpenClaw, ClaudeCode) to read and process content. Requires no external dependencies and accurately preserves document structure and formatting. Supported formats include docx, doc, pdf, ppt, pptx, xls, xlsx, jpg, jpeg, png, ceb, teb, caj, odt, ofd, cebx, odp, ott, wps, ods, et, dps, epub, chm, sdc, sdd, sdw, mobi

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "doc2markdown" with this command: npx skills add haoyt27/doc2markdown

doc2markdown

Document conversion assistant that automatically converts documents to Markdown (MD), saving output to the same directory as the source file. Designed to help intelligent agents read and process document content in various formats.

Quick Start

# Convert document (auto-polls for 60s, downloads if complete, returns doc ID if timeout)
node scripts/doc2markdown.js convert <file_path>     # Downloads MD package
node scripts/doc2markdown.js convert <file_path> --md  # Downloads single MD file

# Check status and download (for documents that exceeded timeout)
node scripts/doc2markdown.js check <doc_id> <original_file_path>     # Downloads MD package
node scripts/doc2markdown.js check <doc_id> <original_file_path> --md  # Downloads single MD file

Capabilities

  • Supported formats: docx, doc, pdf, ppt, pptx, xls, xlsx, jpg, jpeg, png, ceb, teb, caj, odt, ofd, cebx, odp, ott, wps, ods, et, dps, epub, chm, sdc, sdd, sdw, mobi, etc.
  • Preserves document structure, tables, and images
  • No API Key or account required, zero external dependencies
  • Downloaded ZIP files are extracted to {doc_id}_{filename}/ under the source file's parent directory; single MD files are saved directly there

When to Use

  • User requests to "read", "extract", "convert", or "view" a document
  • User provides a document path and asks about its content
  • User needs to summarize or analyze a document
  • User needs to convert document content to Markdown package

Download Modes

This tool supports two download modes:

  • --md mode: Downloads a single merged MD file to the source file's parent directory. Images are not included
  • MD package: Downloads and extracts a ZIP package to {doc_id}_{filename}/ in the source file's parent directory. Includes image files and tables, tables are rendered in HTML format

Choosing the Right Mode

User IntentExample PhrasesMode to Use
Read / view / analyze a document"read this file", "what's in this doc", "summarize this PDF"--md (single MD file)
Explicitly convert to MD"convert to MD", "export as markdown", "转成MD"MD package (default, no --md); use --md only if user specifically asks for a single file

Workflow

convert — Convert Document

  1. Invoke file parsing service
  2. Auto-poll conversion status (up to 60 seconds)
  3. Completes within 60s → Auto-download to source file directory
  4. Exceeds 60s → Return doc ID for subsequent check query

check — Query and Download

  1. Provide the previously returned doc ID
  2. Download if complete, otherwise continue polling for 60 seconds
  3. Prompt to retry later if still not complete

Data & Privacy

  • convert uploads files to the docchain cloud service (lab.hjcloud.com) for parsing. Results are returned as a ZIP archive and extracted locally.
  • All transfers use HTTPS encryption.
  • Users should ensure that documents do not contain sensitive or confidential information unless they have verified the service's data handling practices.
  • Service endpoint: https://lab.hjcloud.com/llmdoc

Feedback & Support

For parsing errors, format issues, or other problems, please submit an issue on GitHub: https://github.com/wct-lab/docchain-skills

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Miaoji Asin Clinic Pro

亚马逊ASIN诊所Pro版,90天行动计划+竞品对标+季节性优化日历。 从合规度、广告度、评论度、视觉度、内容度五维升级为可执行的长期作战方案。 基础功能可使用 miaoji-asin-clinic 免费版。

Registry SourceRecently Updated
Coding

wechat-publish-pro

Pure Python tool to convert Markdown to styled HTML and publish articles to WeChat official account drafts with AI-based content refinement and theme support.

Registry SourceRecently Updated
Coding

Miaoji Asin Clinic

基于ASIN和品类,快速诊断亚马逊Listing五维健康指数并智能排序修复优先级,提供详细分析与个性化修复方案。

Registry SourceRecently Updated
Coding

Toonany

A Claude Code skill for creating AI-generated short dramas (漫剧) from novels and stories. Use when user mentions "漫剧创作", "小说转剧本", "分镜生成", "短剧制作", "故事线生成", "大纲...

Registry SourceRecently Updated