paper-parse

Parse academic PDF papers into markdown with figure extraction.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "paper-parse" with this command: npx skills add chen-li-17/paper-parse-figures

Paper Parse

Parse academic PDF papers into structured markdown with figure extraction using PyMuPDF.

Usage

uv run {baseDir}/scripts/parse_paper.py --pdf /path/to/paper.pdf [--output-dir ./output]

Output

The tool generates:

  • {paper_name}_content.md - Full paper content in markdown
  • {paper_name}_parsed.json - Structured metadata including:
    • Paper title
    • Number of pages
    • Extracted figures with captions and paths
  • cover_title_authors.png - First-page snapshot focused on title + authors region
  • figures/ - Directory containing high-resolution figure screenshots

Example

uv run scripts/parse_paper.py --pdf ~/papers/my-paper.pdf --output-dir ./parsed

Output structure:

./parsed/
├── my-paper_content.md
├── my-paper_parsed.json
└── figures/
    ├── figure_1.png
    ├── figure_2.png
    └── ...

Dependencies

  • PyMuPDF (fitz) - PDF parsing and rendering
  • pymupdf4llm - Markdown conversion

These are automatically managed by uv via the inline script metadata.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

dependency-manager

You are a task dependency analysis and management specialist implementing DAG-based execution patterns from workflow orchestration systems. Use when: depende...

Registry SourceRecently Updated
Research

debugger

Expert debugger specializing in complex issue diagnosis, root cause analysis, and systematic problem-solving. Masters debugging tools, techniques, and method...

Registry SourceRecently Updated
Research

Autism Spectrum Disorder Behavior Analysis Tool | 孤独症谱系障碍行为分析工具

Performs special video analysis on behavioral characteristics of children with autism, identifies core symptom features, provides structured analysis reports...

Registry SourceRecently Updated
Research

Outdoor Sports Event Risk Analysis Tool | 户外体育赛事风险分析工具

Conducts video safety risk analysis for participants in outdoor sports competitions, long-distance running, marathons, etc.; identifies sports injuries and s...

Registry SourceRecently Updated
1040Profile unavailable