pdf-batch-processor

Batch process PDF files: merge multiple PDFs, split a PDF into multiple files, rotate pages, extract text, extract images, and compress PDFs. Use when you need to process multiple PDF files in bulk.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.


Install skill "pdf-batch-processor" with this command: npx skills add potatosolo/pdf-batch-processing-tool

PDF Batch Processor

Batch process multiple PDF files with common operations. There is no need for expensive online services: files are processed locally, so your data stays private.

Core Capabilities

1. Merge multiple PDFs

  • Combine multiple PDF files into one
  • Preserve page order
  • Optionally add a table of contents

2. Split a PDF

  • Split by page ranges
  • Split each page into a separate file
  • Extract specific pages

3. Rotate pages

  • Rotate all pages or specific page ranges
  • Supports rotation by 90, 180, or 270 degrees

4. Extract text

  • Extract text from all pages
  • Export to plain text or markdown
  • Batch extract from multiple PDFs in a folder

5. Extract images

  • Save all images from a PDF to separate image files
  • Preserve original image quality when possible
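The skill's own image-extraction script is not shown here, so the following is only a minimal sketch of how this could be done with pypdf (with Pillow installed). The function names `image_filename` and `extract_images` are hypothetical. It writes the bytes that pypdf returns for each embedded image without re-encoding them in this code.

```python
from pathlib import Path


def image_filename(page_number, image_name):
    """Build a per-page file name so images from different pages do not collide."""
    return f"page{page_number:03d}-{Path(image_name).name}"


def extract_images(input_path, output_dir):
    """Save every embedded image of a PDF as a separate file (hypothetical helper)."""
    from pypdf import PdfReader  # third-party: pip install pypdf pillow

    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    reader = PdfReader(str(input_path))
    saved = []
    for page_number, page in enumerate(reader.pages, start=1):
        for image in page.images:
            target = out / image_filename(page_number, image.name)
            # image.data holds the image bytes as provided by pypdf.
            target.write_bytes(image.data)
            saved.append(target)
    return saved
```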

6. Compress PDF

  • Reduce file size for web/email
  • Three compression levels (low/medium/high)
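How the three levels are implemented is not documented here. One possible approach, sketched below, is lossless recompression of page content streams with pypdf. The mapping from low/medium/high to zlib levels is an assumption made for illustration, and the `level` argument of `compress_content_streams` assumes a recent pypdf release.

```python
# Hypothetical mapping from the named levels to zlib compression levels.
ZLIB_LEVELS = {"low": 1, "medium": 6, "high": 9}


def zlib_level(name):
    """Translate a level name into a zlib level, rejecting unknown names."""
    try:
        return ZLIB_LEVELS[name]
    except KeyError:
        raise ValueError(f"unknown compression level: {name!r}") from None


def compress_pdf(input_path, output_path, level="medium"):
    """Losslessly recompress page content streams (hypothetical helper)."""
    from pypdf import PdfWriter  # third-party: pip install pypdf

    # clone_from copies the whole document into the writer.
    writer = PdfWriter(clone_from=str(input_path))
    for page in writer.pages:
        # CPU intensive on large documents.
        page.compress_content_streams(level=zlib_level(level))
    writer.write(str(output_path))
    writer.close()
```

Lossless stream compression mostly helps text-heavy files. PDFs dominated by large images usually shrink only when the images themselves are downsampled or re-encoded, which this sketch does not attempt.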

Usage Examples

Merge multiple PDFs

python scripts/merge_pdfs.py --output combined.pdf file1.pdf file2.pdf file3.pdf
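The contents of `merge_pdfs.py` are not reproduced here. As a rough sketch of what such a merge could look like with pypdf, the hypothetical `merge_pdfs` function below appends the inputs in the order given and adds one bookmark per source file, which is one simple way to approximate a table of contents.

```python
from pathlib import Path


def bookmark_title(path):
    """Use the file name without its extension as the bookmark title."""
    return Path(path).stem


def merge_pdfs(inputs, output):
    """Append every input PDF, in the order given, into one file (hypothetical helper)."""
    from pypdf import PdfWriter  # third-party: pip install pypdf

    writer = PdfWriter()
    for path in inputs:
        # outline_item adds a bookmark pointing at the first page of each input.
        writer.append(str(path), outline_item=bookmark_title(path))
    writer.write(str(output))
    writer.close()
```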

Split PDF into individual pages

python scripts/split_pdfs.py --input document.pdf --output output-folder/ --mode pages
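The script's internals and its page-range syntax are not shown here. The sketch below assumes a 1-based range syntax such as "1-3,5" and uses pypdf. The names `parse_page_ranges`, `extract_pages`, and `split_into_pages` are hypothetical.

```python
from pathlib import Path


def parse_page_ranges(spec, page_count):
    """Turn a 1-based spec such as "1-3,5" into 0-based page indices."""
    indices = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        start, _, end = part.partition("-")
        first = int(start)
        last = int(end) if end else first
        if not 1 <= first <= last <= page_count:
            raise ValueError(f"invalid page range: {part!r}")
        indices.extend(range(first - 1, last))
    return indices


def extract_pages(input_path, output_path, spec):
    """Write only the pages named by spec into a new PDF."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    reader = PdfReader(str(input_path))
    writer = PdfWriter()
    for index in parse_page_ranges(spec, len(reader.pages)):
        writer.add_page(reader.pages[index])
    writer.write(str(output_path))


def split_into_pages(input_path, output_dir):
    """Write each page of the input to its own single-page PDF."""
    from pypdf import PdfReader, PdfWriter

    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    reader = PdfReader(str(input_path))
    for number, page in enumerate(reader.pages, start=1):
        writer = PdfWriter()
        writer.add_page(page)
        writer.write(str(out / f"page-{number:03d}.pdf"))
```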

Extract all text from PDFs in a folder

python scripts/extract_text.py --input ./pdfs/ --output ./text/
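As an illustration of the folder-to-folder pattern only (the real `extract_text.py` may differ), the sketch below uses pypdf to write one UTF-8 `.txt` file per PDF. The helper names `output_path_for` and `extract_folder` are hypothetical.

```python
from pathlib import Path


def output_path_for(pdf_path, output_dir):
    """Map ./pdfs/report.pdf to <output_dir>/report.txt."""
    return Path(output_dir) / (Path(pdf_path).stem + ".txt")


def extract_folder(input_dir, output_dir):
    """Extract text from every *.pdf in input_dir (hypothetical helper)."""
    from pypdf import PdfReader  # third-party: pip install pypdf

    Path(output_dir).mkdir(parents=True, exist_ok=True)
    written = []
    for pdf in sorted(Path(input_dir).glob("*.pdf")):
        reader = PdfReader(str(pdf))
        # Fall back to an empty string for pages with no text layer,
        # such as scanned pages.
        pages = [page.extract_text() or "" for page in reader.pages]
        target = output_path_for(pdf, output_dir)
        target.write_text("\n\n".join(pages), encoding="utf-8")
        written.append(target)
    return written
```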

Rotate all pages 90 degrees clockwise

python scripts/rotate_pdf.py --input input.pdf --output output.pdf --degrees 90
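For reference, a minimal sketch of clockwise rotation with pypdf follows. It is not the code of `rotate_pdf.py`. The names `normalize_rotation` and `rotate_pdf`, and the optional `page_indices` argument for rotating only some pages, are hypothetical.

```python
def normalize_rotation(degrees):
    """Reduce an angle to 0, 90, 180, or 270; reject non-right angles."""
    if degrees % 90 != 0:
        raise ValueError("rotation must be a multiple of 90 degrees")
    return degrees % 360


def rotate_pdf(input_path, output_path, degrees, page_indices=None):
    """Rotate all pages, or only the 0-based page_indices, clockwise."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    angle = normalize_rotation(degrees)
    reader = PdfReader(str(input_path))
    writer = PdfWriter()
    for index, page in enumerate(reader.pages):
        added = writer.add_page(page)
        if angle and (page_indices is None or index in page_indices):
            # pypdf rotates clockwise and accepts multiples of 90 only.
            added.rotate(angle)
    writer.write(str(output_path))
```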

Installation

pip install pypdf pillow

When to use this skill

Use when:

  • You have multiple PDFs that need the same operation
  • You want to keep processing local (private, no uploads needed)
  • You need to automate PDF processing in a workflow

Don't use when:

  • You only need to edit one page manually (use a GUI PDF editor)
  • The PDF is encrypted (decrypt it first) or is a scanned, image-only file (run OCR first)
  • You need advanced editing (add/remove content, edit text)

Notes

  • Works with standard PDF files
  • For scanned/image PDFs you need OCR first (use an OCR tool before processing)
  • All processing is local - your files never leave your machine

