pdf-batch-processor

Batch process PDF files - merge multiple PDFs, split PDF into multiple files, rotate pages, extract text, extract images, compress PDFs. Use when you need to process multiple PDF files in bulk.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf-batch-processor" with this command: npx skills add potatosolo/pdf-batch-processing-tool

PDF Batch Processor

Batch process multiple PDF files with common operations. No need for expensive online services - process locally, keep your data private.

Core Capabilities

1. Merge multiple PDFs

Combine multiple PDF files into one
Preserve page order
Add table of contents optional

2. Split a PDF

Split by page ranges
Split each page into a separate file
Extract specific pages

3. Rotate pages

Rotate all pages or specific page ranges
Support 90/180/270 degree rotation

4. Extract text

Extract text from all pages
Export to plain text or markdown
Batch extract from multiple PDFs in a folder

5. Extract images

Save all images from a PDF to separate image files
Preserve original image quality when possible

6. Compress PDF

Reduce file size for web/email
Three compression levels (low/medium/high)

Usage Examples

Merge multiple PDFs

python scripts/merge_pdfs.py --output combined.pdf file1.pdf file2.pdf file3.pdf

Split PDF into individual pages

python scripts/split_pdfs.py --input document.pdf --output output-folder/ --mode pages

Extract all text from PDFs in a folder

python scripts/extract_text.py --input ./pdfs/ --output ./text/

Rotate all pages 90 degrees clockwise

python scripts/rotate_pdf.py --input input.pdf --output output.pdf --degrees 90

Installation

pip install pypdf pillow

When to use this skill

✅ Use when:

You have multiple PDFs that need the same operation
You want to keep processing local (private, no uploads needed)
You need to automate PDF processing in a workflow

❌ Don't use when:

You only need to edit one page manually (use a GUI PDF editor)
The PDF is encrypted/scanned image-only (needs OCR first)
You need advanced editing (add/remove content, edit text)

Notes

Works with standard PDF files
For scanned/image PDFs you need OCR first (use an OCR tool before processing)
All processing is local - your files never leave your machine

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

HeyCube AI Memory Butler

引导安装 HeyCube 黑方体个人档案管理服务到 OpenClaw。分步配置：设置环境变量、安装 SQLite 工具、安装口令触发 Skill。用户主动说"安装黑方体"、"配置 HeyCube"、"heycube setup"时触发。

Registry SourceRecently Updated

1880Profile unavailable

General

airoom-ltd-global-finance-data-platform

Automatically logs into a WordPress site and downloads all downloadable files from a specified page with error handling and configurable limits.

Registry SourceRecently Updated

2790Profile unavailable

General

Windows Local Embedding

在 Windows 上为 OpenClaw 配置本地 embedding / 本地记忆检索时使用。适用于：下载并接入 `nomic-embed-text-v1.5.Q8_0.gguf`、把 `memorySearch.provider` 改成 `local`、检查 `openclaw memory status...

Registry SourceRecently Updated

2730Profile unavailable

General

Auto Create AI Team

Automatically create and configure AI team structures for projects with single, dual, or custom modes, integrating MCP and OpenClaw services.

Registry SourceRecently Updated

3130Profile unavailable