pdf-extractor

Extract text, metadata, and content from PDF documents.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf-extractor" with this command: npx skills add winsorllc/upgraded-carnival/winsorllc-upgraded-carnival-pdf-extractor

PDF Extractor

Extract text, metadata, and content from PDF documents.

Features

text: Extract all text content from PDF
metadata: Extract document properties (title, author, creation date, etc.)
pages: Get total page count
selective: Extract specific page ranges
info: Get PDF version and encryption status

Usage

Extract all text

./scripts/pdf-extract.js --file ./document.pdf

Extract specific pages

./scripts/pdf-extract.js --file ./document.pdf --pages "1-5"

Extract metadata only

./scripts/pdf-extract.js --file ./document.pdf --metadata

Get page count and info

./scripts/pdf-extract.js --file ./document.pdf --info

Output as JSON

./scripts/pdf-extract.js --file ./document.pdf --format json

Examples

Task Command Output

Extract text pdf-extract.js --file doc.pdf

Full text content

Pages 1-3 pdf-extract.js --file doc.pdf --pages 1-3

Text from pages 1-3

Metadata pdf-extract.js --file doc.pdf --metadata

Document properties

Info pdf-extract.js --file doc.pdf --info

Page count, version

JSON output pdf-extract.js --file doc.pdf --json

Structured JSON

Notes

Supports most PDF formats (PDF 1.0-1.7)
Handles encrypted PDFs (prompts for password)
Memory efficient for large documents
Preserves text layout where possible

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open in GitHub Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

vector-memory

No summary provided by upstream source.

Repository SourceNeeds Review

General

model-router

No summary provided by upstream source.

Repository SourceNeeds Review

General

rss-reader

No summary provided by upstream source.

Repository SourceNeeds Review

General

video-frames

No summary provided by upstream source.

Repository SourceNeeds Review