pypdf

Extract text, metadata, and pages from PDF files using pypdf. Use for tasks such as reading PDF content, extracting specific pages, splitting or merging PDFs, reading document metadata, and rotating or transforming pages.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pypdf" with this command: npx skills add maverick-ai-tech/pypdf

pypdf

Use scripts/pypdf_cli.py for deterministic PDF operations instead of ad-hoc pypdf snippets.

Workflow

  1. Confirm the PDF file path is accessible locally.
  2. Run the desired command (inspect, extract-text, extract-pages, merge, split, rotate).
  3. Inspect text output or the resulting PDF file.

Command Guide

  • Inspect metadata and page count:
    • python scripts/pypdf_cli.py info --input <file.pdf>
  • Extract all text:
    • python scripts/pypdf_cli.py extract-text --input <file.pdf>
  • Extract text from specific pages (0-indexed):
    • python scripts/pypdf_cli.py extract-text --input <file.pdf> --pages 0 1 2
  • Split PDF into individual pages:
    • python scripts/pypdf_cli.py split --input <file.pdf> --output-dir <dir/>
  • Extract a page range into a new PDF:
    • python scripts/pypdf_cli.py extract-pages --input <file.pdf> --pages 0 1 2 --output <out.pdf>
  • Merge multiple PDFs:
    • python scripts/pypdf_cli.py merge --inputs <a.pdf> <b.pdf> <c.pdf> --output <merged.pdf>
  • Rotate pages:
    • python scripts/pypdf_cli.py rotate --input <file.pdf> --angle 90 --output <rotated.pdf>
    • --angle must be 90, 180, or 270.
    • Optionally restrict to specific pages with --pages 0 2.

Operational Rules

  • Pages are always 0-indexed in all commands.
  • For extract-text, output goes to stdout; redirect to a file when needed.
  • Require explicit --output for commands that write a new PDF.
  • Install dependency if missing: pip install pypdf.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Expedy

Expedy integration. Manage Organizations, Pipelines, Users, Filters. Use when the user wants to interact with Expedy data.

Registry SourceRecently Updated
General

Evenium

Evenium integration. Manage Events, Users, Roles. Use when the user wants to interact with Evenium data.

Registry SourceRecently Updated
General

Exhibitday

ExhibitDay integration. Manage Organizations. Use when the user wants to interact with ExhibitDay data.

Registry SourceRecently Updated
General

Enigma

Enigma integration. Manage Deals, Persons, Organizations, Leads, Projects, Activities and more. Use when the user wants to interact with Enigma data.

Registry SourceRecently Updated