pdf-extractor

This skill provides tools for extracting text and metadata from PDF documents and converting them to different formats.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "pdf-extractor" with this command: npx skills add maxvaega/skillkit/maxvaega-skillkit-pdf-extractor

PDF Extractor Skill

This skill provides tools for extracting text and metadata from PDF documents and converting them to different formats.

Available Scripts

extract.py

Extracts text and metadata from PDF files.

Input:

{ "file_path": "/path/to/document.pdf", "pages": "all" | [1, 2, 3] }

Output:

{ "text": "Extracted text content...", "metadata": { "title": "Document Title", "author": "Author Name", "pages": 10 } }

convert.sh

Converts PDF files to different formats (text, markdown, etc.).

Input:

{ "input_file": "/path/to/input.pdf", "output_format": "txt" | "md" | "html" }

parse.py

Parses structured data from PDF forms and tables.

Input:

{ "file_path": "/path/to/form.pdf", "extract_tables": true, "extract_forms": true }

Usage Example

from skillkit import SkillManager

manager = SkillManager() result = manager.execute_skill_script( skill_name="pdf-extractor", script_name="extract", arguments={"file_path": "document.pdf", "pages": "all"} )

if result.success: print(result.stdout)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

large-content-skill

No summary provided by upstream source.

Repository SourceNeeds Review
General

git-helper

No summary provided by upstream source.

Repository SourceNeeds Review
General

markdown-formatter

No summary provided by upstream source.

Repository SourceNeeds Review