office-toolkit

A comprehensive toolkit for Microsoft Office documents (Word, Excel, PowerPoint) and PDF files. Supports reading, writing, format conversion, and batch processing. Features: - DOCX: Read/write with styles, tables, images - PPTX: Read/write slides, extract text - XLSX: Read/write spreadsheets, formulas - PDF: Read text, create from documents - Convert: DOCX→PDF, PPTX→PDF

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "office-toolkit" with this command: npx skills add weiwei2027/office-toolkit

Office Toolkit

Comprehensive document processing toolkit for Office and PDF files.

ClawHub: https://clawhub.ai/weiwei2027/office-toolkit Install: clawhub install office-toolkit

Supported Formats

FormatReadWriteConvert FromConvert To
DOCX-PDF
PPTX-PDF
XLSX--
PDF✅ (from DOCX/PPTX)DOCX, PPTX-

Quick Start

Read Documents

# Word
docx-read.py document.docx

# PowerPoint
pptx-read.py presentation.pptx

# Excel
xlsx-read.py spreadsheet.xlsx

# PDF
pdf-read.py document.pdf

Create Documents

# Word with content
docx-write.py output.docx --title "Report" --content "Hello World"

# PowerPoint with slides
pptx-write.py output.pptx --title "Presentation" --slides 5

# Excel with data
xlsx-write.py output.xlsx --sheet "Data" --data data.json

Convert Formats

# DOCX to PDF
convert.py document.docx --to pdf

# PPTX to PDF
convert.py presentation.pptx --to pdf

Installation

# Install all dependencies
pip install -r requirements/all.txt

# Or install only what you need
pip install -r requirements/docx.txt   # Word only
pip install -r requirements/pptx.txt   # PowerPoint only
pip install -r requirements/xlsx.txt   # Excel only
pip install -r requirements/pdf.txt    # PDF only

Directory Structure

office-toolkit/
├── SKILL.md                    # This file
├── requirements/               # Dependency files
│   ├── base.txt               # Core dependencies
│   ├── docx.txt               # python-docx
│   ├── pptx.txt               # python-pptx
│   ├── xlsx.txt               # openpyxl
│   └── pdf.txt                # pymupdf
├── scripts/                    # CLI tools
│   ├── docx-read.py
│   ├── docx-write.py
│   ├── pptx-read.py
│   ├── pptx-write.py
│   ├── xlsx-read.py
│   ├── xlsx-write.py
│   ├── pdf-read.py
│   ├── pdf-write.py
│   └── convert.py
├── lib/                        # Shared library
│   ├── __init__.py
│   ├── base.py                # Base classes
│   ├── utils.py               # Utilities
│   └── validators.py          # Input validation
└── tests/                      # Test suite
    ├── test_docx.py
    ├── test_pptx.py
    ├── test_xlsx.py
    └── test_pdf.py

Python API Usage

from lib.base import DocumentProcessor

# Process Word document
processor = DocumentProcessor('docx')
text = processor.read('document.docx')
processor.write('output.docx', content="New content")

# Convert format
processor.convert('document.docx', 'pdf')

Notes

  • DOCX: Uses python-docx library. Supports text, tables, styles, images.
  • PPTX: Uses python-pptx library. Supports slides, text, shapes, charts.
  • XLSX: Uses openpyxl library. Supports cells, formulas, charts, styling.
  • PDF: Uses pymupdf (fitz) for reading, reportlab for creation.

Roadmap

  • Excel support (xlsx read/write) - ✅ Added in v1.0.1
  • PDF creation from scratch
  • Format conversion improvements
  • Batch processing
  • Template system

Changelog

v1.0.1 (2026-03-20)

  • Added Excel (.xlsx) read/write support
  • Improved error handling with helpful messages
  • Added JSON output option for read operations
  • Added PDF page selection support

v1.0.0 (2026-03-20)

  • Initial release
  • DOCX/PPTX/PDF read and write support

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Template SDS Generator

Generate a deterministic, template-preserving 16-section SDS/MSDS package from 1 DOCX template, 1 prompt/rule file, and 1-3 source SDS/MSDS files, with DOCX/...

Registry Source
2500Profile unavailable
General

Encrypted File Reader

读取本地授权访问的加密及受保护文本、Word和Excel文件内容,支持企业安全策略环境下的合法文件读取。

Registry Source
4160Profile unavailable
Coding

FormatFerry Markdown Converter

Convert HTML, DOCX, PDF, XLSX, CSV to Markdown

Registry SourceRecently Updated
1190Profile unavailable
Coding

文档内容总结 Summary & Analysis txt/docx/pdf/xlsx/xls

local document summary & analysis tool. triggers: 帮我总结, 总结文件, 分析文档, 分析总结, 总结一下, 分析一下 summarize for me, analyze for me, summarize the file, analyze the docume...

Registry Source
5010Profile unavailable