Document Processing

# Document Processing Skill

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "Document Processing" with this command: npx skills add xiongrui-xr/document-processing

Document Processing Skill

Description

文档处理技能,支持多格式文档转换、内容提取和批量处理。

Features

  • 📄 格式转换(PDF/Word/Excel/PPT)
  • 🔍 内容提取(文本、表格、图片)
  • ✏️ 批量处理(重命名、格式统一)
  • 🤝 飞书集成(云文档同步)

Dependencies

  • Python 3.13+
  • PyPDF2, python-docx, openpyxl
  • pdfplumber, python-docx2txt

Usage

from document_processing import DocumentProcessor

# 初始化文档处理器
processor = DocumentProcessor()

# 转换PDF到Word
processor.convert_format("input.pdf", "output.docx")

# 提取PDF内容
content = processor.extract_content("document.pdf")

# 批量处理文档
processor.batch_process("./docs", ".docx", ".pdf")

Integration with Feishu

  • 文档自动同步到飞书云文档
  • 内容提取结果自动保存到飞书多维表格
  • 批量处理结果推送到飞书消息

Configuration

document_processing:
  temp_dir: "/tmp/document_processing"
  default_format: "docx"
  ocr_enabled: true
  ocr_api_key: "your_ocr_api_key"

feishu:
  doc_folder_token: "your_folder_token"
  bitable_app_token: "your_bitable_token"
  bitable_table_id: "your_table_id"

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

GigaChat (Sber AI) Proxy

Integrate GigaChat (Sber AI) with OpenClaw via gpt2giga proxy

Registry SourceRecently Updated
3600smvlx
General

TencentCloud Video Face Fusion

通过提取两张人脸核心特征并实现自然融合,支持多种风格适配,提升创意互动性和内容传播力,广泛应用于创意营销、娱乐互动和社交分享场景。

Registry SourceRecently Updated
General

TencentCloud Image Face Fusion

图片人脸融合(专业版)为同步接口,支持自定义美颜、人脸增强、牙齿增强、拉脸等参数,最高支持8K分辨率,有多个模型类型供选择。

Registry SourceRecently Updated
General

YoudaoNote News

有道云笔记资讯推送:基于收藏笔记分析关注话题,推送最新相关资讯。支持对话触发与每日定时推送(如早上9点)。触发词:资讯推送、设置资讯推送、生成资讯推送。

Registry SourceRecently Updated
1.5K1lephix