U2-doc-parser

Name: u2-doc-parser Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "U2-doc-parser" with this command: npx skills add aaiccee/u2-doc-parser

Name: u2-doc-parser Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

UniDoc Document Parser

Overview

Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling. Ideal for converting various document formats (PDF, DOC, DOCX, images) through a cloud-based API service.

⚠️ Important Privacy Notice

  • This skill uploads your documents to an external API service: https://unidoc.uat.hivoice.cn
  • Documents are transmitted over the internet and processed on third-party servers
  • No authentication or API key is required for this UAT environment
  • Do not use with sensitive, confidential, or private documents
  • By using this skill, you acknowledge that your files will be uploaded to external servers

Prereqs / when to read references

If you encounter API errors, network issues, or need to understand the API endpoints, read:

  • references/unidoc-notes.md

Quick start (single document)

# Output to terminal (default)
python scripts/unidoc_parse.py /path/to/file.pdf

# Save to file
python scripts/unidoc_parse.py /path/to/file.pdf --output result.md

# Convert to JSON format (async mode)
python scripts/unidoc_parse.py /path/to/file.docx --format json --mode async

Options

  • --format md|json (default: md)
    • Output format: Markdown or JSON
  • --mode sync|async (default: sync)
    • Synchronous mode: waits for conversion to complete
    • Asynchronous mode: polls status until completion
  • --func METHOD (default: unisound)
    • Conversion method/algorithm to use
  • --output FILE (optional)
    • Save output to file instead of printing to terminal
    • When not specified, results are printed directly to stdout
  • --uid UUID (optional)
    • Custom user ID (auto-generated if not provided)

Output

  • Default: Prints converted content directly to terminal (stdout)
  • With --output: Saves to specified file path
  • Progress and error messages are sent to stderr
  • Can be piped to other commands: python scripts/unidoc_parse.py doc.pdf | grep "keyword"

Notes

  • Privacy: Your documents are uploaded to UniDoc's UAT servers for processing
  • No authentication: Current implementation does not require API keys or credentials
  • Network: Requires internet connectivity to https://unidoc.uat.hivoice.cn
  • Supported formats: PDF, DOC, DOCX, PNG, JPG, etc.
  • Async mode: Polls every 1 second until completion (max 5 minutes)
  • Limits: Max file size and rate limits depend on API service configuration
  • Recommendation: For large files or batch processing, prefer async mode
  • Security: Only use with non-sensitive test documents

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Video Call Agent

Start a video call with a real-time AI avatar using the Runway Characters API. The agent sends the user a call invite link — for standups, urgent alerts, che...

Registry SourceRecently Updated
Automation

Clawshake

B2B deal discovery for AI agents. Register your company, post seeks, respond to opportunities, negotiate in deal rooms, follow agent feeds, and search the di...

Registry SourceRecently Updated
Automation

Weixin Task Workbench

在单个微信 / OpenClaw 对话窗口中提供“单入口、多任务”的任务工作台体验:用户始终只需和一个助手聊天,就能并行推进多个事项,并把不同任务稳定路由到各自独立会话,减少串话、上下文污染与多线程协作混乱。适用于微信中的长期协作、任务切换、任务总结、任务归档与任务生命周期管理;当用户发送“任务列表 / 新建任务...

Registry SourceRecently Updated
Automation

自动记忆进化

Automatically summarize daily discussions and update memory files. This skill should be used when users want to automatically organize conversation notes int...

Registry SourceRecently Updated