mistral-ocr

Convert PDF/images to Markdown/JSON/HTML using Mistral OCR API. Supports image extraction, table recognition, header/footer handling, and multi-column layouts. Usage: Upload a file and say Use Mistral OCR to process this.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "mistral-ocr" with this command: npx skills add yzdame/mistral-ocr

⚠️ Privacy Warning - 隐私警告

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your files to Mistral's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless:

  • You trust Mistral's data handling policies
  • You have reviewed Mistral's privacy policy
  • You accept that file contents will be transmitted and processed remotely

For sensitive documents, use offline/local OCR tools instead.


Mistral OCR Skill

A powerful OCR tool that converts PDF files and images into Markdown, JSON, or HTML formats using Mistral's state-of-the-art OCR API.

Installation

# Clone or download this repository
git clone https://github.com/YZDame/Mistral-OCR-SKILL.git
cd Mistral-OCR-SKILL

# Install dependencies
pip install -r requirements.txt

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.mistral.ai/home

Set the environment variable:

export MISTRAL_API_KEY=your_api_key

CLI Usage

cd scripts

# Process PDF to Markdown
python3 mistral_ocr.py -i input.pdf

# Process PDF to JSON
python3 mistral_ocr.py -i input.pdf -f json

# Specify output directory
python3 mistral_ocr.py -i input.pdf -o ~/my_ocr_results

Arguments

FlagDescription
-i, --inputInput file path (required)
-f, --formatOutput format: markdown/json/html (default: markdown)
-o, --outputOutput directory

Data Privacy

What happens to your files:

  1. Files are uploaded to Mistral's OCR API
  2. Files are processed on Mistral servers
  3. Processing results are returned to you
  4. Files are not stored on Mistral servers (per Mistral policy)

For more details, see: https://mistral.ai/privacy-policy

License

MIT

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

WPS PDF Processing

当用户需要对 PDF 文件进行任何操作时,使用本技能。包括:读取或提取 PDF 中的文字/表格、合并多个 PDF、拆分 PDF、旋转页面、添加水印、创建新 PDF、填写 PDF 表单、加密/解密 PDF、提取图片,以及对扫描版 PDF 进行 OCR 识别使其可搜索。只要用户提到 .pdf 文件或希望生成 PDF,...

Registry SourceRecently Updated
720Profile unavailable
General

Akashic Doc Analyzer

Parse, analyze, and extract content from documents (PDF, DOCX, PPTX, audio). Supports OCR, table extraction, and semantic chunking.

Registry SourceRecently Updated
1360Profile unavailable
General

china-doc-ocr

智能文档OCR识别与结构化提取。Use when the user has a complex document, PDF, scanned image, photo, invoice, receipt, ID card, table, or chart that needs to be recognized a...

Registry SourceRecently Updated
1.3K0Profile unavailable
General

📝 文档处理大师

上传任意文档,智能提取关键信息、生成摘要。无需API Key,安装即用。

Registry SourceRecently Updated
380Profile unavailable