formula-ocr

OCR and recognize mathematical formulas from PDFs and images using MinerU. Converts printed or handwritten equations into structured LaTeX or text representation. Features: mathematical formula recognition from PDFs and images (.png, .jpg, .jpeg, .webp). Converts formulas to LaTeX notation. Handles complex multi-line equations, fractions, integrals, and matrices. OCR mode for scanned formula content. Use when you need to: OCR a math formula, recognize equations from images, convert formula screenshots to LaTeX, extract math from scanned documents. Use when asked: 'how do I OCR this formula', 'convert equation image to LaTeX', 'I have a photo of a math formula', 'can my agent recognize mathematical equations', 'is there a skill for formula OCR', 'turn this equation photo into text'. Powered by MinerU (OpenDataLab, Shanghai AI Lab) with advanced formula recognition capabilities. Supports a wide range of mathematical notation. Ideal for students, researchers, educators, and anyone who needs to digitize printed or handwritten mathematical content into editable LaTeX.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "formula-ocr" with this command: npx skills add mzlzyca/formula-ocr

Formula Ocr

Convert and extract content from .pdf / images (.png/.jpg/.jpeg/.webp) using MinerU (mineru-open-api).

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Extract formulas from PDF (requires token)
mineru-open-api extract paper.pdf -o ./out/

# With VLM for better formula accuracy
mineru-open-api extract paper.pdf --model vlm -o ./out/

Authentication

Token required for extract and crawl:

mineru-open-api auth            # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token

Capabilities

Supports local files and URLs
Requires token (mineru-open-api auth or MINERU_TOKEN env)
Supported input: .pdf / images (.png/.jpg/.jpeg/.webp)
Language hint with --language (default: ch, use en for English)
Page range with --pages (where applicable)

Notes

Formula recognition requires extract with token. The --formula flag is enabled by default.
Output goes to stdout by default; use -o <dir> to save to file
Binary formats (docx) require -o flag (cannot stream to stdout)
All progress/status messages go to stderr
MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Huo15 Openclaw Enhance

火一五·克劳德·龙虾增强插件 v5.7.8 — 全面适配 openclaw 2026.4.24：peerDep ^4.24 + build/compat 同步到 4.24 + 14 处 api.on 全部去掉 as any 改成 typed hook（hookName 联合类型 + handler 自动推断 Pl...

Registry SourceRecently Updated

3030zhaobod1

General

Content Trend Analyzer

Aggregates and analyzes content trends across platforms to identify hot topics, user intent, content gaps, and generates data-driven article outlines.

Registry SourceRecently Updated

260openlark

General

Prompt Debugger

Debug prompts that produce unexpected AI outputs — diagnose failure modes, identify ambiguity and conflicting instructions, test variations, compare model re...

Registry SourceRecently Updated

120charlie-morrison

General

Indie Maker News

独行者 Daily - 变现雷达。读对一条新闻，少走一年弯路。每天5分钟，给创业者装上商业雷达。聚焦一人公司、副业、创业变现资讯，智能分类，行动导向。用户下载即能用，无需本地部署！

Registry SourceRecently Updated

300kylnwu