AB-Agents-Vision-MiniMax

👁️ Image analysis via MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Requires MiniMax Token Plan API key (free tier available).

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "AB-Agents-Vision-MiniMax" with this command: npx skills add alexburrstudio/ab-agents-vision-minimax

AB Agents Vision (MiniMax) 👁️

Image analysis via MiniMax VL API — simple, fast, reliable.

⚠️ Requires MiniMax Token Plan API keyget free key

What It Does

  • 📸 Describe images — Get detailed scene descriptions
  • 📝 Extract text — Read from screenshots, photos, documents
  • 🔍 Analyze photos — Identify objects, people, settings
  • 🌐 URL support — Analyze images from the web

Requirements

  • MiniMax Token Plan API keySubscribe free
  • Linux/macOS
  • uvx (auto-installed)

Quick Start

# 1. Install uvx
curl -LsSf https://astral.sh/uv/install.sh | sh

# 2. Get free MiniMax API key
# https://platform.minimax.io → Subscribe → Token Plan (free tier)

# 3. Use
export MINIMAX_API_KEY="sk-cp-your-key"
./vision.sh image.jpg "Describe this image"

Usage

# Basic description
./vision.sh photo.jpg

# With custom prompt
./vision.sh screenshot.png "What text do you see?"

# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"

Examples

Screenshot analysis:

Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."

Photo description:

Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned..."

Installation

git clone https://github.com/alexburrstudio/ab-agents-vision.git
cd ab-agents-vision/skills/vision
chmod +x vision.sh

Or via ClaWHub:

clawhub install AB-Agents-Vision-MiniMax

Troubleshooting

ErrorSolution
API Error: 1033Retry — MiniMax system error
No responseCheck MINIMAX_API_KEY is set correctly
SlowUse smaller images (<10MB)

AB-Agents 🦀

Related Skills

📊 AB Agents Meter Reader — Read meter readings from photos (uses this skill for vision)


AB-Agents 🦀

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

MiniMax Vision Analysis

Analyze, describe, and extract information from images using the MiniMax vision MCP tool. Use when: user shares an image file path or URL (any message contai...

Registry SourceRecently Updated
2940Profile unavailable
Coding

Word OCR

OCR and text extraction from Word documents (.docx, .doc) using the MinerU API. This skill leverages mineru-open-api CLI to perform optical character recogni...

Registry SourceRecently Updated
890Profile unavailable
General

屏幕截图OCR工具

从屏幕截图中提取文字,支持纯文本、结构化及问答分离格式,适合中医考试题识别。

Registry Source
2460Profile unavailable
General

Tesseract OCR文字识别

基于Tesseract引擎的OCR文字识别技能,支持中文、英文、中英混合三种模式,输出text/structured/question_answer三种格式。

Registry Source
2290Profile unavailable