AB Agents Vision (MiniMax) 👁️

Image analysis via MiniMax VL API — simple, fast, reliable.

⚠️ Requires MiniMax Token Plan API key — get free key

What It Does

📸 Describe images — Get detailed scene descriptions
📝 Extract text — Read from screenshots, photos, documents
🔍 Analyze photos — Identify objects, people, settings
🌐 URL support — Analyze images from the web

Requirements

MiniMax Token Plan API key — Subscribe free
Linux/macOS
uvx (auto-installed)

Quick Start

# 1. Install uvx
curl -LsSf https://astral.sh/uv/install.sh | sh

# 2. Get free MiniMax API key
# https://platform.minimax.io → Subscribe → Token Plan (free tier)

# 3. Use
export MINIMAX_API_KEY="sk-cp-your-key"
./vision.sh image.jpg "Describe this image"

Usage

# Basic description
./vision.sh photo.jpg

# With custom prompt
./vision.sh screenshot.png "What text do you see?"

# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"

Examples

Screenshot analysis:

Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."

Photo description:

Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned..."

Installation

git clone https://github.com/alexburrstudio/ab-agents-vision.git
cd ab-agents-vision/skills/vision
chmod +x vision.sh

Or via ClaWHub:

clawhub install AB-Agents-Vision-MiniMax

Troubleshooting

Error	Solution
API Error: 1033	Retry — MiniMax system error
No response	Check MINIMAX_API_KEY is set correctly
Slow	Use smaller images (<10MB)

AB-Agents 🦀

Related Skills

📊 AB Agents Meter Reader — Read meter readings from photos (uses this skill for vision)

AB-Agents 🦀

AB-Agents-Vision-MiniMax

Safety Notice

Copy this and send it to your AI assistant to learn

AB Agents Vision (MiniMax) 👁️

What It Does

Requirements

Quick Start

Usage

Examples

Installation

Troubleshooting

Related Skills

Source Transparency

Related Skills

MiniMax Vision Analysis

Word OCR

屏幕截图OCR工具

Tesseract OCR文字识别