AB Agents Vision (MiniMax) 👁️
Image analysis via MiniMax VL API — simple, fast, reliable.
⚠️ Requires MiniMax Token Plan API key — get free key
What It Does
- 📸 Describe images — Get detailed scene descriptions
- 📝 Extract text — Read from screenshots, photos, documents
- 🔍 Analyze photos — Identify objects, people, settings
- 🌐 URL support — Analyze images from the web
Requirements
- MiniMax Token Plan API key — Subscribe free
- Linux/macOS
uvx(auto-installed)
Quick Start
# 1. Install uvx
curl -LsSf https://astral.sh/uv/install.sh | sh
# 2. Get free MiniMax API key
# https://platform.minimax.io → Subscribe → Token Plan (free tier)
# 3. Use
export MINIMAX_API_KEY="sk-cp-your-key"
./vision.sh image.jpg "Describe this image"
Usage
# Basic description
./vision.sh photo.jpg
# With custom prompt
./vision.sh screenshot.png "What text do you see?"
# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
Examples
Screenshot analysis:
Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."
Photo description:
Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned..."
Installation
git clone https://github.com/alexburrstudio/ab-agents-vision.git
cd ab-agents-vision/skills/vision
chmod +x vision.sh
Or via ClaWHub:
clawhub install AB-Agents-Vision-MiniMax
Troubleshooting
| Error | Solution |
|---|---|
| API Error: 1033 | Retry — MiniMax system error |
| No response | Check MINIMAX_API_KEY is set correctly |
| Slow | Use smaller images (<10MB) |
AB-Agents 🦀
Related Skills
📊 AB Agents Meter Reader — Read meter readings from photos (uses this skill for vision)
AB-Agents 🦀