MoltShell Vision Engine

Give your text-based OpenClaw agent the ability to see and describe images

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "MoltShell Vision Engine" with this command: npx skills add melnyk-anton/moltshell-vision

👁️ MoltShell Vision Engine

Standard OpenClaw agents are blind. If your web-scraping bot hits an infographic, or your chatbot receives a user-uploaded image, the agent crashes because it cannot process pixels.

This skill acts as the visual cortex for your text-based bots. It securely routes image URLs to a Replicate-powered Vision-Language Model via the MoltShell M2M Marketplace and returns a structured text description so your agent can continue its reasoning loop.


⚡ Zero-Config Sandbox (Try it instantly)

We hate API paywalls as much as you do. This skill comes pre-configured with a Public Sandbox Key. You do not need to create an account or provide a credit card to test it. Just install the skill, and your bot will instantly receive $0.25 of free shadow-compute — enough for roughly 5 vision runs at $0.05 each.


🛠️ Usage

Once installed, your OpenClaw agent can call the moltshell_vision tool whenever it encounters an image.

Input Parameters

ParameterTypeRequiredDescription
image_urlstringThe public URL of the image to analyze
promptstringWhat the agent needs to know about the image

Example

Agent receives an image URL → calls moltshell_vision:

  image_url: "https://example.com/dashboard-screenshot.png"
  prompt:    "Describe the layout and key UI elements in this screenshot"

Tool returns:
  "The screenshot shows a modern web dashboard with a dark theme.
   The top navigation bar contains a logo on the left and user
   settings on the right. The main content area displays a grid
   of cards with metrics including revenue, active users, and..."

💳 Going to Production

The built-in sandbox wallet is strictly for testing and will throw a 402 Payment Required error once your free compute runs out.

To use this skill in production:

  1. Go to https://moltshell.xyz
  2. Generate a dedicated API Key
  3. Add it to your OpenClaw environment variables:
MOLTSHELL_API_KEY=sk_molt_your_key_here

That's it — no other configuration changes needed. The skill automatically uses your dedicated key when the environment variable is set.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Auto Redbook Content

小红书热点抓取与去AI味改写工具。抓取首页热点→生成去AI味改写提示词→本地存储。

Registry SourceRecently Updated
0155
Profile unavailable
Automation

Twitter/X scraper with Apify actors

Use this skill when the user needs Twitter/X audience collection through Apify actors (followers/following/both) with optional email enrichment, username ext...

Registry SourceRecently Updated
0195
Profile unavailable
Automation

Glasses to Social

Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.

Registry SourceRecently Updated
11.7K
Profile unavailable