Soundside — OpenClaw Skill

Connect your OpenClaw agent to Soundside's 12 MCP tools for AI media generation, editing, and analysis.

Setup

Add to your openclaw.json:

{
  "skills": {
    "entries": {
      "soundside": {
        "enabled": true,
        "env": {
          "SOUNDSIDE_API_KEY": "mcp_your_key_here"
        }
      }
    }
  },
  "mcpServers": {
    "soundside": {
      "transport": "streamable-http",
      "url": "https://mcp.soundside.ai/mcp",
      "headers": {
        "Authorization": "Bearer ${SOUNDSIDE_API_KEY}"
      }
    }
  }
}

Then restart: openclaw gateway restart

What You Get

Once connected, your agent has access to:

Generation (6 tools)

create_image — Text-to-image across 5 providers (Vertex AI, Grok, Runway, MiniMax, Luma)
create_video — Text/image-to-video across 5 providers (Vertex Veo 3.1, Runway, MiniMax, Luma, Grok)
create_audio — TTS, transcription, voice cloning (MiniMax, Vertex AI)
create_music — Music from lyrics + style prompt (MiniMax)
create_text — LLM completions with structured output (Vertex Gemini, Grok, MiniMax)
create_artifact — Charts, presentations, documents, diagrams

Editing & Analysis (2 tools)

edit_video — 21 editing actions: trim, concat, Ken Burns, mix audio, text overlays, color grading, film grain, split screen, and more
analyze_media — Technical analysis + AI vision QA scoring

Library (3 tools)

lib_list — Browse projects, collections, resources
lib_manage — Create/update/delete library entities
lib_share — Share projects by email

Durable Resource Pattern

Every generation returns a resource_id that persists across sessions:

Generate media → receive resource_id
Chain into editing: edit_video(resource_id=..., action="add_text", ...)
Organize: lib_manage(entity_type="project", operation="create", ...)
Check status: lib_list(entity_type="resources", resource_id=...)
Download only for final delivery

This keeps workflow state durable without local storage.

Example Workflows

Generate and edit an image:

"Create an image of a sunset over the ocean using Vertex AI,
 then add the text 'Golden Hour' as a title overlay"

Produce a narrated video:

"Generate a video of waves crashing using Luma,
 create TTS narration saying 'The ocean calls to those who listen',
 then mix the narration into the video"

Build a presentation:

"Create a pitch deck with 5 slides covering our Q1 metrics,
 include a revenue chart showing growth from $100K to $600K"

Pricing

Live pricing: GET https://mcp.soundside.ai/api/x402/status

Soundside charges near-cost on provider pass-through (~10% margin). Editing and analysis are $0.01/call. A typical video pipeline (image → video → edit → analyze) costs $0.50-3.00 depending on provider.

Soundside

Safety Notice

Copy this and send it to your AI assistant to learn

Soundside — OpenClaw Skill

Setup

What You Get

Generation (6 tools)

Editing & Analysis (2 tools)

Library (3 tools)

Durable Resource Pattern

Example Workflows

Pricing

Docs

Source Transparency

Related Skills

七窍玲珑战法 v4.0

Autoresearch.Bak

Clone Anywebsite

jeffli-content-factory