media-orchestrator

Unified skill for resolving, downloading, and delivering media (audio/video) to chat platforms. Integrates yt-dlp for resolution and handles Spotify metadata sync.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "media-orchestrator" with this command: npx skills add sieershafilone/media-orchestrator

Media Orchestrator

This skill centralizes the logic for handling all media requests (audio, video, Spotify tracks) from chat platforms like WhatsApp and Telegram. It leverages yt-dlp for robust search and download capabilities from YouTube and performs Spotify metadata resolution for WebUI integration.

⚙️ How it Works

  1. Resolution: Upon receiving a media request, the orchestrator uses yt-dlp's ytsearch1: functionality to find the highest quality match from YouTube.
  2. Download: The selected media (audio or video) is downloaded directly into the OpenClaw workspace.
  3. Delivery: The downloaded file is then dispatched to the requesting chat platform (WhatsApp/Telegram) using the message tool's filePath argument, ensuring silent delivery as per protocol.
  4. Spotify Sync: For Spotify-specific requests, the orchestrator resolves track metadata via Zero-Auth scraping (bypassing direct API holds) and persists this data as a JSON contract in the workspace, enabling synchronized playback in the OpenClaw WebUI.

📂 Workspace Paths

  • Primary Workspace: /home/ky11rie/.openclaw/workspace/
    • Raw downloaded media files (.mp4, .mp3, etc.) reside here temporarily.
  • Spotify Metadata: /home/ky11rie/.openclaw/workspace/media/spotify/
    • Stores Spotify track contracts (e.g., track_id.json) generated by the spotify-surface component.
  • Skill Scripts: /home/ky11rie/.openclaw/workspace/skills/spotify-surface/scripts/
    • Contains the dedicated logic engine for Spotify integration.

🛠️ Commands

This skill acts as an underlying orchestrator, processing natural language requests. Example intents it handles:

  • send audio file song: [song name]
  • send video file mp4 480p: [video name]
  • play [spotify track or url] (leveraging spotify-surface)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

SERP Outline Extractor

Turn a target keyword or query into a search-informed content outline with likely subtopics, questions, and comparison angles. Useful for SEO briefs, blog pl...

Registry SourceRecently Updated
General

Multi-Model Response Comparator

Compare responses from multiple AI models for the same task and summarize differences in quality, style, speed, and likely cost. Best for model selection, ev...

Registry SourceRecently Updated
General

API Pricing Comparator

Compare AI API or model pricing across providers and produce a structured summary for product pages, blog posts, or buyer guides. Works with OpenAI-compatibl...

Registry SourceRecently Updated