youtube-podcaster

Extracts the original text of a Youtube video and converts it into a multi-voice AI podcast using Gemini for script generation, OpenAI for TTS, and a local Node.js API with FFmpeg. It also can show you the text of the Podcast in WebVTT format.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "youtube-podcaster" with this command: npx skills add kaudata/youtube-podcaster

YouTube Podcaster

This skill enables the automated conversion of YouTube videos into multi-host AI podcasts. It manages transcription, script generation via Gemini, and audio synthesis via OpenAI locally.

Security Setup

For maximum security, the backend server binds strictly to 127.0.0.1. It is not accessible from your local network or the internet.

  1. Install Dependencies: You must run the install command once before the first use. Say: Run the npm install command for the youtube-podcaster skill.
  2. Credentials: Place your Gemini API Key and OpenAI API Key in the .env file within the skill folder (skills/youtube-podcaster/.env) using the variable names GEMINI_API_KEY and OPENAI_API_KEY.
  3. Execution: Start the server with npm start or by instructing the agent: Start the local server for the youtube-podcaster skill.

Usage

Once the server is running, say: Create a podcast for the video https://www.youtube.com/watch?v=<video_id> using the youtube-podcaster skill

The skill orchestrates three local API calls to localhost:7860:

  1. Transcription: Extracts text via the YouTube transcript API.
  2. Drafting: Uses Gemini to create a natural dialogue script.
  3. Synthesis: Uses OpenAI TTS (tts-1) and FFmpeg to generate a gapless .m4a file.

Safe Cleanup

When you are finished using the studio, shut down the background process to free up system resources. Do not use generic kill commands. Instead, instruct the agent to use the tracked process ID: Stop the youtube-podcaster server process

(The agent will execute kill $(cat .podcaster.pid) or pkill -f "node index.js" to target the specific process safely).

Storage & File Outputs

Files are saved to downloads/<session_id>/ inside the skill directory. The server includes an hourly garbage collector that automatically deletes inactive sessions.

  • Audio: podcast.m4a
  • Captions: podcast.vtt
  • Scripts: script.txt and original.txt

Source Code

The source code is available at: https://github.com/kaudata/youtube-podcaster

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Omnicast

A local multi-modal podcast pipeline. Ingests media, drafts scripts, synthesizes audio, renders cover art, and uploads to YouTube.

Registry SourceRecently Updated
2821Profile unavailable
General

Podcast Growth Engine

A 12-phase system guiding podcast launch, production, guest management, audience growth, monetization, and repurposing without platform restrictions.

Registry SourceRecently Updated
1310Profile unavailable
Coding

Podwise

Podcast knowledge workflows powered by Podwise CLI: search podcasts and episodes by keyword, monitor followed shows for new releases, find popular episodes,...

Registry SourceRecently Updated
2380Profile unavailable
General

ReelTalk

Helper for processing shared video links. Takes a URL, downloads the audio track, creates a text transcript, and produces a summary. Supports all major platf...

Registry SourceRecently Updated
2611Profile unavailable