youtube-channel-parse

Fetch, transcribe, summarize, and filter YouTube channels or individual videos. Use when asked to analyze a full YouTube channel, parse a single video, collect transcripts, group or filter videos by date, topic, speakers, or metadata criteria, or produce reusable summaries and structured outputs from YouTube content.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "youtube-channel-parse" with this command: npx skills add ulyanas/youtube-channel-parse

YouTube Channel Parse

Overview

This skill supports two workflows.

For an individual video:

  • parse a specific video URL
  • fetch or generate a transcript
  • summarize the video
  • extract structured notes or reusable outputs

For a YouTube channel:

  • inventory channel videos
  • select which videos to analyze by criteria such as topic, date range, speakers, or other metadata filters
  • retrieve transcripts for the selected subset
  • produce summaries and structured outputs for the filtered set

Inputs

Accept these inputs:

  • channel_url or video_url
  • optional since_date
  • optional until_date
  • optional topic_filter
  • optional speaker_filter
  • optional metadata_filters
  • output_prefix
  • optional whisper_model

Workflow Selection

Choose the workflow from the provided URL:

  • if the user provides video_url, run the individual video workflow
  • if the user provides channel_url, run the channel workflow
  • if the user provides both, use the video workflow for the specific video request and the channel workflow for the filtered channel request

Check environment access early.

If the environment blocks YouTube network access, stop retrying after confirming the restriction and tell the user how to enable access before continuing.

Individual Video Workflow

  1. Normalize the video URL and derive the video id.
  2. Fetch video metadata.
  3. Retrieve the transcript.
  4. Produce a transcript file.
  5. Produce a markdown summary or notes file when the user asks for analysis, findings, or reusable notes.
  6. Produce structured JSON output when the task calls for downstream reuse.

Channel Workflow

  1. Inventory the channel videos.
  2. Normalize metadata for each video.
  3. Apply selection criteria such as date range, topic, speakers, or metadata filters.
  4. Write the filtered dataset before transcript work when the selection itself is part of the deliverable.
  5. Retrieve transcripts for the selected subset.
  6. Produce summaries, notes, and reports for the filtered set.

Filtering Rules

Apply filters in this order:

  1. date window
  2. metadata filters
  3. topic filter
  4. speaker filter

Use metadata-based filtering before transcript retrieval when that narrows the set efficiently.

Use transcript-based filtering when the requested criteria depend on spoken content, such as speaker references, repeated themes, or terms not present in the title or description.

Read detailed filter semantics from references/filtering.md when the request depends on nuanced selection logic.

Transcript Workflow

Retrieve direct transcripts first.

When direct transcripts are unavailable, try subtitle retrieval before full audio transcription.

When subtitle retrieval is unavailable, generate transcripts from downloaded audio and continue the workflow.

Reuse existing transcript files and cached audio files when they already match the current request.

Use tiny.en or tiny by default for fast transcription unless the user asks for higher accuracy.

If metadata retrieval works but captions or audio fail with bot-detection, 403, or sign-in verification errors, treat that as a content-access restriction and explain the environment limits clearly.

Outputs

  • channel inventory in JSON and markdown when needed
  • filtered video dataset in JSON, CSV, and markdown when needed
  • transcript files per video
  • filtered subsets by topic, speaker, date, or metadata criteria
  • markdown report with findings, summaries, and caveats

Use markdown outputs for summaries, findings, reusable notes, and channel-level reports.

Read naming and directory conventions from references/outputs.md when writing artifacts.

Capabilities

WorkflowCapabilityResult
ChannelAnalyze the entire channelInventory, selection, transcripts, and channel-level outputs
VideoAnalyze an individual videoTranscript, summary, notes, and structured outputs
ChannelFilter videos by date windowA narrowed candidate set before transcript work
ChannelFilter or group videos by topic, date range, speakers, or metadata criteriaA selected subset for analysis and reporting
Video or ChannelFetch direct transcripts when availableFaster transcript retrieval from existing sources
Video or ChannelGenerate transcripts from downloaded audio when neededContinued execution when direct transcripts are unavailable
Video or ChannelGenerate concise summaries and reusable artifactsMarkdown reports, notes, and structured data outputs

Resources

Use these bundled scripts:

  • fetch_channel.py for inventory, date filtering, and metadata normalization
  • transcribe_with_fallback.py for transcript API attempts, yt-dlp audio download, and local Whisper transcription
  • build_report.py for topic grouping, summaries, and markdown or CSV outputs

Use these reference files:

  • references/dependencies.md
  • references/filtering.md
  • references/outputs.md

Dependencies

This skill uses:

  • yt-dlp for channel inventory and audio download
  • youtube-transcript-api for direct transcript fetch
  • faster-whisper for local transcription fallback

Use commands such as:

uv run --with yt-dlp --with youtube-transcript-api --with faster-whisper python scripts/<script>.py ...

Pass --no-check-certificates to yt-dlp when the environment requires it.

Read references/dependencies.md for command patterns and dependency notes.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

ilp-b3-researcher

Responde dúvidas do time comercial sobre planos de incentivo de longo prazo (ILP) de empresas de capital aberto listadas na B3. Use esta skill SEMPRE que alg...

Registry SourceRecently Updated
Research

观远 BI · 马甲实战版

观远 BI(Guandata)全链路操作 — 数据查询/建卡/取数(Part A)、ETL 治理/写入/删除(Part B,含 SmartETL 全链路重写 + 字段使用度审计 + ExecPlan 工程化)、自定义图表 HTML/CSS/JS 注入与排障(Part C)。当用户提到 营业额/门店/会员/订单/建...

Registry SourceRecently Updated
Research

Tax Paperwork Staging Board

Gather scattered tax-time paperwork quickly into a clear staging board with a missing-items checklist, folder map, and handoff cover sheet, without giving ta...

Registry SourceRecently Updated
590Profile unavailable
Research

Cross Domain Engine

Discover hidden correlations between disparate research domains using EVEZ OODA loop architecture. Use when finding novel cross-domain connections, detecting...

Registry SourceRecently Updated
500Profile unavailable