watch-youtube

Watch and analyze YouTube videos using Gemini's video understanding API. Pass a public YouTube URL to get summaries, timestamps, Q&A, or detailed analysis of both the audio and visual content.

Install skill "watch-youtube" with this command: npx skills add mikeygonz/skills/mikeygonz-skills-watch-youtube

Watch YouTube

Use Google's Gemini API to actually watch YouTube videos and answer questions about them.

How It Works

Gemini processes both the audio track and the visual stream of a YouTube video, sampling the video at 1 frame per second (FPS). You pass a URL and a prompt, and it returns the analysis.
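The contents of watch.py are not shown in this listing, so the following is only a minimal sketch of what a "URL + prompt" call could look like using the Python standard library. The endpoint path, the `file_data`/`file_uri` payload shape, and the `x-goog-api-key` header are assumptions based on the public Gemini REST API, and the function names `build_request` and `ask_about_video` are hypothetical.

```python
import json
import os
import urllib.request

# Assumed base URL for the Gemini REST API (v1beta); verify against the
# current API reference before relying on it.
API_ROOT = "https://generativelanguage.googleapis.com/v1beta/models"


def build_request(video_url, prompt, api_key, model="gemini-2.5-flash"):
    """Build (but do not send) a generateContent request for one YouTube video."""
    payload = {
        "contents": [{
            "parts": [
                {"file_data": {"file_uri": video_url}},  # public YouTube URL
                {"text": prompt},                         # question or instruction
            ]
        }]
    }
    return urllib.request.Request(
        f"{API_ROOT}/{model}:generateContent",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def ask_about_video(video_url, prompt, model="gemini-2.5-flash"):
    """Send the request and return the text of the first candidate."""
    request = build_request(video_url, prompt, os.environ["GOOGLE_API_KEY"], model)
    with urllib.request.urlopen(request, timeout=300) as response:
        body = json.load(response)
    parts = body["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts)
```

Calling `ask_about_video("https://www.youtube.com/watch?v=VIDEO_ID", "Summarize this video in 5 bullet points")` would perform the network call. `build_request` is kept separate so the payload can be inspected without an API key or network access.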

When to Use

  • User shares a YouTube URL and wants a summary, transcript, or analysis
  • User asks "what did they say about X in this video?"
  • User wants timestamps of key moments
  • User wants to compare multiple videos (up to 10 per request with Gemini 2.5+)

When NOT to Use

  • Private or unlisted videos (only public videos work)
  • User just wants the video link or metadata

Setup

Requires the GOOGLE_API_KEY environment variable. Get a free key at https://aistudio.google.com/apikey

Usage

Run the script:

GOOGLE_API_KEY="$GOOGLE_API_KEY" python3 ~/.openclaw/workspace/skills/watch-youtube/watch.py "<youtube_url>" "<prompt>"

Examples

Summarize:

watch.py "https://www.youtube.com/watch?v=VIDEO_ID" "Summarize this video in 5 bullet points"

Timestamps:

watch.py "https://www.youtube.com/watch?v=VIDEO_ID" "List the key moments with timestamps"

Q&A:

watch.py "https://www.youtube.com/watch?v=VIDEO_ID" "What tools or products did they mention?"

Specific section:

watch.py "https://www.youtube.com/watch?v=VIDEO_ID" "What happens at 05:30?"

Limits

  • Free tier: 8 hours of YouTube video per day
  • Paid tier: No limit
  • Max videos per request: 10 (Gemini 2.5+)
  • Max video length: ~1 hour (1M context), ~3 hours (low res)
  • Token cost: ~300 tokens per second of video at default media resolution
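The figures above can be turned into a rough budget check before sending a request. This is a back-of-the-envelope sketch, not part of the skill: the constants are copied from the list above, the helper names `estimate_tokens` and `fits_in_request` are hypothetical, and the estimate ignores the tokens used by the prompt and the response.

```python
TOKENS_PER_SECOND = 300      # approximate rate from the limits list
CONTEXT_WINDOW = 1_000_000   # the "1M context" figure from the limits list
MAX_VIDEOS = 10              # per-request cap for Gemini 2.5+


def estimate_tokens(duration_seconds, tokens_per_second=TOKENS_PER_SECOND):
    """Approximate token cost of a video of the given length in seconds."""
    return duration_seconds * tokens_per_second


def fits_in_request(durations_seconds, context_window=CONTEXT_WINDOW):
    """Check the video-count cap and the combined token estimate."""
    if len(durations_seconds) > MAX_VIDEOS:
        return False
    total = sum(estimate_tokens(d) for d in durations_seconds)
    return total <= context_window


print(estimate_tokens(10 * 60))            # 10-minute video: 180000
print(fits_in_request([10 * 60, 15 * 60])) # 450000 tokens in total: True
print(fits_in_request([60 * 60]))          # 1080000 tokens: False
```

At 300 tokens per second a full hour comes to 1,080,000 tokens, slightly over a 1M window, which is consistent with the "~1 hour" limit being approximate.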

Models

  • gemini-2.5-flash — fast, cheap, good for most use cases
  • gemini-2.5-pro — deeper analysis, longer videos
  • gemini-3-flash-preview — latest, best quality
