hn-podcast-transcribe

Download, transcribe, and archive Hacker News podcast episodes using Whisper. Use when: (1) transcribing HN podcast episodes, (2) archiving podcast transcripts as markdown, (3) listing available episodes from HN podcast RSS feeds, (4) batch-processing multiple episodes, (5) any "podcast transcription" or "audio to text" request involving HN or tech podcasts. Supports any podcast RSS feed with custom URL option.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "hn-podcast-transcribe" with this command: npx skills add terrycarter1985/hn-podcast-transcribe

HN Podcast Transcribe

Automated pipeline: RSS feed → download audio → Whisper transcription → markdown archive.

Quick Start

List episodes from a feed

python3 scripts/hn_podcast_feed.py --feed roundup --limit 10
python3 scripts/hn_podcast_feed.py --json --limit 50

Feeds: roundup (Hacker News Roundup), recap (Hacker News Recap), highlights (HN Highlights), or pass a custom URL with --url <RSS_URL>.

Download and transcribe a single episode

bash scripts/fetch_and_transcribe.sh <audio_url> ./output

Full archive pipeline (download → transcribe → markdown)

By direct audio URL:

bash scripts/archive_episode.sh "https://media.transistor.fm/ac6c95a2/8dc4e7fe.mp3" ./archive

Batch archive recent episodes (with dedup)

# Archive latest 5 from a feed, skip already-processed
bash scripts/batch_archive.sh roundup 5 ./archive
bash scripts/batch_archive.sh highlights 10 ./archive

Processed episodes are tracked in ./archive/.processed_episodes.json to avoid duplicate work.

Configuration

VariableDefaultDescription
WHISPER_MODELmediumWhisper model: tiny, base, small, medium, large, turbo
WHISPER_LANGenLanguage code for transcription

Use turbo for speed, large for best accuracy, tiny for quick testing.

Output

Each archived episode produces:

  • <stem>.md — Markdown with metadata header + full transcript
  • <stem>.mp3 — Original audio file

Batch runs maintain .processed_episodes.json for dedup tracking.

Requirements

  • whisper (pip install openai-whisper)
  • yt-dlp (optional, for non-direct URLs)
  • ffmpeg
  • Python 3.8+

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Context Shrink

Auto-compress session memories when context usage exceeds 85%

Registry SourceRecently Updated
6700bg1avd
General

VPS Command Runner

Run commands across multiple VPS simultaneously. Execute SSH commands, deploy updates, check logs, and manage services across all your servers from one place...

Registry SourceRecently Updated
General

Go-To-Market SalesClaw | Plan your outreach campaign

Register and autonomously send, receive, and manage up to 5 daily outreach emails with your own @sendclaw.com address for go-to-market campaigns.

Registry SourceRecently Updated
General

SalesClaw | Reach out to 5 existing contacts per day

Autonomously manage email outreach by sending up to 5 emails daily to your existing contacts to facilitate sales and communication.

Registry SourceRecently Updated