xiaoyuzhou-transcribe

Download a Xiaoyuzhou (小宇宙) episode from a public URL and generate subtitles (.srt) plus a transcript (.txt) using faster-whisper. Use when a user provides a xiaoyuzhoufm episode URL and wants the full transcript with timestamps, or needs the audio converted into SRT/TXT files.

Safety Notice

This listing is imported from the skills.sh public index metadata. Review the upstream SKILL.md and repository scripts before running them.

Install skill "xiaoyuzhou-transcribe" with this command: npx skills add weifenghuang/skillvault/weifenghuang-skillvault-xiaoyuzhou-transcribe

Xiaoyuzhou Transcribe

Overview

Generate SRT and TXT transcripts from a Xiaoyuzhou episode URL by downloading the audio and running faster-whisper locally. Use the bundled script to keep the workflow deterministic and repeatable.

Quick Start

  • Install dependency: python3 -m pip install -U faster-whisper
  • Run: python3 scripts/xiaoyuzhou_transcribe.py "<episode-url>" --output-dir .
  • Expect outputs: xiaoyuzhou-<eid>.mp3, xiaoyuzhou-<eid>.srt, xiaoyuzhou-<eid>.txt
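
As a rough illustration of the output naming above, the sketch below derives the three expected paths from an episode URL. It assumes the episode ID is the last path segment of a URL shaped like https://www.xiaoyuzhoufm.com/episode/<eid>; the helper names episode_id and output_paths are hypothetical and are not taken from the bundled script.

```python
from pathlib import Path
from urllib.parse import urlparse


def episode_id(url: str) -> str:
    """Return the last non-empty path segment of the episode URL."""
    # Assumption: the URL looks like https://www.xiaoyuzhoufm.com/episode/<eid>.
    segments = [s for s in urlparse(url).path.split("/") if s]
    if not segments:
        raise ValueError(f"no episode id found in {url!r}")
    return segments[-1]


def output_paths(url: str, output_dir: str = ".") -> dict[str, Path]:
    """Map each output kind (mp3, srt, txt) to its expected path."""
    stem = f"xiaoyuzhou-{episode_id(url)}"
    base = Path(output_dir)
    return {ext: base / f"{stem}.{ext}" for ext in ("mp3", "srt", "txt")}
```

Query strings and trailing slashes are ignored because only the URL path is inspected.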

Workflow

  • Fetch the episode page and parse __NEXT_DATA__ to locate the audio URL.
  • Download the audio (resume supported) unless --audio-path is provided.
  • Transcribe audio with faster-whisper and write SRT + TXT.
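
The first two steps can be sketched with the standard library alone. The example below is a minimal sketch, not the bundled script's logic: it extracts the __NEXT_DATA__ JSON from the page HTML, then searches the whole structure for the first string that looks like an audio URL, because the exact JSON path to the audio is not documented here. The file extensions and the helper names (load_next_data, find_audio_url, resume_headers) are assumptions made for illustration.

```python
import json
import re
from pathlib import Path

NEXT_DATA_RE = re.compile(
    r'<script[^>]*id="__NEXT_DATA__"[^>]*>(.*?)</script>', re.DOTALL
)
# Assumption: the audio file is served with one of these extensions.
AUDIO_EXTENSIONS = (".mp3", ".m4a")


def load_next_data(html: str) -> dict:
    """Parse the JSON payload embedded in the __NEXT_DATA__ script tag."""
    match = NEXT_DATA_RE.search(html)
    if match is None:
        raise ValueError("__NEXT_DATA__ script tag not found")
    return json.loads(match.group(1))


def find_audio_url(node):
    """Depth-first search for the first string that looks like an audio URL."""
    if isinstance(node, str):
        path = node.split("?", 1)[0].lower()
        if node.startswith("http") and path.endswith(AUDIO_EXTENSIONS):
            return node
        return None
    if isinstance(node, dict):
        children = node.values()
    elif isinstance(node, list):
        children = node
    else:
        children = ()
    for child in children:
        found = find_audio_url(child)
        if found:
            return found
    return None


def resume_headers(partial: Path) -> dict:
    """Build a Range header asking only for the bytes not yet on disk."""
    offset = partial.stat().st_size if partial.exists() else 0
    return {"Range": f"bytes={offset}-"} if offset else {}
```

A resumed download only works when the server honors the Range header and answers 206 Partial Content; if it answers 200 with the full body, the partial file should be overwritten rather than appended to.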

Script Usage

scripts/xiaoyuzhou_transcribe.py accepts:

  • --output-dir: write outputs to a specific directory
  • --model: whisper model size (tiny by default; use base or small for higher accuracy)
  • --language: force a language code (for example, zh) or allow auto-detection
  • --audio-path: transcribe a local audio file instead of downloading
  • --force-download: re-download the audio even if it exists
  • --no-vad: disable voice activity detection (VAD) filtering if it cuts segments too aggressively
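
One way these flags could map onto faster-whisper is sketched below. This is an illustration under assumptions, not the bundled script's actual parser: the defaults for --output-dir and --language, the CPU/int8 settings, and the helper names are all hypothetical. The faster-whisper calls used (WhisperModel and its transcribe method with the language and vad_filter arguments) are part of that library's public API.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags listed above."""
    parser = argparse.ArgumentParser(prog="xiaoyuzhou_transcribe.py")
    parser.add_argument("url", nargs="?", help="episode URL")
    parser.add_argument("--output-dir", default=".")  # assumed default
    parser.add_argument("--model", default="tiny")
    # Assumption: leaving the language unset lets Whisper auto-detect it.
    parser.add_argument("--language", default=None)
    parser.add_argument("--audio-path", default=None)
    parser.add_argument("--force-download", action="store_true")
    parser.add_argument("--no-vad", action="store_true")
    return parser


def transcribe_kwargs(args: argparse.Namespace) -> dict:
    """Translate CLI flags into keyword arguments for transcribe()."""
    return {"language": args.language, "vad_filter": not args.no_vad}


def transcribe(audio_path: str, args: argparse.Namespace):
    """Run faster-whisper on a local file and return (segments, info)."""
    # Imported lazily so argument handling works without the dependency.
    from faster_whisper import WhisperModel

    # Assumption: CPU inference with int8 weights, a common low-memory setup.
    model = WhisperModel(args.model, device="cpu", compute_type="int8")
    segments, info = model.transcribe(audio_path, **transcribe_kwargs(args))
    # transcribe() yields segments lazily; list() forces the actual decoding.
    return list(segments), info
```

Each returned segment exposes start, end, and text attributes, which is enough to build both output files.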

Notes

  • If the episode is private or requires login, the script cannot access it.
  • Xiaoyuzhou does not expose public transcripts for many episodes; this workflow generates subtitles via speech-to-text instead.
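
For reference, turning speech-to-text segments into the two output files can be sketched as follows. The input shape (start seconds, end seconds, text) and the one-line-per-segment TXT layout are assumptions; the bundled script may format its outputs differently.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm, the timestamp form SRT uses."""
    millis = max(0, round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def render_srt(segments) -> str:
    """Render (start, end, text) tuples as numbered SRT cues."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


def render_txt(segments) -> str:
    """Render the transcript as plain text, one segment per line."""
    return "\n".join(text.strip() for _, _, text in segments) + "\n"
```

Pass a list rather than a generator if the same segments feed both functions, since a generator is exhausted after the first pass.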
