bili-summary

Extract and summarize Bilibili videos. Fetches subtitles or GPU-transcribed audio, danmaku (scrolling comments), video comments, and description — outputs structured JSON for AI agents to summarize. Triggers: Bilibili video summary, summarize this video, what does this video say, bilibili video, B站视频总结, BV号, bilibili.com, video content, video summary, extract video text, video transcript.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "bili-summary" with this command: npx skills add gkd2323c/bilibili-video-summary

Bilibili Video Summary Tool

Extract full content from a Bilibili video — transcript/subtitles, danmaku, comments, and description — then use your own LLM capabilities to produce a deep summary. No external AI API required (no OpenAI / Gemini key needed).

Capabilities

Data SourceMethodPriority
CC SubtitlesBilibili APIFastest, used if available
Audio Transcriptionwhisper.cpp + Vulkan GPUAutomatic fallback when no subtitles
Video Descriptionyt-dlpAlways captured
Danmaku (scrolling comments)yt-dlpParsed, analyzed for frequent content
CommentsBilibili Comment APIHot-sorted, deduplicated, top liked extracted

Workflow

When you receive a Bilibili video link and are asked to summarize it, follow these steps:

Step 1: Extract all data

python bili-transcript.py "<video_url>"

The script automatically:

  1. Gets video title, uploader, duration, description
  2. Attempts Bilibili CC subtitles (fastest, used if available)
  3. Falls back to GPU transcription: download audio → convert to wav → whisper.cpp with Vulkan
  4. Downloads and analyzes danmaku (scrolling comments)
  5. Fetches video comments, sorted by likes

Output files are saved to ./bili-output/:

  • transcript.txt — full transcript/subtitle text
  • danmaku.json — danmaku data with statistics
  • comments.json — comment data with top-liked

The JSON output includes preview text, danmaku summary, and top comments.

Step 2: Read full transcript

The JSON preview truncates at 2000 characters. Read the full file:

cat ./bili-output/transcript.txt

Step 3: Read danmaku and comments

Review community response data:

cat ./bili-output/danmaku.json
cat ./bili-output/comments.json

Step 4: Compose your summary

Use your own LLM capabilities to produce a comprehensive summary. Suggested structure:

Video Overview — Title, uploader, duration, transcription source (subtitle / GPU). Key info from the description (project links, update notes, etc.).

Core Content — What the video is about. Fluent paragraph summary of the main narrative.

Key Points — Notable arguments, data points, or information worth highlighting.

Community Response (optional) — Reactions from danmaku and comments. Skip if content is insubstantial (spam, trolling, no valuable discussion).

  • Danmaku analysis: look for frequently repeated phrases (community memes/reactions), informative questions, technical discussions, controversy points
  • Comment analysis: look for top-liked opinions, creator interactions, user-reported issues, technical insights

Assessment (optional) — Content quality, information density, notable strengths or weaknesses.

Available Actions

# Video metadata only
python bili-transcript.py "<URL>" --action info

# CC subtitles only (if available)
python bili-transcript.py "<URL>" --action subtitle

# Force GPU transcription (skip subtitle check)
python bili-transcript.py "<URL>" --action transcribe

# Danmaku only
python bili-transcript.py "<URL>" --action danmaku

# Comments only
python bili-transcript.py "<URL>" --action comments

# Custom output directory
python bili-transcript.py "<URL>" --output ./my-output

Environment Variables

VariablePurpose
WHISPER_CPP_DIRPath to whisper.cpp directory (containing whisper-cli)
WHISPER_MODELPath to whisper model file (e.g., ggml-large-v3-turbo.bin)
BILI_OUTPUT_DIRDefault output directory (default: ./bili-output)

Performance Reference

Video LengthTotal TimeNotes
5 minutes~15sGPU transcription is fast
12 minutes~22sDownload + convert + transcribe
1 hour~2-3 minDepends on audio density
Danmaku/Comments~5-10sDepends on comment volume

Dependencies

  • Python packages: yt-dlp, av (PyAV)
  • Transcription engine: whisper.cpp with Vulkan support (optional, only needed if no CC subtitles)
  • Model: ggml-large-v3-turbo.bin (~1.6GB, download separately)
  • GPU: Any Vulkan-compatible GPU (NVIDIA, AMD, Intel) — auto-detected
  • No external AI API keys required

Limitations

  • Requires internet access to Bilibili
  • Some content requires login (paid courses, restricted videos) — may fail
  • Danmaku and comment APIs may be rate-limited
  • whisper.cpp does not support m4a; script auto-converts via PyAV
  • Very long videos (>2 hours) take significant transcription time; try --action subtitle first
  • Comments are fetched from the first 3 pages (~60 comments); may not cover very hot videos fully

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Values

Run a Moral Graph Elicitation interview when the user expresses a strong feeling, a goal, a norm, or a difficult choice in a way that suggests an underlying...

Registry SourceRecently Updated
General

小红书推流算法与内容优化

小红书推流算法分析与内容优化。当用户需要:(1) 了解小红书流量机制/推流算法,(2) 优化小红书标题、封面、正文,(3) 提升帖子互动率/ CES评分,(4) 制定小红书内容策略,(5) 分析为什么帖子流量低。触发场景包括「小红书怎么推流」「为什么流量不好」「帮我优化标题/封面」等。

Registry SourceRecently Updated
General

Dxf Text Extractor

提取并清理建筑结构 CAD DXF 文件中的文本,自动将探索者(TSSD)等特殊钢筋符号映射为标准规范(如 HRB400)。当用户上传 DXF 文件,或要求“提取DXF文字”、“解析结构图纸”、“读取CAD文本”、“处理结构说明”时触发使用。

Registry SourceRecently Updated
General

Brown Uni

布朗大学是常春藤盟校,位于普罗维登斯,以开放课程体系和高学术自由著称,录取率约5%。

Registry SourceRecently Updated