Bilibili Transcript v2.0
High-accuracy Bilibili video transcription with multi-language AI subtitle support.
Overview
This skill provides a complete transcription workflow for Bilibili videos:
- Extract Video Metadata - Title, author, publish date, duration
- Smart Subtitle Detection - Priority: CC subtitles → AI subtitles (multi-language) → Whisper transcription
- Multi-language AI Subtitle Support - Auto-detects: `ai-zh`, `ai-en`, `ai-ja`, `ai-es`, `ai-ar`, `ai-pt`, `ai-ko`, `ai-de`, `ai-fr`
- Browser Cookie Support - WSL Chromium or Windows Edge for member-only videos
- Formatted Output - Saves as structured TXT file with metadata + summary placeholder + full transcript
- Simplified Chinese - Automatically converts Traditional to Simplified Chinese
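The Traditional-to-Simplified step can be sketched as follows. This is a minimal illustration, not the script's actual code: the function name `to_simplified` is hypothetical, and the pass-through branch is an assumption based on opencc being listed as optional.

```shell
# Convert Traditional to Simplified Chinese when opencc is installed;
# otherwise copy the text through unchanged (assumed behavior).
# $1 = input text file, $2 = output text file
to_simplified() {
  in_file=$1
  out_file=$2
  if command -v opencc >/dev/null 2>&1; then
    # t2s.json is OpenCC's stock Traditional-to-Simplified profile.
    opencc -c t2s.json -i "$in_file" -o "$out_file"
  else
    cp "$in_file" "$out_file"
  fi
}
```

Example call: `to_simplified raw_transcript.txt transcript.txt` (both file names are placeholders).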
What's New in v2.0
- ✅ Multi-language AI subtitles - Supports 9 languages: Chinese, English, Japanese, Spanish, Arabic, Portuguese, Korean, German, French
- ✅ WSL Chromium support - Better cookie extraction than Windows Edge
- ✅ Correct subtitle download - Uses `--write-subs --write-auto-subs` combo
- ✅ Language auto-detection - Automatically finds available AI subtitle language
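To make the flag combination concrete, here is a rough sketch of a subtitle-only yt-dlp invocation. The helper name `subtitle_args` is hypothetical, and pairing the two flags with `--skip-download` and `--sub-langs` is an assumption about how the script calls yt-dlp, not a confirmed detail.

```shell
# Print the yt-dlp arguments for a subtitle-only fetch, one per line.
# $1 = comma-separated language list (e.g. "ai-zh,ai-en"), $2 = video URL
subtitle_args() {
  printf '%s\n' \
    --skip-download \
    --write-subs \
    --write-auto-subs \
    --sub-langs "$1" \
    "$2"
}

# Equivalent direct call (not executed here):
#   yt-dlp --skip-download --write-subs --write-auto-subs \
#     --sub-langs "ai-zh,ai-en" "https://www.bilibili.com/video/BVxxxxx"
```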
AI Subtitle Language Codes
Bilibili uses the ai- prefix for AI-generated subtitles:
| Code | Language | 语言 |
|---|---|---|
| ai-zh | Chinese | 中文 |
| ai-en | English | 英文 |
| ai-ja | Japanese | 日文 |
| ai-es | Spanish | 西班牙文 |
| ai-ar | Arabic | 阿拉伯文 |
| ai-pt | Portuguese | 葡萄牙文 |
| ai-ko | Korean | 韩文 |
| ai-de | German | 德文 |
| ai-fr | French | 法文 |
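One way to auto-detect a language from the codes in the table is sketched below. The function name `pick_ai_lang` is hypothetical, and the preference order (table order) is an assumption; the real script may rank languages differently.

```shell
# Print the first ai-* code from the preference list that appears in $1,
# a whitespace-separated list of subtitle codes available for the video.
# Returns non-zero when no AI subtitle language is available.
pick_ai_lang() {
  available=$1
  for code in ai-zh ai-en ai-ja ai-es ai-ar ai-pt ai-ko ai-de ai-fr; do
    case " $available " in
      *" $code "*) printf '%s\n' "$code"; return 0 ;;
    esac
  done
  return 1
}
```

For example, `pick_ai_lang "ai-ja ai-en"` prints `ai-en`, because English precedes Japanese in the assumed preference order.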
Requirements
Hardware (Your Setup)
- GPU: NVIDIA RTX 4070 Super (12GB VRAM) - ✅ Well suited to the Whisper medium model
- WSL Memory: 16GB (configured)
- WSL CPU: 6 cores (configured)
Software
- `yt-dlp` - Video/audio download
- `ffmpeg` - Audio processing
- `whisper` - Speech-to-text (local, no API key)
- `opencc` - Traditional to Simplified Chinese conversion (optional)
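A quick way to confirm the tools are installed before running the script is sketched below. The helper `missing_tools` is hypothetical and not part of the skill.

```shell
# Print the name of every tool in the argument list that is not on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Example (not executed here):
#   missing_tools yt-dlp ffmpeg whisper   # required tools
#   missing_tools opencc                  # optional tool
```

Empty output means every listed tool was found.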
Browser (for AI subtitles)
- WSL Chromium (recommended) - Log in to Bilibili in WSL
- Windows Edge - Alternative option
Workflow
Step 1: Run Transcription Script
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BVxxxxx"
Priority order:
- CC Subtitles (manual) - Fastest, highest accuracy
- AI Subtitles (auto-generated) - Fast, good accuracy, multi-language
- Whisper Transcription - Slowest, ~95% accuracy, works for all videos
Step 2: Generate Detailed Summary
After the script completes, read the generated TXT file and:
- Read the full transcript (Part 2, 第二部分)
- Generate a comprehensive summary and write it into Part 1 (第一部分), replacing the placeholder
- Save the updated file
Step 3: Present to User
In Discord, post:
- Brief summary in message
- Attach the TXT file for full content
Set Up WSL Chromium Login
For best results with AI subtitles:
- Start WSL Chromium: `chromium-browser &`
- Navigate to bilibili.com
- Log in with your Bilibili account
- Run the transcription script
The script will automatically use Chromium's cookies to access member-only AI subtitles.
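The cookie hand-off can be approximated with yt-dlp's `--cookies-from-browser` option. In this sketch the helper name `cookie_args` and the default profile path `~/.config/chromium` are assumptions; a snap-installed Chromium, for instance, keeps its profile elsewhere.

```shell
# Print yt-dlp cookie arguments when a Chromium profile directory exists,
# and print nothing otherwise.
# $1 = profile directory to probe (optional; the default is an assumption)
cookie_args() {
  profile_dir=${1:-"$HOME/.config/chromium"}
  if [ -d "$profile_dir" ]; then
    printf '%s\n' --cookies-from-browser chromium
  fi
}

# Equivalent direct call (not executed here):
#   yt-dlp --cookies-from-browser chromium --list-subs "<video URL>"
```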
Usage Examples
Example 1: Basic Transcription (Default Output)
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/"
# Output: workspace/Bilibili transcript/[VideoTitle]_BVxxxxx_transcript.txt
Example 2: Custom Output Directory
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/" ~/Documents
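The BV id in these URLs is what later appears in the output file name. A minimal sketch of extracting it follows; the function name `extract_bvid` is hypothetical, and the pattern assumes the `/video/BV...` URL shape used in the examples above.

```shell
# Print the BV id found after /video/ in a Bilibili URL.
# Prints nothing when the URL does not match that shape.
extract_bvid() {
  printf '%s\n' "$1" | sed -n -E 's#.*/video/(BV[0-9A-Za-z]+).*#\1#p'
}

extract_bvid "https://www.bilibili.com/video/BV1Z1wJzgEAj/"   # prints BV1Z1wJzgEAj
```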
Notes
Model Selection
- Your config: RTX 4070 Super 12GB + 16GB RAM + 6 cores
- Default: `medium` model (~95% accuracy, balanced speed) ✅
- Fallback: If GPU unavailable, automatically uses CPU (slower)
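The GPU-to-CPU fallback might look roughly like this. Probing with `nvidia-smi` and the function names `pick_device` and `transcribe` are assumptions for illustration; the whisper CLI can usually choose a device on its own, so the explicit check mainly makes the choice visible.

```shell
# Print "cuda" when an NVIDIA GPU responds to nvidia-smi, else "cpu".
pick_device() {
  if command -v nvidia-smi >/dev/null 2>&1 && nvidia-smi >/dev/null 2>&1; then
    echo cuda
  else
    echo cpu
  fi
}

# Transcribe with the medium model on the detected device.
# $1 = audio file, $2 = output directory
transcribe() {
  whisper "$1" --model medium --device "$(pick_device)" \
    --output_format txt --output_dir "$2"
}
```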
Accuracy Comparison
| Source | Accuracy | Speed | Best For |
|---|---|---|---|
| CC Subtitles | 100% | ⚡ Instant | All videos with manual subtitles |
| AI Subtitles (ai-zh) | ~90% | ⚡ Instant | Chinese videos |
| AI Subtitles (ai-en) | ~85% | ⚡ Instant | English videos |
| Whisper medium | ~95% | 🐢 Slow | Videos without subtitles |
Default Output Directory
- Location: `workspace/Bilibili transcript/`
- Created automatically on first run
- All transcript files organized in one place
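Resolving the output directory could be sketched as follows. The function name `resolve_output_dir` is hypothetical; the default path is the one documented above, taken relative to the current directory as an assumption.

```shell
# Print the output directory, creating it if needed.
# $1 = custom directory (optional); defaults to the documented location.
resolve_output_dir() {
  out_dir=${1:-"workspace/Bilibili transcript"}
  mkdir -p "$out_dir" && printf '%s\n' "$out_dir"
}
```

The directory name contains a space, so any variable holding it must stay quoted when passed to other commands.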
File Naming
Output files are named: [VideoTitle]_[BVID]_transcript.txt
- Special characters (including Chinese punctuation) are replaced with underscores
- Title truncated to 50 characters
- Example: `股票分红_是从左口袋掏右口袋吗_BV1ddzUYTE27_transcript.txt`
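The naming rules can be approximated with `sed`. This is a sketch under assumptions: the function names are hypothetical, runs of special characters collapse into one underscore, leading and trailing underscores are trimmed before truncation, and a UTF-8 locale is needed for Chinese characters to be kept as letters.

```shell
# Replace runs of non-alphanumeric characters with "_", trim underscores
# at both ends, then keep at most the first 50 characters.
sanitize_title() {
  printf '%s\n' "$1" \
    | sed -E 's/[^[:alnum:]]+/_/g; s/^_+//; s/_+$//' \
    | sed -E 's/^(.{50}).*$/\1/'
}

# Build the output file name from a title ($1) and a BV id ($2).
transcript_name() {
  printf '%s_%s_transcript.txt\n' "$(sanitize_title "$1")" "$2"
}

transcript_name "Hello, World! (2024)" "BV1ddzUYTE27"
# prints Hello_World_2024_BV1ddzUYTE27_transcript.txt
```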
Subtitle Priority
The script tries subtitles in this order:
- Manual CC subtitles (zh-CN, zh-TW, en, ja, etc.)
- AI subtitles (any available language: ai-zh, ai-en, ai-ja, etc.)
- Whisper voice transcription (fallback)
This order favors the fastest sources and falls back to Whisper only when no subtitles are available.
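The three-tier decision can be sketched as a small function over the subtitle codes a video offers. The name `choose_source` and the `cc:`/`ai:` output labels are hypothetical. Skipping a `danmaku` track (the comment overlay that yt-dlp may list next to real subtitles) is also an assumption.

```shell
# Decide which transcript source to use.
# $1 = whitespace-separated list of available subtitle codes (may be empty)
choose_source() {
  available=$1
  # 1. Manual CC subtitles: any code that is neither ai-* nor danmaku.
  for code in $available; do
    case $code in
      ai-*|danmaku) ;;
      *) printf 'cc:%s\n' "$code"; return 0 ;;
    esac
  done
  # 2. AI subtitles: the first ai-* code in the list.
  for code in $available; do
    case $code in
      ai-*) printf 'ai:%s\n' "$code"; return 0 ;;
    esac
  done
  # 3. Nothing usable: fall back to Whisper.
  printf 'whisper\n'
}
```

For example, `choose_source "ai-zh zh-CN"` prints `cc:zh-CN`, since a manual track wins over an AI track regardless of list order.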