Video-Learner
Declared Requirements
This skill requires the following tools and access:
- Tools: python3, yt-dlp, ffmpeg, whisper (Python package), node
- Dependency: douyin-download (requires node)
- Write path: ~/.openclaw/workspace/skills/
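The requirements above could be checked before the skill runs. The following is a minimal sketch, not the skill's actual implementation: it assumes each command-line tool is installed under the executable name listed above, and the helper names `missing_tools` and `whisper_importable` are hypothetical.

```python
import importlib.util
import shutil
from typing import Callable, Iterable, List, Optional

# Assumption: executables are installed under the names listed in the requirements.
REQUIRED_TOOLS = ["python3", "yt-dlp", "ffmpeg", "node"]


def missing_tools(
    tools: Iterable[str] = REQUIRED_TOOLS,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the tools that cannot be found on PATH (hypothetical helper)."""
    return [tool for tool in tools if which(tool) is None]


def whisper_importable() -> bool:
    """Report whether the whisper Python package can be located, without importing it."""
    return importlib.util.find_spec("whisper") is not None
```

Calling `missing_tools()` with no arguments checks the real PATH; the `which` parameter exists only so the lookup can be replaced in tests.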
Trigger
When the user sends a Douyin/BiliBili/YouTube video link, this skill is invoked.
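One way the trigger could recognize a supported link is by matching the URL's host name. This is a sketch under assumptions: the host suffixes in `PLATFORM_HOSTS` and the function name `classify_video_url` are illustrative and are not taken from the skill itself.

```python
from typing import Optional
from urllib.parse import urlparse

# Assumed host suffixes for each platform; adjust to the links actually seen.
PLATFORM_HOSTS = {
    "douyin": ("douyin.com",),
    "bilibili": ("bilibili.com", "b23.tv"),
    "youtube": ("youtube.com", "youtu.be"),
}


def classify_video_url(url: str) -> Optional[str]:
    """Return the platform name for a supported link, or None otherwise."""
    parsed = urlparse(url.strip())
    if parsed.scheme not in ("http", "https"):
        return None
    host = (parsed.hostname or "").lower()
    for platform, suffixes in PLATFORM_HOSTS.items():
        for suffix in suffixes:
            # Match the bare domain or any subdomain, but not look-alike hosts.
            if host == suffix or host.endswith("." + suffix):
                return platform
    return None
```

Matching on the parsed host rather than on a substring of the whole URL avoids treating a link such as `https://notyoutube.com/...` as a YouTube link.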
Processing Flow
- Create temp directory in /tmp/ for video download
- Download video using yt-dlp or douyin-download
- Extract audio using ffmpeg
- Transcribe audio to text using Whisper (local)
- Analyze text content using the agent's LLM capability
- Display analysis results to user
- After user confirmation, write the generated SKILL.md to ~/.openclaw/workspace/skills/<new-skill-name>/
- Delete temp video files after processing
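The download, audio extraction, transcription, and cleanup steps above can be sketched as follows. This is an illustration under stated assumptions rather than the skill's actual code: the function names, the `video-learner-` directory prefix, the mono 16 kHz WAV format, and the `base` Whisper model are all choices made for the example. Only the yt-dlp path is shown, because the command-line interface of douyin-download is not described here.

```python
import subprocess
import tempfile
from pathlib import Path
from typing import List


def download_cmd(url: str, workdir: Path) -> List[str]:
    # -o sets yt-dlp's output filename template.
    return ["yt-dlp", "-o", str(workdir / "video.%(ext)s"), url]


def extract_audio_cmd(video: Path, audio: Path) -> List[str]:
    # -vn drops the video stream; mono 16 kHz WAV is an assumed choice.
    return ["ffmpeg", "-y", "-i", str(video), "-vn",
            "-ac", "1", "-ar", "16000", str(audio)]


def transcribe(audio: Path, model_name: str = "base") -> str:
    # Imported lazily so the command builders work without whisper installed.
    import whisper
    model = whisper.load_model(model_name)
    return model.transcribe(str(audio))["text"]


def video_to_text(url: str) -> str:
    # The temp directory and its contents are removed when the block exits,
    # including when a step raises an exception.
    with tempfile.TemporaryDirectory(dir="/tmp", prefix="video-learner-") as tmp:
        workdir = Path(tmp)
        subprocess.run(download_cmd(url, workdir), check=True)
        video = next(workdir.glob("video.*"))
        audio = workdir / "audio.wav"
        subprocess.run(extract_audio_cmd(video, audio), check=True)
        return transcribe(audio)
```

Using `tempfile.TemporaryDirectory` as a context manager ties cleanup to the end of processing, so the video and audio files do not outlive the run even if a step fails.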
Security Notes
- Whisper: Runs locally, no audio sent to external services
- Content analysis: Uses the agent's own model capability
- Temp files: Automatically cleaned up after processing
- User confirmation: A newly generated Skill is written only after explicit user approval
- Storage: Generated Skills are stored in ~/.openclaw/workspace/skills/ and can be deleted at any time
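The confirmation and storage notes above could be enforced in code along these lines. This is a hedged sketch: `save_skill`, `delete_skill`, and the naming rule in `NAME_RE` are hypothetical and only illustrate refusing to write without approval and keeping writes inside the skills directory.

```python
import re
import shutil
from pathlib import Path

SKILLS_DIR = Path.home() / ".openclaw" / "workspace" / "skills"
# Assumed naming rule: lowercase letters, digits, and single hyphens only.
NAME_RE = re.compile(r"[a-z0-9]+(?:-[a-z0-9]+)*")


def _checked_dir(name: str, base_dir: Path) -> Path:
    # Rejecting slashes and dots keeps the path inside base_dir.
    if not NAME_RE.fullmatch(name):
        raise ValueError(f"invalid skill name: {name!r}")
    return base_dir / name


def save_skill(name: str, content: str, confirmed: bool,
               base_dir: Path = SKILLS_DIR) -> Path:
    """Write SKILL.md for a new skill, but only after user approval."""
    if not confirmed:
        raise PermissionError("user has not approved this skill")
    skill_dir = _checked_dir(name, base_dir)
    skill_dir.mkdir(parents=True, exist_ok=True)
    target = skill_dir / "SKILL.md"
    target.write_text(content, encoding="utf-8")
    return target


def delete_skill(name: str, base_dir: Path = SKILLS_DIR) -> None:
    """Remove a generated skill directory and everything in it."""
    shutil.rmtree(_checked_dir(name, base_dir))
```

For example, `save_skill("demo-skill", text, confirmed=True)` would write `~/.openclaw/workspace/skills/demo-skill/SKILL.md`, while a name such as `../evil` is rejected before any file is touched.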
Limitations
- Only processes user-provided video links
- Does not proactively fetch other content
- Does not store video files