whisper-transcriber

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cross-platform: macOS/Linux; Windows via WSL2 (recommended).

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "whisper-transcriber" with this command: npx skills add vvusu/whisper-transcriber

🎤 Whisper Transcriber(离线语音转文字)

基于 whisper.cpp / whisper-cli 的离线语音识别技能。

快速使用

  • 安装依赖(跨平台自动检测):
bash <SKILL_DIR>/scripts/install.sh
  • 转写一个音频文件:
bash <SKILL_DIR>/scripts/transcribe.sh voice.ogg
  • 批量转写目录 + 输出 SRT:
bash <SKILL_DIR>/scripts/transcribe.sh ./recordings -b -s

跨平台说明(Windows 推荐 WSL2)

  • macOS / Linux:使用 scripts/install.sh
  • Windows:推荐 WSL2(Ubuntu),在 WSL 内按 Linux 方式运行本 skill(最稳)

WSL 内安装/使用:

bash <SKILL_DIR>/scripts/install.sh
bash <SKILL_DIR>/scripts/transcribe.sh voice.ogg

说明:原生 Windows 环境下 whisper-cli 的安装来源/包名不统一,公开发布时容易造成安装失败;因此本 skill 默认建议 WSL2。

可配置项(环境变量)

无需改脚本,直接用环境变量覆盖默认值:

  • WHISPER_DEFAULT_MODEL(默认 base)
  • WHISPER_DEFAULT_LANG(默认 zh)
  • WHISPER_MODEL_DIR(默认 <SKILL_DIR>/assets/models
  • WHISPER_MODEL_DIR_USER(默认同 WHISPER_MODEL_DIR自动下载的默认目录;如你想把模型放到别处再覆盖此变量)
  • WHISPER_TEMP_DIR(默认 ${TMPDIR:-/tmp};每次运行会在其下 mktemp 创建独立临时目录并自动清理)

目录结构

  • scripts/transcribe.sh:主转写脚本(支持批量、SRT/TXT/JSON)
  • scripts/install.sh:跨平台安装依赖 + 可选下载模型
  • scripts/install.ps1:Windows(非 WSL)best-effort 安装/下载(不作为默认推荐路径)
  • assets/models/:模型默认下载/存放位置(仓库通过 .gitignore 忽略大模型文件,避免提交/发布)
  • config.json:默认配置(发布/维护用,脚本通过环境变量覆盖即可)

需要更细的用法/参数说明:直接运行 bash <SKILL_DIR>/scripts/transcribe.sh --help

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Web Research Assistant

AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.

Registry SourceRecently Updated
2.5K3phheng
Coding

Pandoc Convert

Convert documents between 40+ formats using pandoc CLI. Handles Markdown ↔ Word ↔ PDF ↔ HTML ↔ LaTeX ↔ EPUB with smart defaults, professional templates, and comprehensive tooling.

Registry SourceRecently Updated
Coding

Monitored Ralph Loop

Generate copy-paste bash scripts for Ralph Wiggum/AI agent loops (Codex, Claude Code, OpenCode, Goose). Use when asked for a "Ralph loop", "Ralph Wiggum loop", or an AI loop to plan/build code via PROMPT.md + AGENTS.md, SPECS, and IMPLEMENTATION_PLAN.md, including PLANNING vs BUILDING modes, backpressure, sandboxing, and completion conditions.

Registry SourceRecently Updated
1.3K0endogen
Coding

Forge

Autonomous quality engineering swarm that forges production-ready code through continuous behavioral verification, exhaustive E2E testing, and self-healing fix loops. Combines DDD+ADR+TDD methodology with BDD/Gherkin specifications, 7 quality gates, defect prediction, chaos testing, and cross-context dependency awareness. Architecture-agnostic — works with monoliths, microservices, modular monoliths, and any bounded-context topology.

Registry SourceRecently Updated