# WhisperX Speech Recognition Skill

Local, offline speech-to-text: a WhisperX-powered speech recognition skill for OpenClaw. It runs up to 30x faster than standard OpenAI Whisper, fully offline, with no API key required.
## Features
- Pure ASR: Converts voice messages to text only — no voice replies generated
- Fully offline: Model runs locally, no internet or API key needed
- Word-level timestamps: Precise per-word time alignment
- 90+ languages: Includes auto language detection
- Speaker diarization: Optional, requires a HuggingFace token
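The optional speaker diarization can be turned on from the command line; a minimal sketch, assuming the `--diarize` and `--hf_token` flags of recent WhisperX releases and a HuggingFace token with access to the pyannote models (`hf_xxx` below is a placeholder, not a real token):

```bash
# Label each transcript segment with a speaker (replace hf_xxx with your token)
whisperx --diarize --hf_token hf_xxx path/to/audio.wav

# Optionally bound the speaker count to help the clustering step
whisperx --diarize --hf_token hf_xxx --min_speakers 2 --max_speakers 4 path/to/audio.wav
```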
## Installation
```bash
# Install ffmpeg (macOS)
brew install ffmpeg

# Install ffmpeg (Ubuntu/Debian)
sudo apt-get install ffmpeg

# Install WhisperX
pip install whisperx

# or run it without a permanent install, via uvx:
uvx whisperx
```
GPU users: ensure CUDA 12.8 is installed for faster inference.
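Before relying on GPU inference, it is worth confirming the GPU is actually visible; a sketch, assuming an NVIDIA driver (which provides `nvidia-smi`) and the `torch` package that WhisperX pulls in as a dependency:

```bash
# Check that the NVIDIA driver sees the GPU
nvidia-smi

# Check that PyTorch can use CUDA (prints True on a working setup)
python -c "import torch; print(torch.cuda.is_available())"
```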
## Usage
```bash
# Basic transcription (auto-detect language)
whisperx path/to/audio.wav

# Specify model and language
whisperx --model small --language zh path/to/audio.wav

# CPU mode (low memory)
whisperx --model small --device cpu --compute_type int8 path/to/audio.wav
```
## Notes
- Dependencies: `whisperx`, `ffmpeg`
- Supported formats: MP3, WAV, OGG, FLAC, M4A, OPUS, AAC, and all other ffmpeg-supported formats
- Model cache: models are downloaded automatically to `~/.cache/whisper/` on first run
- Recommended models: `base` or `small` for CPU; `large-v3` for GPU
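Since WhisperX is a plain CLI, transcribing a whole folder of recordings is just a shell loop; a sketch, assuming `whisperx` is on `PATH` and a hypothetical `recordings/` directory of WAV files:

```bash
# Transcribe every .wav in recordings/ on CPU, collecting output in transcripts/
mkdir -p transcripts
for f in recordings/*.wav; do
  whisperx --model small --device cpu --compute_type int8 "$f" --output_dir transcripts/
done
```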