local-vosk

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "local-vosk" with this command: npx skills add sfkiwi/local-vosk

Local Vosk STT

Lightweight local speech-to-text using Vosk. Fully offline after model download.

Use Cases

  • Telegram voice messages — transcribe .ogg voice notes automatically
  • Audio files — any format ffmpeg supports
  • Offline transcription — no API keys, no cloud, no costs

Quick Start

# Transcribe Telegram voice message
./skills/local-vosk/scripts/transcribe voice_message.ogg

# Transcribe any audio
./skills/local-vosk/scripts/transcribe audio.mp3

# With language (default: en-us)
./skills/local-vosk/scripts/transcribe audio.wav --lang en-us

Supported Formats

Any format ffmpeg can decode: ogg (Telegram), mp3, wav, m4a, webm, flac, etc.

Models

Default model: vosk-model-small-en-us-0.15 (~40MB)

Other models available at https://alphacephei.com/vosk/models

Setup (if not installed)

pip3 install vosk --user --break-system-packages

# Download model
mkdir -p ~/vosk-models && cd ~/vosk-models
wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
unzip vosk-model-small-en-us-0.15.zip

Notes

  • Quality is good for conversational speech
  • For higher accuracy, use larger models or faster-whisper
  • Processes audio at ~10x realtime on typical hardware
  • Telegram voice messages are .ogg format — works out of the box

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Zip

Zip - command-line tool for everyday use

Registry SourceRecently Updated
General

Youtube Script

YouTube视频脚本、标题A/B测试、缩略图文案、SEO优化、开头Hook、章节标记。YouTube script writer with title testing, thumbnail copy, SEO optimization, hooks, chapter markers. Use when you...

Registry SourceRecently Updated
1760ckchzh
General

Topmediai AI Music Generator

Generate AI music, BGM, or lyrics via TopMediai API. Supports auto polling and two-stage output (preview first, then final full audio) for generation tasks.

Registry SourceRecently Updated
General

Yamlcheck

YAML validator and formatter. Validate YAML syntax, pretty-print with proper indentation, convert between YAML and JSON, and lint YAML files for common issues.

Registry SourceRecently Updated