Speech Recognition

# Speech Recognition 语音识别

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "Speech Recognition" with this command: npx skills add yvanboyang/speech-recognition-forqq

Speech Recognition 语音识别

将 AMR/SILK 格式的语音转换为文字。

环境要求

  • Python 虚拟环境:source /opt/conda/bin/activate py314
  • 依赖包:pysilk, faster-whisper
  • 模型路径:/opt/workspace/yby_workspace/whisper-model

使用方式

from skills.speech_recognition import transcribe_audio

text = transcribe_audio("/path/to/audio.amr")
print(text)

实现逻辑

  1. 读取 AMR/SILK 格式文件
  2. 使用 pysilk 解码为 PCM 数据
  3. 保存为 WAV 文件
  4. 使用 faster-whisper 转写为文字
  5. 返回识别结果

支持格式

  • QQ 语音:.amr (SILK_V3 编码)
  • 标准 AMR:amr, amrnb, amrwb

依赖安装

source /opt/conda/bin/activate py314
pip install pysilk faster-whisper

模型下载

需要从 HuggingFace 下载 faster-whisper 模型:

python3 -m huggingface_hub snapshot-download \
  --repo-type model \
  --repo-id Systran/faster-whisper-base \
  --local-dir /opt/workspace/yby_workspace/whisper-model

需要的文件:

  • model.bin
  • config.json
  • tokenizer.json
  • vocabulary.txt

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Expedy

Expedy integration. Manage Organizations, Pipelines, Users, Filters. Use when the user wants to interact with Expedy data.

Registry SourceRecently Updated
General

Evenium

Evenium integration. Manage Events, Users, Roles. Use when the user wants to interact with Evenium data.

Registry SourceRecently Updated
General

Exhibitday

ExhibitDay integration. Manage Organizations. Use when the user wants to interact with ExhibitDay data.

Registry SourceRecently Updated
General

Enigma

Enigma integration. Manage Deals, Persons, Organizations, Leads, Projects, Activities and more. Use when the user wants to interact with Enigma data.

Registry SourceRecently Updated