whisper-stt

Speech-to-text - transcribe audio files to text using OpenAI Whisper

Install skill "whisper-stt" with this command: npx skills add qiaotucodes/openclaw-skill-whisper-stt

Whisper Speech-to-Text Skill

Recognizes speech in audio or voice files and converts it to text.

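Usage

Automatic processing (recommended)

When the user sends a voice message or audio file, the skill automatically:

  1. Recognizes the speech content
  2. Converts it to text
  3. Sends the text to Feishu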
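
The three steps above can be sketched as a small pipeline. This is a minimal sketch, not the skill's actual code: `handle_audio`, `whisper_transcribe`, and the `send_text` callback are hypothetical names, and delivery to Feishu is left as an injected callable because the source does not show how the skill talks to Feishu.

```python
from typing import Callable


def handle_audio(
    audio_path: str,
    transcribe: Callable[[str], str],
    send_text: Callable[[str], None],
) -> str:
    """Transcribe one audio file and forward the text (hypothetical helper)."""
    # Steps 1 and 2: recognize the speech and obtain plain text.
    text = transcribe(audio_path).strip()
    # Step 3: deliver the text; in the skill this would be a Feishu message.
    if text:
        send_text(text)
    return text


def whisper_transcribe(audio_path: str, model_name: str = "base") -> str:
    """Transcribe with openai-whisper; imported lazily so the sketch loads without it."""
    import whisper

    model = whisper.load_model(model_name)
    return model.transcribe(audio_path)["text"]
```

Calling `handle_audio("audio.mp3", whisper_transcribe, print)` would print the transcript instead of sending it, which is a convenient way to try the flow locally.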

Manual command

After the user sends an audio file, the skill processes it automatically.

Supported formats

  • MP3, WAV, M4A, OGG, FLAC, WebM
  • Any other audio format that FFmpeg can decode
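
A quick pre-check of incoming files might look like the following. It is a sketch under assumptions: `is_listed_format` and `can_decode` are hypothetical helpers, the extension set simply mirrors the list above, and the decode probe assumes an `ffmpeg` binary on PATH.

```python
import subprocess
from pathlib import Path

# Extensions listed in this document; other formats depend on the local FFmpeg build.
LISTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".ogg", ".flac", ".webm"}


def is_listed_format(path: str) -> bool:
    """Return True if the file extension is one of the formats listed above."""
    return Path(path).suffix.lower() in LISTED_EXTENSIONS


def can_decode(path: str) -> bool:
    """Ask a local ffmpeg binary to decode the file and discard the output."""
    try:
        proc = subprocess.run(
            ["ffmpeg", "-v", "error", "-i", path, "-f", "null", "-"],
            stdin=subprocess.DEVNULL,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            check=False,
        )
    except FileNotFoundError:
        return False  # ffmpeg is not installed or not on PATH
    return proc.returncode == 0
```

The extension check is cheap and covers the common cases; the decode probe is slower but answers the "anything FFmpeg supports" case directly.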

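Model selection

Model    Memory required (approx.)   Speed         Accuracy
tiny     ~1 GB                       Fastest       Basic
base     ~1 GB                       (not given)   Average
small    ~2 GB                       Medium        Fairly good
medium   ~5 GB                       Slower        Very good
large    ~10 GB                      Slowest       Best
turbo    ~6 GB                       (not given)   Close to large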
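
One way to use the table is to pick the most accurate model that fits the memory at hand. The sketch below is illustrative only: `pick_model` is a hypothetical helper, the numbers are the approximate figures from the table, and ranking turbo above medium is an assumption based on the "close to large" entry.

```python
# Approximate memory needs in GB, copied from the table above.
MODEL_MEMORY_GB = {
    "tiny": 1,
    "base": 1,
    "small": 2,
    "medium": 5,
    "turbo": 6,
    "large": 10,
}

# Preference order, most accurate first (assumed ranking, see the lead-in).
PREFERENCE = ["large", "turbo", "medium", "small", "base", "tiny"]


def pick_model(available_gb: float) -> str:
    """Return the first model in PREFERENCE that fits in available_gb."""
    for name in PREFERENCE:
        if MODEL_MEMORY_GB[name] <= available_gb:
            return name
    # Nothing fits comfortably; fall back to the smallest model.
    return "tiny"
```

For example, `pick_model(8)` selects "turbo", and the returned name can be passed straight to `whisper.load_model`.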

Technical implementation

import whisper

# Load the model (downloaded on first use)
model = whisper.load_model("base")  # options: tiny/base/small/medium/large/turbo

# Transcribe the audio
result = model.transcribe("audio.mp3")

# Print the recognized text
print(result["text"])
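
Besides "text", the dictionary returned by `transcribe` also carries a "segments" list whose entries have "start" and "end" times in seconds and their own "text". A minimal sketch of turning that into timestamped lines follows; `fmt_time` and `format_segments` are hypothetical helper names, not part of the skill.

```python
def fmt_time(seconds: float) -> str:
    """Format a duration in seconds as HH:MM:SS (fractions are truncated)."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_segments(result: dict) -> str:
    """Render each segment of a transcription result as '[start - end] text'."""
    lines = []
    for seg in result.get("segments", []):
        start = fmt_time(seg["start"])
        end = fmt_time(seg["end"])
        lines.append(f"[{start} - {end}] {seg['text'].strip()}")
    return "\n".join(lines)
```

With the `model` object from the snippet above, `print(format_segments(model.transcribe("audio.mp3", language="zh")))` would print one line per segment; passing `language="zh"` pins the language instead of relying on auto-detection and is shown only as an example.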

Dependencies

  • Python 3.8+
  • PyTorch
  • openai-whisper
  • ffmpeg
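
The dependency list can be checked before the first run with a few standard-library calls. This is a sketch: `check_dependencies` is a hypothetical helper, and it only tests that the packages are importable and that an `ffmpeg` executable is on PATH, not that their versions are compatible.

```python
import importlib.util
import shutil
import sys


def check_dependencies() -> dict:
    """Report which of the listed dependencies are present (True/False per item)."""
    return {
        "python>=3.8": sys.version_info >= (3, 8),
        # find_spec returns None when a top-level module cannot be found.
        "torch": importlib.util.find_spec("torch") is not None,
        "openai-whisper": importlib.util.find_spec("whisper") is not None,
        # ffmpeg must be an executable on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_dependencies().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```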

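Notes

  1. The first use downloads the model weights, from under 100 MB for tiny to roughly 3 GB for large
  2. Larger models need more memory (up to about 10 GB for large)
  3. Chinese speech is recognized well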
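
Because loading a model is slow and memory-hungry, a long-running process may want to load each model once and reuse it. The following is a minimal sketch of that idea, assuming a hypothetical `get_model` wrapper; `download_root` is the `whisper.load_model` parameter that controls where the weights are stored.

```python
from functools import lru_cache
from typing import Optional


@lru_cache(maxsize=None)
def get_model(name: str = "base", download_root: Optional[str] = None):
    """Load a Whisper model once per (name, download_root) and cache it in memory."""
    # Imported lazily so that importing this module does not require whisper.
    import whisper

    # The first call for a given model downloads the weights if they are missing.
    return whisper.load_model(name, download_root=download_root)
```

Repeated calls such as `get_model("base")` return the same object, so the download and load cost is paid only once per process.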
