volcengine-stt

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, transcribe Telegram/Discord voice notes via Volcengine, or build a reusable STT skill for other OpenClaw agents.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "volcengine-stt" with this command: npx skills add reed1898/volcengine-stt

Volcengine STT

Use this skill to run speech-to-text through Volcengine.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

Default behavior:

Endpoint: ${ARK_BASE_URL:-https://ark.cn-beijing.volces.com/api/v3}/audio/transcriptions
Model: ${ARK_STT_MODEL:-doubao-seed-asr-1-0}
Auth header: Authorization: Bearer $ARK_API_KEY
Output file: <input>.txt

Required env

ARK_API_KEY (required)

Optional:

ARK_BASE_URL (default: https://ark.cn-beijing.volces.com/api/v3)
ARK_STT_MODEL (default: doubao-seed-asr-1-0)

Useful flags

# Save plain text to custom path
{baseDir}/scripts/transcribe.sh ./voice.ogg --out /tmp/voice.txt

# Force model
{baseDir}/scripts/transcribe.sh ./voice.ogg --model doubao-seed-asr-1-0

# Return raw JSON (for debugging/integration)
{baseDir}/scripts/transcribe.sh ./voice.ogg --json --out /tmp/voice.json

# Hint language/prompt when needed
{baseDir}/scripts/transcribe.sh ./voice.ogg --language zh --prompt "中英混合，保留术语"

Integration notes

For OpenClaw voice-message handling, call this script instead of Whisper script.
Keep keys in machine-local config or env, never commit secrets.
If your Volcengine account uses a different model name, pass --model or set ARK_STT_MODEL.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Fitbit Tracker

Personal Fitbit integration for daily health tracking with adaptive sleep and activity reporting

Registry SourceRecently Updated

1922crabsticksalad

General

Ollama Load Balancer

Ollama load balancer for Llama, Qwen, DeepSeek, and Mistral inference across multiple machines. Load balancing with auto-discovery via mDNS, health checks, q...

Registry SourceRecently Updated

2570twinsgeeks

General

Google Merchant Center

Google Merchant Center integration. Manage Accounts. Use when the user wants to interact with Google Merchant Center data.

Registry SourceRecently Updated

1960membranedev

General

Twitter/X All-in-One — Search, Monitor & Publish Text & Media Posts

Searches and reads X (Twitter): profiles, timelines, mentions, followers, tweet search, trends, lists, communities, and Spaces. Publishes posts, likes/unlike...

Registry SourceRecently Updated

2401aisadocs