Category: provider
Volcengine 语音合成 TTS
Validation
mkdir -p output/volcengine-ai-audio-tts
python -m py_compile skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/volcengine-ai-audio-tts/validate.txt
Pass criteria: command exits 0 and output/volcengine-ai-audio-tts/validate.txt is generated.
Output And Evidence
- Save generated audio files, request payloads, and response metadata to
output/volcengine-ai-audio-tts/. - Keep one validation log per execution.
Prerequisites
- Python 3.8+. No extra SDK required (uses
requestsand stdlib). - Install:
pip install requests - Set environment variables (from Volcengine 豆包语音控制台):
VOLCENGINE_TTS_APP_ID— 应用 IDVOLCENGINE_TTS_TOKEN— 应用 Token(用于 Authorization: Bearer;${token})VOLCENGINE_TTS_CLUSTER— 业务集群,如volcano_tts(标准音色)
Optional: use .env in repo root or script directory; script will load them.
Normalized interface (tts.generate)
Request
text(string, required) — 合成文本,UTF-8,单次建议 ≤1024 字节voice_type(string, required) — 音色,见 发音人参数列表,如BV700_streamingencoding(string, optional) — 编码格式:pcm|wav|mp3|ogg_opus,默认mp3rate(int, optional) — 采样率 8000/16000/24000,默认 24000speed_ratio(float, optional) — 语速 [0.2, 3],默认 1.0volume_ratio(float, optional) — 音量 [0.1, 3],默认 1.0pitch_ratio(float, optional) — 音高 [0.1, 3],默认 1.0language(string, optional) — 语言,如cn
Response
audio_path(string) — 本地保存的音频文件路径sample_rate(int)format(string)duration_ms(string, when returned by API)code(int) — 3000 表示成功
Quick start (Python script)
# 使用内联 JSON 请求
python skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py \
--request '{"text":"你好,这是一段测试语音。","voice_type":"BV700_streaming"}' \
--output output/volcengine-ai-audio-tts/audio/out.mp3
# 使用请求文件
python skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py \
--file request.json \
--output output/volcengine-ai-audio-tts/audio/out.wav \
--print-response
Operational guidance
- 每次请求的
reqid需唯一,脚本内使用 UUID。 - 长文本请分段多次调用或使用异步长文本接口。
- 音色与
cluster需与控制台一致;复刻音色使用 speaker id 作为voice_type。 - 遇 429 请降低并发或增加间隔。
Output location
- Default output:
output/volcengine-ai-audio-tts/audio/ - Override base dir with
OUTPUT_DIR.
Workflow
- Confirm user intent, text, voice, and output format.
- Run one minimal request to verify credentials and cluster/voice_type.
- Execute the target synthesis with explicit parameters.
- Verify results and save output/evidence files.
References
references/api_reference.md— 请求/响应参数与错误码- 在线语音合成 API - HTTP 一次性合成
- 参数基本说明
- 发音人参数列表
- Source list:
references/sources.md