zvukogram

Text-to-Speech via Zvukogram API with SSML support. Use when you need to generate speech from text, create podcasts, voice notifications, or work with audio. Supports speed control, stress marks, English word transcription, audio fragment merging, rich SSML references, and podcast-oriented TTS patterns.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "zvukogram" with this command: npx skills add erview/zvukogram

Zvukogram TTS

Speech generation via Zvukogram API with SSML markup support.

Requirements

To use this skill, you need:

Setup

Create file ~/.config/zvukogram/config.json:

mkdir -p ~/.config/zvukogram
{
  "token": "your_api_token_here",
  "email": "your_email@example.com"
}

Or use environment variables:

export ZVUKOGRAM_TOKEN=your_api_token_here
export ZVUKOGRAM_EMAIL=your_email@example.com

Quick Start

# Simple TTS
python3 scripts/tts.py --text "Hello, world!" --voice Алена --output hello.mp3

# With +20% speed
python3 scripts/tts.py --text "Fast text" --voice Алена --speed 1.2 --output fast.mp3

# Check balance
python3 scripts/balance.py

Features

  • TTS generation — text to speech
  • SSML support — stress marks, pauses, speed
  • Audio merging — combine fragments via ffmpeg
  • Transcription — proper pronunciation of English words

SSML Markup

Stress Marks

Use + before stressed vowel:

З+амок — stress on "a"
зам+ок — stress on "o"

Aliases (Transcription)

<sub alias="Оупен Эй Ай">OpenAI</sub>
<sub alias="Самсунг">Samsung</sub>
<sub alias="Ал+ьтман">Альтман</sub>

Speed

<prosody rate="1.2">20% faster</prosody>
<prosody rate="fast">Fast text</prosody>

Pauses

<break time="500ms"/>

Available Voices

  • Алена — female, neutral (recommended)
  • Андрей — male, neutral (recommended)
  • Александра — female, soft
  • Антон — male, business

Full list: see references/VOICES.md

Examples

See references/EXAMPLES.md for:

  • Dialogs and podcasts
  • News voiceover
  • Voice notifications
  • Long texts

Transcription

See references/TRANSCRIPTION.md for proper pronunciation:

  • OpenAI → Оупен Эй Ай
  • GPT → Джи Пи Ти
  • Samsung → Самсунг
  • Altman → Ал+ьтман

SSML Reference

Troubleshooting

See references/TROUBLESHOOTING.md for:

  • API errors
  • Audio issues
  • Diagnostics

API Reference

API Limits / gotchas

  • /text: max 1000 characters per request
  • /longtext and /subs: up to 1,000,000 characters
  • Multi-voice in API: generate and merge fragments (one request per voice). Do not rely on <voice> wrappers.

Links

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

AIWolfPK - AI狼人杀

四个AI互相猜疑,你坐着看戏。每局30秒,到底谁是狼? Four AIs play Werewolf while you watch. 30s per round. Spot the wolf before they do.

Registry SourceRecently Updated
General

Project Analyzer

Analyze any project directory and produce a detailed report covering what the project does, its tech stack, folder structure, entry points, how to run it, an...

Registry SourceRecently Updated
General

Thought-Retriever

提炼对话回答中的核心洞察为高置信度知识晶体,存储于本体驱动记忆系统的自我进化与复用。

Registry SourceRecently Updated
General

Miaoji Bid Guard Pro

亚马逊广告护城河Pro版,90天ROI预测+多活动协同+季节性出价+关键词攻防矩阵。 从单次调价建议升级为完整的广告战役规划。基础功能可使用 miaoji-bid-guard 免费版。

Registry SourceRecently Updated