elevenlabs-tts

ElevenLabs Text-to-Speech

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "elevenlabs-tts" with this command: npx skills add inference-sh/skills@elevenlabs-dialogue

ElevenLabs Text-to-Speech

Premium text-to-speech with 22+ voices via inference.sh CLI.

Quick Start

Requires inference.sh CLI (infsh ). Install instructions

infsh login

Generate speech with ElevenLabs

infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

Available Models

Model ID Best For Latency

Multilingual v2 eleven_multilingual_v2

Highest quality, 32 languages ~250ms

Turbo v2.5 eleven_turbo_v2_5

Balance of speed & quality ~150ms

Flash v2.5 eleven_flash_v2_5

Ultra-low latency ~75ms

Voice Library

Female Voices

Voice Style

aria

American, conversational

alice

British, confident

bella

American, warm

jessica

American, expressive

laura

American, professional

lily

British, soft

sarah

American, friendly

Male Voices

Voice Style

george

British, authoritative

adam

American, deep

bill

American, mature

brian

American, conversational

callum

Transatlantic, intense

charlie

Australian, natural

chris

American, casual

daniel

British, commanding

eric

American, friendly

harry

American, young

liam

American, articulate

matilda

American, warm

river

American, confident

roger

American, authoritative

will

American, bright

Examples

Basic Speech

infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'

Choose a Model

Highest quality

infsh app run elevenlabs/tts --input '{ "text": "This is our premium multilingual model with the best quality.", "voice": "aria", "model": "eleven_multilingual_v2" }'

Ultra-fast for real-time applications

infsh app run elevenlabs/tts --input '{ "text": "Flash model for low-latency applications.", "voice": "brian", "model": "eleven_flash_v2_5" }'

Voice Tuning

infsh app run elevenlabs/tts --input '{ "text": "Fine-tune the voice characteristics for your use case.", "voice": "bella", "stability": 0.3, "similarity_boost": 0.9, "style": 0.4 }'

Parameter Range Effect

stability

0-1 Higher = more consistent, lower = more expressive

similarity_boost

0-1 Higher = closer to original voice character

style

0-1 Higher = more style exaggeration

use_speaker_boost

true/false Enhances speaker clarity

Output Formats

High-quality MP3

infsh app run elevenlabs/tts --input '{ "text": "High quality audio output.", "voice": "daniel", "output_format": "mp3_44100_192" }'

Format Description

mp3_44100_128

MP3 at 44.1kHz, 128kbps (default)

mp3_44100_192

MP3 at 44.1kHz, 192kbps

pcm_16000

Raw PCM at 16kHz

pcm_22050

Raw PCM at 22.05kHz

pcm_24000

Raw PCM at 24kHz

pcm_44100

Raw PCM at 44.1kHz

Multilingual

ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

Spanish

infsh app run elevenlabs/tts --input '{ "text": "Hola, bienvenidos a nuestra presentación.", "voice": "aria", "model": "eleven_multilingual_v2" }'

French

infsh app run elevenlabs/tts --input '{ "text": "Bonjour, bienvenue à notre démonstration.", "voice": "alice", "model": "eleven_multilingual_v2" }'

Voice + Video Workflow

1. Generate voiceover

infsh app run elevenlabs/tts --input '{ "text": "Introducing the future of AI-powered content creation.", "voice": "george" }' > voiceover.json

2. Create talking head video

infsh app run bytedance/omnihuman-1-5 --input '{ "image_url": "https://portrait.jpg", "audio_url": "<audio-url-from-step-1>" }'

Use Cases

  • Voiceovers: Product demos, explainer videos, commercials

  • Audiobooks: Long-form narration with consistent voices

  • Podcasts: AI hosts with natural delivery

  • E-learning: Course narration in multiple languages

  • Accessibility: High-quality screen reader content

  • IVR: Professional phone system messages

  • Video Narration: Documentary and social media content

Related Skills

ElevenLabs multi-speaker dialogue

npx skills add inference-sh/skills@elevenlabs-dialogue

ElevenLabs voice changer

npx skills add inference-sh/skills@elevenlabs-voice-changer

ElevenLabs sound effects

npx skills add inference-sh/skills@elevenlabs-sound-effects

All TTS models (Kokoro, DIA, Chatterbox, and more)

npx skills add inference-sh/skills@text-to-speech

Full platform skill (all 150+ apps)

npx skills add inference-sh/skills@infsh-cli

Browse all audio apps: infsh app list --category audio

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

nano-banana-2

Nano Banana 2 - Gemini 3.1 Flash Image Preview

Repository Source
72.2K164inferen-sh
General

qwen-image-2

Qwen-Image - Alibaba Image Generation

Repository Source
71.9K164inferen-sh
General

p-video

Pruna P-Video Generation

Repository Source
71.9K164inferen-sh
General

qwen-image-2-pro

Qwen-Image Pro - Professional Image Generation

Repository Source
71.8K164inferen-sh
elevenlabs-tts | V50.AI