comfyui-tts

Generate speech audio using ComfyUI Qwen-TTS service. Invoke when user needs text-to-speech conversion or voice generation through ComfyUI.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "comfyui-tts" with this command: npx skills add yhsi5358/comfyui-tts

ComfyUI TTS Skill

Generate speech audio using ComfyUI's Qwen-TTS service. This skill allows you to convert text to speech through ComfyUI's API.

Configuration

Environment Variables

Set these environment variables to configure the ComfyUI connection:

export COMFYUI_HOST="localhost"      # ComfyUI server host
export COMFYUI_PORT="8188"           # ComfyUI server port
export COMFYUI_OUTPUT_DIR=""         # Optional: Custom output directory

Usage

Basic Text-to-Speech

Generate audio from text using default settings:

scripts/tts.sh "你好,世界"

Advanced Options

Customize voice characteristics:

# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"

# Change model size
scripts/tts.sh "你好" --model "3B"

# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"

# Combine options
scripts/tts.sh "你好,这是测试" \
  --character "Girl" \
  --style "Emotional" \
  --model "1.7B" \
  --output "~/audio/test.wav"

Available Options

OptionDescriptionDefault
--characterVoice character (Girl/Boy/etc.)"Girl"
--styleSpeaking style (Emotional/Neutral/etc.)"Emotional"
--modelModel size (0.5B/1.7B/3B)"1.7B"
--outputOutput file pathAuto-generated
--temperatureGeneration temperature (0-1)0.9
--top-pTop-p sampling0.9
--top-kTop-k sampling50

Workflow

The skill performs these steps:

  1. Construct Workflow: Builds a ComfyUI workflow JSON with your text and settings
  2. Submit Job: Sends the workflow to ComfyUI's /prompt endpoint
  3. Poll Status: Monitors job completion via /history endpoint
  4. Retrieve Audio: Returns the path to the generated audio file

Troubleshooting

Connection Refused

  • Verify ComfyUI is running: curl http://$COMFYUI_HOST:$COMFYUI_PORT/system_stats
  • Check host and port settings

Job Timeout

  • Large models (3B) take longer to generate
  • Try smaller models (0.5B, 1.7B) for faster results

Output Not Found

  • Check ComfyUI's output directory configuration
  • Verify file permissions

API Reference

The skill uses ComfyUI's native API endpoints:

  • POST /prompt - Submit workflow
  • GET /history - Check job status
  • Output files are saved to ComfyUI's configured output directory

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Fitbit Tracker

Personal Fitbit integration for daily health tracking with adaptive sleep and activity reporting

Registry SourceRecently Updated
General

Ollama Load Balancer

Ollama load balancer for Llama, Qwen, DeepSeek, and Mistral inference across multiple machines. Load balancing with auto-discovery via mDNS, health checks, q...

Registry SourceRecently Updated
General

Google Merchant Center

Google Merchant Center integration. Manage Accounts. Use when the user wants to interact with Google Merchant Center data.

Registry SourceRecently Updated
General

Twitter/X All-in-One — Search, Monitor & Publish Text & Media Posts

Searches and reads X (Twitter): profiles, timelines, mentions, followers, tweet search, trends, lists, communities, and Spaces. Publishes posts, likes/unlike...

Registry SourceRecently Updated