voiceclaw-jp

Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their OpenClaw agent by voice, set up a voice assistant, or add speech input/output to OpenClaw. Supports configurable wake words, VOICEVOX TTS, and sentence-level streaming for low-latency responses.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "voiceclaw-jp" with this command: npx skills add kentoku24/voiceclaw-jp

voiceclaw

Voice conversation skill for OpenClaw: wake word → STT → LLM (streaming) → TTS → playback.

Requirements

  • OpenClaw running locally (gateway with chatCompletions enabled)
  • Node.js 18+
  • VOICEVOX running on localhost:50021 (download)
  • Chrome/Edge (Web Speech API for STT)
  • HTTPS for remote mic access (localhost works without HTTPS)

Quick Start

# Install
git clone https://github.com/kentoku24/voiceclaw.git
cd voiceclaw
npm install

# Start (no .env needed if OpenClaw is running locally)
npm start
# → [voiceclaw] OpenClaw config loaded from ~/.openclaw/openclaw.json
# → [voiceclaw] listening on http://127.0.0.1:8788

# Open browser
open http://127.0.0.1:8788

Press 開始, say the wake word (default: アリス), then speak your command.

Configuration

All settings are optional. Set in .env or environment variables:

VariableDefaultDescription
WAKE_WORDSアリスちゃん,アリス,...Comma-separated wake words
STT_LANGja-JPSpeech recognition language
OPENCLAW_MODELopenclawLLM model name
VOICEVOX_URLhttp://127.0.0.1:50021VOICEVOX endpoint
VOICEVOX_SPEAKER1VOICEVOX speaker ID
HOST127.0.0.1Server bind address
PORT8788Server port

Gateway token is auto-detected from ~/.openclaw/openclaw.json. Override with OPENCLAW_GATEWAY_TOKEN if needed.

Architecture

Wake word (browser STT) → voiceclaw server → OpenClaw Gateway (streaming)
                                           → sentence-level TTS (VOICEVOX)
                                           → audio playback (Web Audio API)

See docs/architecture.md for the full sequence diagram.

API Endpoints

MethodPathDescription
GET/healthHealth check
GET/api/configClient-safe settings (wake words, STT lang)
POST/api/chat-streamStreaming LLM → sentence-level SSE
POST/api/chatNon-streaming LLM (fallback)
POST/api/ttsText → VOICEVOX → WAV audio

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Notion MCP

Notion MCP integration with managed authentication. Query databases, create and update pages, manage blocks. Use this skill when users want to interact with...

Registry SourceRecently Updated
General

Baserow

Baserow API integration with managed API key authentication. Manage database rows, fields, and tables. Use this skill when users want to read, create, update...

Registry SourceRecently Updated
General

Sunsama

Sunsama MCP integration with managed authentication. Manage daily tasks, calendar events, backlog, objectives, and time tracking. Use this skill when users w...

Registry SourceRecently Updated
General

Unbounce

Unbounce API integration with managed OAuth. Build and manage landing pages, track leads, and analyze conversion data. Use this skill when users want to inte...

Registry SourceRecently Updated