screen-narrator

Live narration of your macOS screen activity with Gemini vision + ElevenLabs speech.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "screen-narrator" with this command: npx skills add buddyh/narrator

Screen Narrator

This skill maps to the upstream narrator repo implementation.

It runs Gemini-vision narration styles (sports, nature, horror, noir, reality_tv, asmr, wrestling) and ElevenLabs TTS, with optional dual-lane narration and live control via JSON files.

Source of truth

Use the repo install:

cd /Users/buddy/narrator
/Users/buddy/narrator/.venv/bin/python -m narrator sports --help

Setup

cd /Users/buddy/narrator
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Required environment:

  • GEMINI_API_KEY
  • ELEVENLABS_API_KEY
  • optional: ELEVENLABS_VOICE_ID

Runtime control commands

Start live narration in a tmux session (preferred):

tmux new-session -d -s narrator "cd /Users/buddy/narrator && /Users/buddy/narrator/.venv/bin/python -m narrator sports --control-file /tmp/narrator-ctl.json --status-file /tmp/narrator-status.json"

Start with timer:

tmux new-session -d -s narrator "cd /Users/buddy/narrator && /Users/buddy/narrator/.venv/bin/python -m narrator wrestling --time 5m --control-file /tmp/narrator-ctl.json --status-file /tmp/narrator-status.json"

Change style on the fly:

echo '{"command": "style", "value": "horror"}' > /tmp/narrator-ctl.json

Set profanity:

echo '{"command": "profanity", "value": "low"}' > /tmp/narrator-ctl.json

Pause / resume:

echo '{"command": "pause"}' > /tmp/narrator-ctl.json
echo '{"command": "resume"}' > /tmp/narrator-ctl.json

Stop:

tmux kill-session -t narrator

Check status:

cat /tmp/narrator-status.json

Notes

  • macOS only (screen capture + TTS/audio).
  • This OpenClaw skill wrapper is aligned to the /Users/buddy/narrator implementation to avoid drift between docs and runtime.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Jetson CUDA Voice Pipeline

Fully offline, CUDA-accelerated local voice assistant pipeline for NVIDIA Jetson. Wake word (openWakeWord) → real-time VAD → whisper.cpp GPU STT → LLM → Pipe...

Registry SourceRecently Updated
General

Triumvirate Protocol

Orchestrates multi-AI debates with identity-aware context, tracking rounds and threads, and synthesizing verdicts and insights across providers.

Registry SourceRecently Updated
General

Maxhub Zhihu

知乎数据采集与分析。当用户提到知乎、zhihu、问答等相关需求时激活此Skill。

Registry SourceRecently Updated
1150xiewxx
General

Maxhub Reddit

Reddit数据采集。当用户提到reddit、社区、帖子等相关需求时激活此Skill。

Registry SourceRecently Updated
1110xiewxx