senseaudio-conversation-rehearsal

Use when a user wants to rehearse a high-pressure conversation such as a performance review, reporting meeting, promotion defense, difficult manager conversation, or stakeholder alignment session, using AudioClaw ASR for spoken rehearsal intake, AudioClaw TTS or an authorized cloned voice for the counterpart, and transcript-based debriefing on tone, structure, and communication risk.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "senseaudio-conversation-rehearsal" with this command: npx skills add kikidouloveme79/senseaudio-conversation-rehearsal

AudioClaw Conversation Rehearsal

What this skill is for

This skill is for realistic conversation rehearsal in high-pressure situations:

  • 汇报述职
  • 向上沟通
  • 绩效面谈
  • 晋升答辩
  • 难搞老板或强势同事沟通
  • 需要脱敏的正式谈话

It is designed to simulate the other person speaking back, not just generate a script.

Default stance

Use two voice modes:

  • proxy_voice
    • Recommended default
    • Use a role-appropriate system voice and behavior style
  • authorized_clone
    • Only use when the voice sample is explicitly authorized for rehearsal or internal training
    • Best official path: clone on the AudioClaw platform first, then pass the prepared clone voice_id
    • A prepared cloned voice id commonly looks like vc-..., and can be passed directly with --prepared-clone-voice-id

Do not default to cloning a real person's voice without clear permission.

Workflow

  1. Define the rehearsal:
    • scenario
    • counterpart role
    • relationship
    • talk topic
    • desired outcome
    • fear triggers
    • difficulty
  2. Run scripts/build_rehearsal_blueprint.py.
  3. Decide voice mode:
    • proxy voice
    • authorized clone
  4. Run the live loop in your agent stack:
    • counterpart turn via TTS
    • user spoken reply via ASR
    • if you want faster perceived intake, enable stream ASR
    • agent judges tone, structure, and progress
    • use scripts/build_counterpart_turn.py to generate the next counterpart reply
    • use scripts/senseaudio_counterpart_tts.py to synthesize that reply
    • official clone chain: prepare the clone on the AudioClaw platform first and pass the resulting voice_id
    • if that voice_id is a clone id like vc-..., counterpart TTS now auto-routes to SenseAudio-TTS-1.5
    • optional experimental path: if an authorized platform token is available, use scripts/senseaudio_clone_workspace.py to inspect clone slots or attempt a rehearsal-only clone from an authorized sample
    • if the user wants to actually hear the counterpart turns in Feishu or AudioClaw, use --send-feishu-audio or run scripts/send_rehearsal_counterparts_to_feishu.py
  5. After the session, run scripts/analyze_rehearsal_transcript.py.
  6. Produce a debrief:
    • weak openings
    • over-explaining
    • vague asks
    • missing evidence
    • apologetic or defensive tone
    • better rewrites

AudioClaw Trigger Pattern

Use this skill as a structured multi-turn rehearsal mode.

Recommended user trigger:

开始演练,用 $senseaudio-conversation-rehearsal。
场景:manager_update
对方身份:strict_manager
主题:项目延期说明
目标:获得补救方案认可
害怕点:被打断,被质疑执行力
难度:medium
prepared clone voice_id:your_clone_voice_id
后面我发语音,和我进行多轮演练,最后给我复盘。

The agent should:

  1. Collect the rehearsal slots first.
  2. Build the blueprint.
  3. Enter rehearsal mode, with reply mode defaulting to voice.
  4. Start the scene with the opening counterpart turn as voice, not text.
  5. For every later rehearsal turn:
    • transcribe with scripts/senseaudio_asr.py
    • generate the next counterpart turn
    • synthesize that turn with proxy voice or the prepared clone voice_id
    • in ongoing rehearsal mode, default to --send-feishu-audio so the counterpart turns are sent as Feishu audio messages without needing the user to repeat that request
    • only fall back to text-first replies if the user explicitly asks for text-only output or the channel cannot play voice
  6. End with scripts/analyze_rehearsal_transcript.py and return a concrete debrief.

Rehearsal mode should be sticky inside the same session:

  • Keep the same scenario, counterpart role, relationship, topic, desired outcome, fear triggers, difficulty, and chosen voice_id
  • Keep voice reply as the default from the opening turn onward until the user explicitly says to switch back to text replies or exit rehearsal mode
  • If the user says "直接发语音给我练" or "每轮都发语音", treat that as confirming the same sticky voice mode rather than a one-turn exception

If the user asks to "use the cloned voice", interpret that as:

  • use a platform-prepared clone voice_id when available
  • otherwise pause and ask for the clone voice_id or fall back to proxy_voice

Design rules

  • Prioritize behavior realism over exact voice likeness.
  • Treat the public documented clone flow and the experimental workspace automation flow as separate paths.
  • For scary-counterpart scenarios, structure the rehearsal in phases:
    • opening pressure
    • pushback
    • challenge question
    • close
  • Evaluate both:
    • what the user said
    • how the user said it
  • Keep debrief concrete and operational.

API key lookup

For this skill, use SENSEAUDIO_API_KEY as the default API key source again.

Practical rule:

  • scripts/run_live_rehearsal_session.py, scripts/run_complete_rehearsal_service.py, and scripts/senseaudio_counterpart_tts.py now default to SENSEAUDIO_API_KEY
  • If the host app injects SENSEAUDIO_API_KEY as a login token such as v2.public..., the shared bootstrap replaces it with the real sk-... value from ~/.audioclaw/workspace/state/senseaudio_credentials.json before the rehearsal call starts

Resources

  • scripts/build_rehearsal_blueprint.py
    • Builds a structured rehearsal plan and counterpart persona
  • scripts/build_counterpart_turn.py
    • Generates the next counterpart turn from rehearsal state and the user's latest reply
  • scripts/senseaudio_asr.py
    • Transcribes user spoken rehearsal turns with the official AudioClaw HTTP ASR API
  • scripts/senseaudio_counterpart_tts.py
    • Synthesizes a counterpart turn using a safe proxy voice or an explicitly authorized clone voice_id
  • scripts/run_live_rehearsal_session.py
    • Runs a multi-turn live rehearsal session from user audio replies, counterpart generation, TTS, and automatic debrief
    • Supports --stream-asr and --send-feishu-audio
  • scripts/send_rehearsal_counterparts_to_feishu.py
    • Reuses the Feishu voice delivery path to send the generated counterpart turns one by one as audio messages
  • scripts/senseaudio_clone_workspace.py
    • Lists clone slots, lists available voices, and creates an authorized rehearsal clone through the official AudioClaw workspace endpoints, preferring a platform token and otherwise trying a logged-in Chrome browser session
  • scripts/senseaudio_platform_token.py
    • Resolves an AudioClaw workspace platform token from env or a logged-in Chrome AudioClaw tab when Apple Events JavaScript is enabled
  • scripts/run_complete_rehearsal_service.py
    • One entry point that builds the blueprint, optionally resolves a prepared clone voice_id or attempts experimental workspace clone automation, runs the live rehearsal session, and writes a summary bundle
    • Supports --send-feishu-audio so the rehearsal counterpart can proactively send voice turns to Feishu or AudioClaw-linked chats
  • scripts/analyze_rehearsal_transcript.py
    • Scores a rehearsal transcript for tone and communication risks
  • references/live_rehearsal_loop.md
    • A minimal multi-turn runtime pattern for AudioClaw or another agent orchestrator
  • references/rehearsal_design.md
    • Product design, safety policy, and rollout plan

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

老人跌倒检测技能

Utilizes vision and radar technology for contactless detection of falls. It triggers alarms within seconds and is suitable for home safety monitoring of elde...

Registry SourceRecently Updated
General

Conversation Focus

对话焦点管理 + 需求澄清引导。每次对话开始时自动分析用户意图,确保需求清晰、限制明确,避免模型分散注意力。自动对接到 self-improving 进行持续优化。

Registry SourceRecently Updated
General

AIWolfPK - AI狼人杀

四个AI互相猜疑,你坐着看戏。每局30秒,到底谁是狼? Four AIs play Werewolf while you watch. 30s per round. Spot the wolf before they do.

Registry SourceRecently Updated
General

Project Analyzer

Analyze any project directory and produce a detailed report covering what the project does, its tech stack, folder structure, entry points, how to run it, an...

Registry SourceRecently Updated