senseaudio-asr

Build and troubleshoot SenseAudio speech recognition integrations, including HTTP transcription (`/v1/audio/transcriptions`), realtime WebSocket ASR (`/ws/v1/audio/transcriptions`), audio quality analysis (`/v1/audio/analysis`), and recognition record queries (`/v1/audio/records`). Use this skill whenever the user asks for speech-to-text, diarization, translation, streaming ASR, or ASR model and parameter selection.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.


Install skill "senseaudio-asr" with this command: npx skills add scikkk/senseaudio-asr

SenseAudio ASR

Use this skill for all SenseAudio speech recognition tasks.

Credential source: read the API key from SENSEAUDIO_API_KEY and send it only in the Authorization: Bearer ... header. Do not place API keys in query parameters, logs, transcripts, or saved examples.
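
The credential rule above can be sketched in a few lines of Python. This is a minimal illustration, not part of the skill itself: the helper names `build_auth_headers` and `redact` are hypothetical, and the only details taken from the text are the `SENSEAUDIO_API_KEY` variable and the `Authorization: Bearer ...` header.

```python
import os


def build_auth_headers(env=os.environ):
    """Build request headers, reading the key only from SENSEAUDIO_API_KEY."""
    api_key = env.get("SENSEAUDIO_API_KEY", "").strip()
    if not api_key:
        # Fail early instead of sending an unauthenticated request.
        raise RuntimeError("SENSEAUDIO_API_KEY is not set")
    # The key travels only in the Authorization header, never in a URL.
    return {"Authorization": f"Bearer {api_key}"}


def redact(headers):
    """Return a copy of the headers with the bearer token masked for logging."""
    return {
        name: ("Bearer ***" if name.lower() == "authorization" else value)
        for name, value in headers.items()
    }
```

Passing the result of `redact(...)` to a logger, rather than the raw headers, keeps the key out of logs and saved examples.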

Read First

  • references/asr.md

Workflow

  1. Pick recognition mode:
     • HTTP file transcription for offline audio.
     • WebSocket for realtime streaming microphone/audio chunks.
     • Audio analysis for noise and quality checks before recognition.
     • Records query for recent recognition history lookup.
  2. Choose model by feature needs:
     • Lite for low-cost basic transcription.
     • ASR for streaming, translation, diarization, sentiment, and timestamps.
     • Pro when diarization plus explicit max_speakers control is needed.
     • DeepThink for streaming, translation, and intelligent editing; do not send language, diarization, sentiment, timestamps, ITN, or punctuation controls.
  3. Build minimal request:
     • Required auth, file/audio format, model.
     • Add optional controls only when needed.
     • Keep uploaded files at or below 10MB; split longer audio before sending.
  4. Validate compatibility:
     • Check model-parameter support before sending.
     • Enforce WS pcm / 16000Hz / mono requirements.
     • For HTTP stream=true, expect SSE text deltas only, not structured verbose fields.
  5. Parse robustly:
     • Handle JSON/text/verbose/SSE forms.
     • Handle WS terminal events and failures.
     • Treat returned audio URLs, api_key, session_id, and trace_id as sensitive operational data.
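
The model choice and compatibility checks in the workflow above could be expressed as a small pre-flight helper. The following Python sketch rests on several assumptions: the model labels (`Lite`, `ASR`, `Pro`, `DeepThink`) are used as they appear in the list and may not match the identifier strings the API expects, the feature and parameter key names are hypothetical placeholders, treating `max_speakers` as Pro-only is an inference from the list, and "10MB" is read conservatively as 10,000,000 bytes. Consult references/asr.md for the real names and limits.

```python
# Conservative reading of the 10MB upload limit (assumption: decimal megabytes).
MAX_UPLOAD_BYTES = 10_000_000

# Controls the workflow says DeepThink must not receive (key names are hypothetical).
DEEPTHINK_FORBIDDEN = {
    "language", "diarization", "sentiment", "timestamps", "itn", "punctuation",
}

# Features the workflow attributes to the ASR model.
ASR_FEATURES = {"streaming", "translation", "diarization", "sentiment", "timestamps"}


def choose_model(needs):
    """Pick a model label from a set of feature needs, most specific first."""
    needs = set(needs)
    if "intelligent_editing" in needs:
        return "DeepThink"
    if "max_speakers" in needs:
        return "Pro"
    if needs & ASR_FEATURES:
        return "ASR"
    return "Lite"


def validate_request(model, params, file_size=None, ws_audio=None):
    """Return a list of problems; an empty list means the request looks sendable.

    ws_audio is an optional (encoding, sample_rate_hz, channels) tuple.
    """
    problems = []
    if model == "DeepThink":
        bad = sorted(DEEPTHINK_FORBIDDEN & set(params))
        if bad:
            problems.append("DeepThink does not accept: " + ", ".join(bad))
    if "max_speakers" in params and model != "Pro":
        # Inferred from the workflow; not confirmed by the source.
        problems.append("max_speakers requires Pro")
    if file_size is not None and file_size > MAX_UPLOAD_BYTES:
        problems.append("file exceeds 10MB; split the audio before sending")
    if ws_audio is not None and ws_audio != ("pcm", 16000, 1):
        problems.append("WebSocket audio must be pcm / 16000Hz / mono")
    return problems
```

Returning a list of problems instead of raising on the first one lets a caller report every incompatibility in a single pass before any request is sent.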

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
