deepgram-asr

Transcribe audio via Deepgram Nova-3 API. Fast, accurate, and cost-effective speech-to-text for 50+ languages. Transcripción de audio rápida y precisa con Deepgram Nova-3. Use when the user needs transcription via Deepgram, or wants fast/cheap cloud transcription.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "deepgram-asr" with this command: npx skills add vahnxu/deepgram-asr

Deepgram ASR (Nova-3)

Transcribe audio files via Deepgram's Nova-3 model. Sub-300ms latency, 50+ languages, $0.0043/min. Free $200 credit on signup.

Transcriba archivos de audio con el modelo Nova-3 de Deepgram. Latencia inferior a 300ms, más de 50 idiomas, $0.0043/min. $200 de crédito gratis al registrarse.

Sending audio to OpenClaw

Currently, audio files can be sent to OpenClaw via Discord or WhatsApp. Send the audio file in a chat message and ask the bot to transcribe it.

Actualmente, los archivos de audio se pueden enviar a OpenClaw a través de Discord o WhatsApp.

Note: Direct voice recording in the OpenClaw web UI is not yet supported. Use a messaging app to send pre-recorded audio files.

Nota: La grabación de voz directa en la interfaz web de OpenClaw aún no está disponible. Use una aplicación de mensajería para enviar archivos de audio pregrabados.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: nova-3
  • Output: <input>.txt

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language es  # Spanish / Español
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/result.json
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --speakers  # speaker diarization / separación de hablantes
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --paragraphs  # smart paragraphs / párrafos inteligentes
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --summarize  # AI summary / resumen con IA
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --detect-language  # auto-detect language / detección automática de idioma

How it works

The script sends audio directly to Deepgram's API (api.deepgram.com) via HTTPS. No third-party services are involved — audio goes only to Deepgram.

  1. Reads the local audio file
  2. POSTs it to https://api.deepgram.com/v1/listen with your API key
  3. Parses the JSON response and extracts transcript text
  4. Saves to output file

Privacy: Audio is sent directly to Deepgram's servers over HTTPS. No data is stored by this skill; Deepgram's data retention policy applies.

Dependencies

  • curl — for API calls
  • python3 — for JSON response parsing (stdlib only, no pip packages needed)

Credentials

  1. Sign up at https://console.deepgram.com
  2. Create an API key at https://console.deepgram.com/project/api-keys
  3. Free tier: $200 credit on signup

Set the environment variable:

export DEEPGRAM_API_KEY="your_api_key"

Supported languages

Nova-3 supports 45+ languages. Common language codes:

LanguageCodeLanguageCode
EnglishenSpanish / Españoles
French / FrançaisfrGerman / Deutschde
Portuguese / PortuguêsptJapanese / 日本語ja
Chinese / 中文zhKorean / 한국어ko
Hindi / हिन्दीhiRussian / Русскийru
Italian / ItalianoitDutch / Nederlandsnl
Arabic / العربيةarTurkish / Türkçetr

Use --detect-language to auto-detect, or --language <code> to specify.

Full list: https://developers.deepgram.com/docs/models-languages-overview

Supported formats

WAV, MP3, MP4, M4A, OGG, FLAC, WebM, and more.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

交互式教学网页生成器

根据教案、教材或课件生成交互式教学HTML网页

Registry SourceRecently Updated
General

大学生学习规划助手

大学生学习规划与作业助手。根据课程表、作业截止日期和复习计划,生成每日任务清单、优先级排序,整理学习资料和复习要点,提供番茄钟学习提醒。触发词:学习计划、作业规划、复习计划、任务清单、DDL管理、课程安排、番茄钟、学习提醒。

Registry SourceRecently Updated
General

The English Tutor

英语口语陪练助手。触发词:「练习英语」「我要说英语」「英语陪练」。功能:上传生词表 → 每日定时情景对话 → 语音输入即时纠错 → 飞书语音推送 → 自动艾宾浩斯复习。

Registry SourceRecently Updated
General

Writing Triadic Repo

Triadic collaborative writing framework with self-evolving memory engine. Creator mines intent via progressive questioning, Executor produces ≥2 distinct dra...

Registry SourceRecently Updated