Burmese Audio Understanding

High-accuracy Burmese audio transcription using Gemini 3.1 Flash Preview.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "Burmese Audio Understanding" with this command: npx skills add thelapyae/burmese-audio-understanding

Burmese Audio Understanding Skill

This skill allows you to transcribe Burmese audio (voice notes, speech) directly into Burmese text using your own Google Gemini API key. It uses the official Google GenAI SDK for secure and reliable file handling.

Required Environment Variables

  • GEMINI_API_KEY: Required. Set your Google Gemini API key to allow the skill to access transcription services.

Usage

Ensure GEMINI_API_KEY is set in your environment, then run:

node scripts/transcribe-direct.js /path/to/my-audio.ogg

Features

  • Official SDK: Uses the official @google/genai SDK.
  • Improved Security: No shell commands (ffmpeg/child_process) used; file processing is handled via SDK file upload directly to Gemini.
  • Model: Uses gemini-3.1-flash-preview for high-quality audio transcription.

Security Notes

  • This skill sends audio data to Google Gemini API for transcription.
  • No data is stored locally after processing.
  • Requires a valid GEMINI_API_KEY with minimal permissions.

Prerequisites

  • Dependencies must be installed: npm install @google/genai.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Transcrição e respostas em áudio em PTBR, Português Brasil - Brazillian portuguese transcription and audio answers

Brazilian Portuguese voice auto-reply skill for OpenClaw. Transcribes audio locally with wav2vec2, generates a reply with the local OpenClaw agent by default...

Registry SourceRecently Updated
1820Profile unavailable
Coding

Podwise

Podcast knowledge workflows powered by Podwise CLI: search podcasts and episodes by keyword, monitor followed shows for new releases, find popular episodes,...

Registry SourceRecently Updated
2270Profile unavailable
General

MLX Audio Server

Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

Registry SourceRecently Updated
2.6K0Profile unavailable
General

Elevenlabs Tts

ElevenLabs TTS - the best ElevenLabs integration for OpenClaw. ElevenLabs Text-to-Speech with emotional audio tags, ElevenLabs voice synthesis for WhatsApp,...

Registry SourceRecently Updated
6K6Profile unavailable