assemblyai-transcriber

Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "assemblyai-transcriber" with this command: npx skills add xenofex7/assemblyai-transcriber

AssemblyAI Transcriber 🎙️

Transcribe audio files with speaker diarization (who speaks when).

Features

  • ✅ Transcription in 100+ languages
  • ✅ Speaker diarization (Speaker A, B, C...)
  • ✅ Timestamps per utterance
  • ✅ Automatic language detection
  • ✅ Supports MP3, WAV, M4A, FLAC, OGG, WEBM

Setup

  1. Create AssemblyAI account: https://www.assemblyai.com/
  2. Get API key (free tier: 100 min/month)
  3. Set environment variable:
export ASSEMBLYAI_API_KEY="your-api-key"

Or save to config file:

// ~/.assemblyai_config.json
{
  "api_key": "YOUR_API_KEY"
}

Usage

Transcribe local audio

python3 scripts/transcribe.py /path/to/recording.mp3

Transcribe from URL

python3 scripts/transcribe.py https://example.com/meeting.mp3

Options

python3 scripts/transcribe.py audio.mp3 --no-diarization  # Skip speaker labels
python3 scripts/transcribe.py audio.mp3 --json            # Raw JSON output

Output Format

## Transcript

*Language: EN*
*Duration: 05:32*

**Speaker A** [00:00]: Hello everyone, welcome to the meeting.
**Speaker B** [00:03]: Thanks! Happy to be here.
**Speaker A** [00:06]: Let's start with the first item...

Pricing

  • Free Tier: 100 minutes/month free
  • After: ~$0.01/minute

Tips

  • For best speaker diarization: clear speaker changes, minimal overlap
  • Background noise is filtered well
  • Multi-language auto-detection works reliably

Author: xenofex7 | Version: 1.1.0

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Cclaw

Open-source comedy AI + video editing + poster generation. Create standup/sketch/manzai/scripts, edit videos via FFmpeg, and generate comedy posters via canv...

Registry SourceRecently Updated
General

Bird Recognition Tool | 鸟类识别工具

Identifies bird species in images/videos of target areas. Supports recognition of no less than 500 common bird species, supports customized model training, s...

Registry SourceRecently Updated
General

Image Amazon Product Image Suite

A professional product image generation skill purpose-built for the Amazon e-commerce platform. Outputs comply with Amazon's image guidelines while optimizin...

Registry SourceRecently Updated
General

SearchOnlineAssets

Online asset search tool: queries public stock libraries (Pixabay) for high-quality photos, illustrations, vectors and videos, returning result metadata and...

Registry SourceRecently Updated