chough

Installation

Arch Linux: paru -S chough-bin
macOS: brew install --cask hyperpuncher/tap/chough
Windows: winget install chough
Source: go install github.com/hyperpuncher/chough/cmd/chough@latest

Requires: ffmpeg for audio/video support

Quick Reference

# Basic transcription (text to stdout)
chough audio.mp3

# Pipe audio from stdin
cat audio.mp3 | chough

# JSON with timestamps
chough -f json podcast.mp3 > transcript.json

# WebVTT subtitles
chough -f vtt -o subs.vtt video.mp4

# Low memory (30s chunks)
chough -c 30 audiobook.mp3

# Use remote server (requires CHOUGH_URL)
chough --remote audio.mp3

Flags

Flag	Description	Default
`-c, --chunk-size`	Chunk size in seconds	60
`-f, --format`	Output: text, json, vtt	text
`-o, --output`	Output file	stdout
`-r, --remote`	Transcribe via CHOUGH_URL server	-
`--version`	Show version	-
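
The flags above can be combined. As a minimal sketch, the following writes one WebVTT file next to each input using only the documented -f, -c, and -o flags. The helper names subtitle_path and transcribe_to_vtt and the CHUNK variable are hypothetical and not part of chough; inputs are assumed to have a file extension.

```shell
# Hypothetical helper: derive the .vtt path for an input file
# by replacing its last extension (video.mp4 -> video.vtt).
subtitle_path() {
  printf '%s.vtt\n' "${1%.*}"
}

# Hypothetical helper: transcribe every given file to WebVTT.
# The optional CHUNK variable sets -c (falls back to the documented default of 60).
transcribe_to_vtt() {
  for f in "$@"; do
    chough -f vtt -c "${CHUNK:-60}" -o "$(subtitle_path "$f")" "$f"
  done
}

# Usage: transcribe_to_vtt lectures/*.mp4
```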

Chunk Size Guide

15-30s: Low memory (~500MB RAM), higher error rate
60s: Balanced (default), ~1.6GB RAM
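
One way to apply this guide is to choose -c from the memory that is currently free. The sketch below is an illustration only: the helper name pick_chunk_size and the 2000MB cutoff (the ~1.6GB figure plus some headroom) are assumptions, not part of chough.

```shell
# Hypothetical helper: print a chunk size in seconds for a given amount
# of free memory in MB. The 2000MB cutoff is an assumed safety margin
# above the ~1.6GB that 60s chunks are reported to need.
pick_chunk_size() {
  if [ "$1" -ge 2000 ]; then
    echo 60
  else
    echo 30
  fi
}

# Usage (the free-memory value is supplied by the caller):
#   chough -c "$(pick_chunk_size 1024)" audiobook.mp3
```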

Remote Mode

Use the --remote flag to transcribe via an HTTP server instead of processing locally. Requires the CHOUGH_URL environment variable to be set.

# Set server URL
export CHOUGH_URL=http://localhost:8080

# Transcribe via remote server
chough --remote audio.mp3

Recommended flow: check that the CHOUGH_URL env var is set → verify the /health endpoint → use the server if it is healthy, otherwise fall back to the local CLI.
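
The flow above can be sketched as a small shell wrapper. The function names chough_server_healthy and transcribe are hypothetical, and the sketch assumes that any successful HTTP response from /health means the server is usable; the response body is not inspected.

```shell
# Hypothetical helper: succeed only if CHOUGH_URL is set, starts with
# http:// or https://, and GET /health returns a successful HTTP status.
chough_server_healthy() {
  case "${CHOUGH_URL:-}" in
    http://*|https://*) ;;
    *) return 1 ;;
  esac
  # -f makes curl fail on HTTP errors; --max-time avoids hanging on a dead server.
  curl -fsS --max-time 5 "${CHOUGH_URL%/}/health" >/dev/null 2>&1
}

# Hypothetical wrapper: use the server when healthy, otherwise the local CLI.
transcribe() {
  if chough_server_healthy; then
    chough --remote "$@"
  else
    chough "$@"
  fi
}

# Usage: transcribe -f json podcast.mp3 > transcript.json
```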

Endpoints

Method	Endpoint	Description
POST	`/transcribe`	Transcribe audio (file upload, URL, or base64)
GET	`/health`	Health check with queue status

Examples

# Upload file
curl -X POST http://localhost:8080/transcribe \
  -F "file=@audio.mp3" \
  -F "format=json" \
  -F "chunk_size=60"

# Transcribe from URL
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/audio.mp3", "format": "vtt"}'

# Base64 audio
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"base64": "...", "format": "text"}'

# Health check
curl http://localhost:8080/health
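
The base64 request above elides the payload. A minimal sketch of building it from a local file follows, assuming the server accepts standard base64 without line breaks in the base64 field. The helper names are hypothetical, and the fallback to http://localhost:8080 simply mirrors the examples above.

```shell
# Hypothetical helper: print the JSON body for a base64 request.
# $1: audio file, $2: output format (text, json, or vtt)
build_base64_payload() {
  # tr strips the line wraps that some base64 implementations insert.
  printf '{"base64": "%s", "format": "%s"}' \
    "$(base64 < "$1" | tr -d '\n')" "$2"
}

# Hypothetical helper: send the payload on stdin so large files
# do not hit command-line length limits.
transcribe_base64() {
  build_base64_payload "$1" "${2:-text}" |
    curl -X POST "${CHOUGH_URL:-http://localhost:8080}/transcribe" \
      -H "Content-Type: application/json" \
      --data-binary @-
}

# Usage: transcribe_base64 audio.mp3 json
```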

Performance

Duration	Time	Speed
15s	2.0s	7.4x realtime
1min	4.3s	14.1x realtime
5min	16.2s	18.5x realtime
30min	90.2s	19.9x realtime
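
The Speed column appears to be the ratio of audio duration to processing time. A small sketch of that arithmetic follows; the helper name realtime_factor is hypothetical, and recomputing from the rounded times in the table can differ slightly from the listed values.

```shell
# Hypothetical helper: realtime factor = audio seconds / processing seconds.
realtime_factor() {
  awk -v d="$1" -v t="$2" 'BEGIN { printf "%.1fx realtime\n", d / t }'
}

# Usage: realtime_factor 300 16.2   (5min of audio processed in 16.2s)
```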

Troubleshooting

Out of memory: Use -c 30 or -c 15
Model fails: Check internet, verify $XDG_CACHE_HOME is writable
ffmpeg errors: Ensure ffmpeg is installed
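
Some of these checks can be scripted before a long run. A minimal sketch with hypothetical helper names; it assumes the cache location falls back to $HOME/.cache when $XDG_CACHE_HOME is unset (the XDG default), which chough may or may not follow.

```shell
# Hypothetical helper: succeed if ffmpeg is on PATH.
have_ffmpeg() {
  command -v ffmpeg >/dev/null 2>&1
}

# Hypothetical helper: succeed if the cache directory exists and is writable.
cache_writable() {
  dir="${XDG_CACHE_HOME:-$HOME/.cache}"
  [ -d "$dir" ] && [ -w "$dir" ]
}

# Print a warning on stderr for each failed check.
preflight() {
  have_ffmpeg || echo "ffmpeg not found: install it for audio/video support" >&2
  cache_writable || echo "cache directory is not writable: model download may fail" >&2
}

# Usage: preflight && chough audio.mp3
```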

Notes

First run downloads ~650MB model to $XDG_CACHE_HOME/chough/models
Auto-extracts audio from video files
Set CHOUGH_MODEL env var to use custom model path
Set CHOUGH_URL env var for --remote mode (must start with http:// or https://)
VTT groups tokens into subtitle cues automatically

Docs

GitHub: https://github.com/hyperpuncher/chough
