audio-extract

Extracts audio track from a video file. Use when you need to get audio from video, prepare audio for transcription, or separate audio from video content. Runs locally with no API key required.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "audio-extract" with this command: npx skills add agntswrm/agent-media/agntswrm-agent-media-audio-extract

Audio Extract

Extracts the audio track from a video file. This is a local operation using the bundled ffmpeg binary - no API keys or external services required.

Command

npx agent-media@latest audio extract --in <path> [options]

Inputs

OptionRequiredDescription
--inYesInput video file path or URL (supports mp4, webm, mkv, avi, mov)
--formatNoOutput audio format: mp3 (default) or wav
--outNoOutput path, filename or directory (default: ./)

Output

Returns a JSON object with the extracted audio file:

{
  "ok": true,
  "media_type": "audio",
  "action": "extract",
  "provider": "local",
  "output_path": "extracted_123_abc.mp3",
  "mime": "audio/mpeg",
  "bytes": 24779
}

Examples

Extract audio as MP3 (default):

npx agent-media@latest audio extract --in video.mp4

Extract audio as WAV:

npx agent-media@latest audio extract --in video.mp4 --format wav

Custom output directory:

npx agent-media@latest audio extract --in video.mp4 --out ./audio-files

Use Case: Video Transcription Workflow

Since transcription services work best with audio files (smaller uploads, faster processing), use this workflow:

# Step 1: Extract audio from video (local, instant)
npx agent-media@latest audio extract --in interview.mp4 --format mp3
# Output: extracted_xxx.mp3

# Step 2: Transcribe the audio (cloud API)
npx agent-media@latest audio transcribe --in extracted_xxx.mp3 --provider fal

Provider

This action uses the local provider with bundled ffmpeg (via ffmpeg-static). No API keys required.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

image-remove-background

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

audio-transcribe

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

video-generate

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

image-edit

No summary provided by upstream source.

Repository SourceNeeds Review