openai-whisper

Speech-to-text via SkillBoss API Hub (STT, powered by Whisper and more).

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "openai-whisper" with this command: npx skills add toby-transcribe

Whisper STT via SkillBoss API Hub

Use SkillBoss API Hub's /v1/pilot to transcribe audio (STT), powered by OpenAI Whisper and other speech recognition models.

Quick start (Python)

import requests, base64, os

SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"]
API_BASE = "https://api.skillbossai.com/v1"

def pilot(body: dict) -> dict:
    r = requests.post(
        f"{API_BASE}/pilot",
        headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"},
        json=body,
        timeout=60,
    )
    return r.json()

# Transcribe audio file
audio_b64 = base64.b64encode(open("/path/audio.mp3", "rb").read()).decode()
result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.mp3"}})
text = result["result"]["text"]
print(text)

# Translate audio to English
result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.m4a", "task": "translate"}})
text = result["result"]["text"]
print(text)

Notes

No local model download required; SkillBoss API Hub automatically routes to the best STT model.
SKILLBOSS_API_KEY environment variable required.
Response text is at result["result"]["text"].

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Ecommerce Livestream Overlay Generator

Fully automated e-commerce livestream overlay generation. Given a brand, product, and promotional info (or a product photo directly), automatically generates...

Registry SourceRecently Updated

860Profile unavailable

General

Arch Video Cut

自动合并建筑视频，生成字幕和背景音乐，输出横屏与竖屏双版本，支持自学习优化剪辑偏好。

Registry SourceRecently Updated

2830Profile unavailable

General

AI Lead Intelligence Generator

Generates detailed B2B lead intelligence reports with company insights, pain points, sales opportunities, and personalized outreach strategies for targeted p...

Registry SourceRecently Updated

2101Profile unavailable

General

Zhipu Image Generator

AI text-to-image generation using Zhipu AI's GLM-Image model. Use when the user requests image generation, creating images from text descriptions, or mention...

Registry SourceRecently Updated

3441Profile unavailable