api-benchmark

Benchmark token generation speed across multiple LLM API providers. Measures TTFT (Time To First Token), tokens-per-second throughput, and total generation time. Use when comparing the performance of different API providers or models, or when testing API connectivity. Requires an OpenClaw config file with provider definitions.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "api-benchmark" with this command: npx skills add Polar/api-benchmark

API Token Speed Benchmark

This skill benchmarks token generation speed across multiple LLM API providers.

When to use this skill

Use this skill when you need to:

  • Compare token generation speed across different API providers
  • Measure latency and throughput of LLMs
  • Verify API connectivity and authentication
  • Test new API endpoints or models

How to run benchmarks

List available targets

python3 main.py --targets

Run benchmark on a specific target

python3 main.py run --label <target-label>

Run benchmark on all targets

python3 main.py run --all

Run preflight check (verify API connectivity)

python3 main.py check --label <target-label>
python3 main.py check --all

Options

  • -l, --label: Specific target label to benchmark
  • -a, --all: Run on all available targets
  • -r, --repeat: Number of runs per prompt level (default: 1)
  • -c, --category: Prompt category to run: short, medium, or long. Repeat the flag to select several (-c short -c medium)
  • -q, --quiet: Quiet mode; suppresses progress output
  • --timeout N: Request timeout in seconds (default: 120)
  • --table: Output as formatted table (default: JSON)

Configuration

The tool reads configuration from ~/.openclaw/openclaw.json. Targets are defined in the models.providers section with:

  • baseUrl: API base URL
  • apiKey: Authentication key (or ${ENV_VAR} to read from environment variable)
  • api: API format (anthropic-messages, openai-completions, openai-responses)
  • models: List of model configurations

Security Note: Instead of hardcoding API keys in the config file, use environment variable placeholders:

  • "apiKey": "${ANTHROPIC_API_KEY}" will read from the ANTHROPIC_API_KEY environment variable
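
The placeholder substitution described above could be sketched as follows. This is an illustration only, not the tool's actual code: the function name resolve_api_key is hypothetical, and it assumes a placeholder is recognised only when the whole value has the form ${NAME}.

```python
import os
import re

# Assumption: a placeholder is a value made up entirely of one ${NAME} token.
_PLACEHOLDER = re.compile(r"^\$\{([A-Za-z_][A-Za-z0-9_]*)\}$")


def resolve_api_key(value, env=None):
    """Return a literal key unchanged, or look up the variable a placeholder names.

    Hypothetical helper for illustration; not part of api-benchmark itself.
    """
    env = os.environ if env is None else env
    match = _PLACEHOLDER.match(value)
    if match is None:
        return value  # literal key, used as-is
    name = match.group(1)
    if name not in env:
        raise KeyError(f"environment variable {name} is not set")
    return env[name]
```

Failing loudly on an unset variable is a design choice in this sketch; it surfaces a missing key before any request is sent.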

Example provider config:

{
  "models": {
    "providers": {
      "my-provider": {
        "baseUrl": "https://api.example.com",
        "apiKey": "${MY_PROVIDER_API_KEY}",
        "api": "openai-completions",
        "models": [
          { "id": "model-name", "api": "openai-completions" }
        ]
      }
    }
  }
}
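
To see which provider and model pairs a config like this defines, a short script along these lines may help. It is a sketch under assumptions: list_targets and load_config are hypothetical names, and the rule that a model-level "api" overrides the provider-level one is a guess, not documented behaviour. It does not reproduce how the tool builds its target labels.

```python
import json
from pathlib import Path


def list_targets(config):
    """Yield (provider_name, model_id, api_format) for each configured model."""
    providers = config.get("models", {}).get("providers", {})
    for name, provider in providers.items():
        for model in provider.get("models", []):
            # Assumption: a model-level "api" takes precedence over the provider's.
            yield name, model["id"], model.get("api", provider.get("api"))


def load_config(path=None):
    """Load the OpenClaw config; defaults to ~/.openclaw/openclaw.json."""
    if path is None:
        path = Path.home() / ".openclaw" / "openclaw.json"
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)
```

Calling list(list_targets(load_config())) then gives one tuple per configured model.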

Output Metrics

  • TTFT (Time To First Token): Latency before first token arrives (seconds)
  • TPS (Tokens Per Second): Generation throughput
  • Total Time: Full generation duration (seconds)
  • Input/Output Tokens: Token counts taken from the API's usage data

Note: If the API does not return token counts, they are estimated at 4 characters per token.
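
The metrics above can be illustrated with a small timing sketch over any iterable of streamed text chunks. It is not the tool's implementation: the names estimate_tokens, compute_metrics, and measure are hypothetical, the estimate rounds up, and TPS is assumed to be output tokens divided by total time (the tool may instead divide by the time after the first token).

```python
import time


def estimate_tokens(text):
    """Fallback estimate at 4 characters per token (rounded up, an assumption)."""
    return -(-len(text) // 4)


def compute_metrics(start, first_token_at, end, output_text, output_tokens=None):
    """Derive TTFT, total time, and TPS from three timestamps in seconds."""
    if output_tokens is None:
        # No usage data from the API: fall back to the character-based estimate.
        output_tokens = estimate_tokens(output_text)
    total = end - start
    # Assumption: throughput is measured over the whole request duration.
    tps = output_tokens / total if total > 0 else 0.0
    return {
        "ttft": first_token_at - start,
        "total_time": total,
        "output_tokens": output_tokens,
        "tps": tps,
    }


def measure(chunks, clock=time.perf_counter):
    """Time an iterable of text chunks as they arrive."""
    start = clock()
    first = None
    parts = []
    for chunk in chunks:
        if first is None and chunk:
            first = clock()  # first non-empty chunk marks the first token
        parts.append(chunk)
    end = clock()
    if first is None:
        first = end  # nothing was generated
    return compute_metrics(start, first, end, "".join(parts))
```

Passing the clock as a parameter keeps the sketch testable with fixed timestamps; in real use the default time.perf_counter applies.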

Example Usage

# Check if a specific target is reachable
python3 main.py check --label my-provider

# Benchmark a single target
python3 main.py run --label my-provider --repeat 3

# Compare all targets
python3 main.py run --all --table

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
