agentbox-inference

LLM inference via paid API: OpenAI-compatible chat completions proxied through x402 providers. Supports Kimi K2.5, MiniMax M2.5. Uses x_payment tool for automatic USDC micropayments ($0.001-$0.003/call). Use when: (1) generating text with a specific model, (2) running chat completions through a pay-per-request LLM endpoint, (3) comparing outputs across models.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "agentbox-inference" with this command: npx skills add cascade-protocol/agentbox/cascade-protocol-agentbox-agentbox-inference

LLM Inference

Paid OpenAI-compatible chat completions API at https://inference.surf.cascade.fyi. Costs $0.001-$0.003 USDC per call via x402 on Solana. Use the x_payment tool for all requests.

Endpoint

Chat Completions

Generate a chat completion from a supported model.

x_payment({
  "url": "https://inference.surf.cascade.fyi/v1/chat/completions",
  "method": "POST",
  "body": "{\"model\": \"moonshotai/kimi-k2.5\", \"messages\": [{\"role\": \"user\", \"content\": \"Explain x402 in one sentence\"}]}"
})

Body Parameters:

ParamTypeRequiredDescription
modelstringyesModel ID (see table below)
messagesarrayyesArray of {role, content} objects
max_tokensintegernoMaximum tokens to generate
temperaturenumbernoSampling temperature (0-2)
top_pnumbernoNucleus sampling (0-1)

Message roles: system, user, assistant

Models & Pricing

ModelCost/callBest for
moonshotai/kimi-k2.5$0.003High-quality output, large context (262K)
minimax/minimax-m2.5$0.002Balanced quality/cost

Usage Patterns

Simple question

x_payment({
  "url": "https://inference.surf.cascade.fyi/v1/chat/completions",
  "method": "POST",
  "body": "{\"model\": \"moonshotai/kimi-k2.5\", \"messages\": [{\"role\": \"user\", \"content\": \"What is the x402 protocol?\"}]}"
})

With system prompt and parameters

x_payment({
  "url": "https://inference.surf.cascade.fyi/v1/chat/completions",
  "method": "POST",
  "body": "{\"model\": \"moonshotai/kimi-k2.5\", \"messages\": [{\"role\": \"system\", \"content\": \"You are a concise technical writer.\"}, {\"role\": \"user\", \"content\": \"Write a summary of Solana's transaction model\"}], \"max_tokens\": 500, \"temperature\": 0.7}"
})

Response Format

Standard OpenAI chat completion response:

{
  "id": "gen-...",
  "object": "chat.completion",
  "model": "moonshotai/kimi-k2.5",
  "choices": [{
    "index": 0,
    "message": { "role": "assistant", "content": "..." },
    "finish_reason": "stop"
  }],
  "usage": {
    "prompt_tokens": 12,
    "completion_tokens": 42,
    "total_tokens": 54
  }
}

Errors

HTTPMeaning
400Invalid request (check model name and messages format)
402Payment required (handled automatically by x_payment)
502Upstream provider error

Cost

Flat rate per model per call. Price is determined by the model field in the request body. Each call is independent - no sessions or state.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

agentbox-twitter

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

agentbox

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

web

No summary provided by upstream source.

Repository SourceNeeds Review