cledon

Manage voice AI agent testing with Cledon's MCP server. Create test cases, scenarios, then run and monitor automated voice tests. Use when working with voice agent testing, SIP calls, test assertions, call transcripts, or Cledon.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "cledon" with this command: npx skills add triceralab/cledon-skill/triceralab-cledon-skill-cledon

Cledon — Voice AI Agent Testing

Cledon tests voice AI agents by simulating callers that phone your agent and evaluate responses against assertions.

Domain Model

Agent        — the voice AI being tested (name, phone number, personality)
Folder       — groups related test cases
Test Case    — defines assertions + expected tool calls for one agent
Scenario     — a runnable test with caller instructions for one test case
Run          — execution of a scenario producing transcript + pass/fail results

Relationships: Agent → many Test Cases → many Scenarios. Each Scenario produces Runs.

Available Tools (22)

Analytics

ToolPurpose
get-overall-statsDashboard summary: total scenarios, runs, pass rate, avg duration
get-run-historyRecent runs with pass/fail counts (1-90 days lookback)
get-failed-assertionsTop 10 recurring failures with up to 3 example runs each

Agents

ToolPurpose
list-agentsList all voice agents
get-agentFull agent details by ID
create-agentCreate agent in call mode (phone number) or LLM mode (ElevenLabs, Vapi, LiveKit, Famulor, Synthflow)
update-agentUpdate agent properties
delete-agentDelete agent and associated data

Test Cases & Scenarios

ToolPurpose
list-testcasesList test cases (optional folderId filter)
get-testcaseFull test case with assertions and expected tool calls
create-testcaseAI-generate test case from a transcript or system prompt; supports includeScenarios to auto-create scenarios
update-testcaseUpdate test case properties
execute-testcaseRun all scenarios for a test case
list-scenariosList scenarios (optional testCaseId filter)
get-scenarioFull scenario with caller instructions

Execution

ToolPurpose
run-scenarioTrigger single test → returns runId
run-multiple-scenariosBatch trigger → returns array of runIds
get-run-statusFull run details: transcript, assertions, tool call validation
get-scenario-runsRun history for one scenario with pass/fail counts
cancel-runCancel a stuck run (only status=running)

Credentials

ToolPurpose
list-credentialsList all stored voice platform credentials (keys never exposed)
create-credentialStore a new platform API key (elevenlabs, vapi, livekit, famulor, synthflow)
update-credentialUpdate a credential's name or API key
delete-credentialDelete a stored credential

Workflows

Get an overview of testing status

  1. get-overall-stats → see pass rate, total runs, average duration
  2. get-run-history with days=7 → see recent individual results
  3. get-failed-assertions → identify systemic issues

Run a test and check results

  1. list-scenarios → find the scenario ID
  2. run-scenario with scenarioId → get back a runId
  3. Wait a moment, then get-run-status with runId → see transcript + assertion results
  4. If status is still "running", wait and check again

Run all tests for a test case

  1. list-scenarios with testCaseId filter → collect all scenario IDs
  2. run-multiple-scenarios with the ID array
  3. get-run-history with days=1 → see batch results

Investigate failures

  1. get-failed-assertions → find the most common failures
  2. Pick a failure, note the example runIds
  3. get-run-status for each runId → read the transcript to understand what went wrong
  4. get-scenario-runs for that scenarioId → check if it's a regression or consistent failure

Drill into a specific test case

  1. get-testcase with id → see assertions and expected tool calls
  2. list-scenarios with testCaseId → see all persona combinations
  3. get-scenario for each → see caller instructions

Create a new test from scratch

  1. list-agents → pick the agent to test (or create-agent)
  2. create-testcase with agent ID and assertions; set includeScenarios: true to auto-generate scenarios
  3. execute-testcase → run all scenarios, or run-scenario → run a single one

Create tests from a transcript

  1. list-agents → pick the agent (or create-agent)
  2. create-testcase with agentId and transcript — AI analyzes the transcript and generates assertions, icons, and expected tool calls
  3. Optionally set includeScenarios: true to also generate caller scenarios
  4. execute-testcase → run all generated scenarios

Key Patterns

  • List endpoints return compact data. Use the corresponding get-by-ID tool to see full details.
  • run-scenario is async: it returns a runId immediately. Poll get-run-status to see results.
  • All data is scoped to the authenticated user's organization. No cross-tenant access.
  • Run outcome is either "passed" or "failed". Run status progresses: running → completed/failed.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

clinic-visit-prep

帮助患者整理就诊前问题、既往记录、检查清单与时间线,不提供诊断。;use for healthcare, intake, prep workflows;do not use for 给诊断结论, 替代医生意见.

Archived SourceRecently Updated
Automation

changelog-curator

从变更记录、提交摘要或发布说明中整理对外 changelog,并区分用户价值与内部改动。;use for changelog, release-notes, docs workflows;do not use for 捏造未发布功能, 替代正式合规审批.

Archived SourceRecently Updated
Automation

klaviyo

Klaviyo API integration with managed OAuth. Access profiles, lists, segments, campaigns, flows, events, metrics, templates, catalogs, and webhooks. Use this skill when users want to manage email marketing, customer data, or integrate with Klaviyo workflows. For other third party apps, use the api-gateway skill (https://clawhub.ai/byungkyu/api-gateway).

Archived SourceRecently Updated
Automation

lifelog

生活记录自动化系统。自动识别消息中的日期(今天/昨天/前天/具体日期),使用 SubAgent 智能判断,记录到 Notion 对应日期,支持补录标记。 适用于:(1) 用户分享日常生活点滴时自动记录;(2) 定时自动汇总分析并填充情绪、事件、位置、人员字段

Archived SourceRecently Updated
cledon | V50.AI