A/B Testing Framework

Compare two models with A/B testing to support model selection.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "A/B Testing Framework" with this command: npx skills add nidalghetf/ab-test-framework

Description

Runs an A/B comparison of two models over a set of test prompts and reports a winner with a confidence value, to support model selection.

Source Reference

This skill is derived from section 20, "Testing & Quality Assurance", of the OpenClaw Agent Mastery Index v4.1.

Sub-heading: A/B Testing Frameworks for Model Selection

Complexity: high

Input Parameters

Name          Type    Required  Description
model_a       string  Yes       First model
model_b       string  Yes       Second model
test_prompts  array   Yes       Test prompts
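
The parameter types above can be checked before calling the skill. The following is a minimal sketch only: `validateAbTestParams` is a hypothetical helper, not part of the skill's documented API, and the rule that `test_prompts` must be non-empty is an assumption.

```javascript
// Hypothetical helper (not part of the skill): returns a list of
// validation errors for the three documented input parameters.
function validateAbTestParams(params) {
  const p = params ?? {};
  const errors = [];

  // model_a and model_b are documented as required strings.
  for (const key of ["model_a", "model_b"]) {
    if (typeof p[key] !== "string" || p[key].trim() === "") {
      errors.push(`${key} must be a non-empty string`);
    }
  }

  // test_prompts is documented as a required array.
  if (!Array.isArray(p.test_prompts)) {
    errors.push("test_prompts must be an array");
  } else if (p.test_prompts.length === 0) {
    // Assumption: an empty prompt list cannot produce a meaningful comparison.
    errors.push("test_prompts must contain at least one prompt");
  }

  return errors;
}
```

An empty returned array means the parameters match the documented types; any other result lists what to fix before running the skill.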

Output Format

{
  "status": <string>,
  "details": <object>,
  "winner": <string>,
  "confidence": <number>
}
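
The listing does not say how `winner` and `confidence` are computed. As an illustration only, one common approach is a two-sided sign test over per-prompt outcomes. In the sketch below, the function names, the `"a"`/`"b"`/`"tie"` outcome labels, and the use of 1 minus the p-value as a rough confidence score are all assumptions, not the skill's documented behavior.

```javascript
// Binomial coefficient C(n, k), computed multiplicatively.
function binomial(n, k) {
  let c = 1;
  for (let i = 1; i <= k; i++) c = (c * (n - k + i)) / i;
  return c;
}

// Hypothetical sketch: two-sided sign test over per-prompt outcomes.
// Each outcome is "a", "b", or "tie" (labels assumed for illustration).
function signTest(outcomes) {
  const winsA = outcomes.filter((o) => o === "a").length;
  const winsB = outcomes.filter((o) => o === "b").length;
  const n = winsA + winsB; // ties are dropped
  if (n === 0 || winsA === winsB) return { winner: "tie", confidence: 0 };

  // Tail probability P(X >= k) for X ~ Binomial(n, 0.5).
  const k = Math.max(winsA, winsB);
  let tail = 0;
  for (let i = k; i <= n; i++) tail += binomial(n, i);
  tail /= 2 ** n;

  // Double the tail for a two-sided p-value, capped at 1.
  const pValue = Math.min(1, 2 * tail);
  return {
    winner: winsA > winsB ? "model_a" : "model_b",
    confidence: 1 - pValue, // simplification: not a formal confidence level
  };
}
```

With nine wins for the first model and one for the second, this sketch returns `model_a` with a confidence of about 0.979. With few prompts the test has little power, so a low confidence value may simply reflect a small sample.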

Usage Examples

Example 1: Basic Usage

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: ["value"]
});

Example 2: Multiple Test Prompts

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: ["value", "value"]
});

Security Considerations

Apply the A/B test security requirements of Category 8, and protect test runs against manipulation.

Additional Security Measures

  1. Input Validation: All inputs are validated before processing
  2. Least Privilege: Operations run with minimal required permissions
  3. Audit Logging: All actions are logged for security review
  4. Error Handling: Errors are sanitized before returning to caller
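
The error-handling measure above can be sketched as follows. This is an illustrative assumption rather than the skill's implementation: `sanitizeError`, the `SAFE_MESSAGES` table, and the returned `status: "error"` shape are hypothetical. The idea is to return only an allow-listed code and a fixed message, so internal details such as paths or stack traces never reach the caller.

```javascript
// Hypothetical allow-list of error codes and caller-safe messages.
const SAFE_MESSAGES = {
  EACCES: "Permission denied",
  EINVAL: "Invalid input",
  MODULE_NOT_FOUND: "Dependency missing",
};

// Hypothetical helper: maps an internal error to a caller-safe object.
// The original message and stack are deliberately dropped.
function sanitizeError(err) {
  const code = err && typeof err.code === "string" ? err.code : "";
  // Own-property check so inherited keys like "constructor" are not treated as known.
  const isKnown = Object.prototype.hasOwnProperty.call(SAFE_MESSAGES, code);
  return {
    status: "error",
    code: isKnown ? code : "UNKNOWN",
    message: isKnown ? SAFE_MESSAGES[code] : "Unexpected error",
  };
}
```

The full error can still be written to the audit log before sanitizing, so that detail remains available for security review.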

Troubleshooting

Common Issues

Issue               Cause                          Solution
Permission denied   Insufficient privileges        Check file/directory permissions
Invalid input       Malformed parameters           Validate input format
Dependency missing  Required module not installed  Run npm install

Debug Mode

Enable debug logging:

openclaw.logger.setLevel('debug');
const result = await openclaw.skill.run('ab-test-framework', { ... });

Related Skills

  • model-routing-manager
  • performance-benchmarker

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
