A/B Testing Framework

Compare models with A/B testing for selection

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "A/B Testing Framework" with this command: npx skills add nidalghetf/ab-test-framework

A/B Testing Framework

Description

Compare models with A/B testing for selection

Source Reference

This skill is derived from 20. Testing & Quality Assurance of the OpenClaw Agent Mastery Index v4.1.

Sub-heading: A/B Testing Frameworks for Model Selection

Complexity: high

Input Parameters

Name	Type	Required	Description
`model_a`	string	Yes	First model
`model_b`	string	Yes	Second model
`test_prompts`	array	Yes	Test prompts

Output Format

{
  "status": <string>,
  "details": <object>,
  "winner": <string>,
  "confidence": <number>
}

Usage Examples

Example 1: Basic Usage

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: 123
});

Example 2: With Optional Parameters

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: []
});

Security Considerations

A/B test security per Category 8; prevent test manipulation

Additional Security Measures

Input Validation: All inputs are validated before processing
Least Privilege: Operations run with minimal required permissions
Audit Logging: All actions are logged for security review
Error Handling: Errors are sanitized before returning to caller

Troubleshooting

Common Issues

Issue	Cause	Solution
Permission denied	Insufficient privileges	Check file/directory permissions
Invalid input	Malformed parameters	Validate input format
Dependency missing	Required module not installed	Run `npm install`

Debug Mode

Enable debug logging:

openclaw.logger.setLevel('debug');
const result = await openclaw.skill.run('ab-test-framework', { ... });

Related Skills

model-routing-manager
performance-benchmarker

@param {string} params.model_a - First model
@param {string} params.model_b - Second model
@param {Array} params.test_prompts - Test prompts

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Session Memory Manual

跨 Session 记忆查找与同步机制，完全手动触发

Registry SourceRecently Updated

1321Profile unavailable

General

Manual AI

当自动 AI API 调用失败时，通过人机协作完成任务。支持 6 个核心 AI 平台：Gemini、Google AI、ChatGPT、豆包、千问、NotebookLM。提供智能平台推荐、提示词优化、批量任务支持。

Registry SourceRecently Updated

1450Profile unavailable

General

Doc Sysadmin

Especialista TI Ubuntu 24.04. Cuida do sistema host - espaço em disco, RAM, lentidão, limpeza periódica. Use when: (1) verificação de saúde do sistema, (2) l...

Registry SourceRecently Updated

5060Profile unavailable

Automation

Book-PDF：书籍级PDF手册生成器

深度调研一个主题，生成100页+书籍级PDF手册。模块化HTML片段架构 + 语义化版本管理 + 多Agent并行写作 + Playwright渲染PDF。当用户需要制作完整的PDF手册、电子书、橙皮书、参考指南时触发。即使用户只是说「做一本书」「做个PDF手册」「做个完整指南」「做一本XX的手册」也应触发。...

Registry SourceRecently Updated

2950Profile unavailable