A/B Testing Framework

Compare models with A/B testing for selection

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "A/B Testing Framework" with this command: npx skills add nidalghETF/ab-test-framework

A/B Testing Framework

Description

Compare models with A/B testing for selection

Source Reference

This skill is derived from 20. Testing & Quality Assurance of the OpenClaw Agent Mastery Index v4.1.

Sub-heading: A/B Testing Frameworks for Model Selection

Complexity: high

Input Parameters

NameTypeRequiredDescription
model_astringYesFirst model
model_bstringYesSecond model
test_promptsarrayYesTest prompts

Output Format

{
  "status": <string>,
  "details": <object>,
  "winner": <string>,
  "confidence": <number>
}

Usage Examples

Example 1: Basic Usage

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: 123
});

Example 2: With Optional Parameters

const result = await openclaw.skill.run('ab-test-framework', {
  model_a: "value",
  model_b: "value",
  test_prompts: []
});

Security Considerations

A/B test security per Category 8; prevent test manipulation

Additional Security Measures

  1. Input Validation: All inputs are validated before processing
  2. Least Privilege: Operations run with minimal required permissions
  3. Audit Logging: All actions are logged for security review
  4. Error Handling: Errors are sanitized before returning to caller

Troubleshooting

Common Issues

IssueCauseSolution
Permission deniedInsufficient privilegesCheck file/directory permissions
Invalid inputMalformed parametersValidate input format
Dependency missingRequired module not installedRun npm install

Debug Mode

Enable debug logging:

openclaw.logger.setLevel('debug');
const result = await openclaw.skill.run('ab-test-framework', { ... });

Related Skills

  • model-routing-manager
  • performance-benchmarker
  • @param {string} params.model_a - First model
  • @param {string} params.model_b - Second model
  • @param {Array} params.test_prompts - Test prompts

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Doc Sysadmin

Especialista TI Ubuntu 24.04. Cuida do sistema host - espaço em disco, RAM, lentidão, limpeza periódica. Use when: (1) verificação de saúde do sistema, (2) l...

Registry SourceRecently Updated
0238
Profile unavailable
General

sysadmin

No summary provided by upstream source.

Repository SourceNeeds Review
Security

ContextKeeper

ContextKeeper — Safe project state tracking for AI agents. Manual checkpoint creation with validated inputs. No background processes, no PID manipulation, no...

Registry SourceRecently Updated
0513
Profile unavailable