ai-regression-testing

Validate that changes to AI prompts, extraction logic, or data schemas don't break existing extraction quality. Use when modifying agentPrompts.ts, extractionSchema.ts, or ai.ts routes. Covers golden test sets, diff testing, and confidence score regression.

Safety Notice

This listing is imported from SkillsMP metadata and should be treated as untrusted until upstream source review is completed.

Copy this and send it to your AI assistant to learn

Install skill "ai-regression-testing" with this command: npx skills add hmartinezc/skillsmp-hmartinezc-hmartinezc-ai-regression-testing

No markdown body

This source entry does not include full markdown content beyond metadata.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

continuous-learning

Extract reusable patterns from coding sessions to improve future AI agent behavior. Use at the end of sessions or when noticing repeated patterns, common mistakes, or successful workflows worth preserving.

Repository SourceNeeds Review
General

cost-aware-llm-pipeline

Optimize Gemini API costs WITHOUT changing the extraction model. Use when building or modifying AI extraction pipelines, batch processing, or when API costs are increasing. Covers caching, prompt optimization, batching, retry logic, and cost tracking. The current model (gemini-3-flash-preview) is proven for PDF invoice extraction and should NOT be downgraded.

Repository SourceNeeds Review
Coding

customs-trade-compliance

Customs documentation, trade compliance, and international shipping regulations for logistics applications. Use when working with customs-related features, international freight documentation, HS codes, duties, or cross-border trade compliance.

Repository SourceNeeds Review
General

database-migrations

Safe, idempotent database migration patterns for libSQL/SQLite. Use when modifying server/schema.ts, adding tables or columns, writing seed data, or planning schema changes. Covers idempotent DDL, rollback strategies, and zero-downtime patterns.

Repository SourceNeeds Review
ai-regression-testing | V50.AI