data-product-validation

Score whether a data product idea is worth building before committing resources. Validation scorecard, experiment design, and go/kill decisions. Use when evaluating feasibility, making go/no-go decisions, validating demand, sizing bets, or when someone asks "is this worth building?" or "should we invest in this?"

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "data-product-validation" with this command: npx skills add hollandkevint/data-product-operator/hollandkevint-data-product-operator-data-product-validation

Validation Scorecard

Score each dimension 1-5. Multiply all five for a composite score (max 3,125).

1. Demand Frequency

How often do consumers need this answer?

ScoreFrequencyExample
5Daily or real-timeICU readmission risk scores
4WeeklyRegional performance reports
3MonthlyQuarterly business reviews
2QuarterlyAnnual planning data
1Rarely or onceAd-hoc executive request

2. Decision Impact

What happens when consumers don't have this data?

ScoreImpactExample
5Critical decision blockedCan't discharge patients without risk assessment
4Significant delay or costTeam spends 20+ hours/week on manual workaround
3Moderate inconvenienceReport takes 3 hours instead of 5 minutes
2Minor frictionSlightly slower process, acceptable workaround exists
1Mild inconvenienceNice to have, nobody changes behavior without it

3. Workaround Effort

What are consumers doing today instead?

ScoreEffortExample
5Custom tooling builtAnalyst maintains a 47-tab Excel model updated daily
4Significant manual process3 people spend 2 days/week compiling reports
3Partial automationScript exists but breaks frequently, needs babysitting
2Simple workaroundQuick Excel export, takes 30 minutes
1Haven't triedNobody has attempted to solve this yet

4. Data Feasibility

Can we actually build this with available data?

ScoreFeasibilityExample
5Data exists, quality verifiedSource audited, passes data-quality-assessment checks
4Data exists, quality unknownSource available but no quality baseline established
3Data exists, known issuesSource has gaps or accuracy problems that need fixing first
2Data partially existsNeed to combine 3+ sources, some missing
1Data doesn't existWould need new data collection or acquisition

Cross-reference data-quality-assessment for the 5-dimension scoring. A Data Feasibility score of 1 kills the idea regardless of other scores.

5. Schema Risk

How locked-in are consumers once we ship?

ScoreRiskExample
5Easily re-modeledInternal API, few consumers, versioning supported
4Moderate couplingDashboard consumed by 5-10 users, change is disruptive but manageable
3Significant coupling20+ downstream queries depend on current schema
2High couplingExternal consumers or contractual SLAs on schema shape
1Breaking change impossibleRegulated output, schema changes require compliance review

Thresholds

Composite ScoreDecisionAction
250+BuildShape a pitch, allocate a cycle
100-249InvestigateRun a 1-week experiment to de-risk the lowest-scoring dimension
<100KillDocument why and move on. Revisit only if new evidence surfaces

CRITICAL: A Data Feasibility score of 1 kills the idea at any composite score. You can't build a data product without data.

Experiment Types

When the scorecard says "Investigate," pick the cheapest experiment that addresses the weakest dimension:

Sample Query (1 day): Write the SQL. Run it against production data. Does the result answer the consumer's question? Tests Data Feasibility and reveals quality issues fast. If the query returns garbage, kill early.

Manual Pipeline (1 week): Build the transformation by hand for one consumer. Deliver a spreadsheet or flat file. Measure: did they use it? Did it change a decision? Tests Demand Frequency and Decision Impact with real behavior.

Schema Prototype (2-3 weeks): Build a minimal star schema with one fact table and 2-3 dimensions. Load a month of data. Let 2-3 consumers query it. Tests Schema Risk and reveals modeling assumptions before they calcify.

Type 1 vs Type 2 Decisions

Not all data product decisions are equal:

Type 1 (irreversible): Schema design decisions. Data source commitments. Terminology mappings published to external consumers. SCD strategies once historical data accumulates. These deserve full validation and the scorecard process.

Type 2 (reversible): Adding a metric to an existing dashboard. New filters on an existing report. Feature additions to established products. These don't need a scorecard. Just build and measure.

Discovery budget should match decision type. Spend 2 weeks validating a Type 1. Spend 2 hours on a Type 2.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

healthcare-data-domain

No summary provided by upstream source.

Repository SourceNeeds Review
General

data-model-design

No summary provided by upstream source.

Repository SourceNeeds Review
General

data-pipeline-quality

No summary provided by upstream source.

Repository SourceNeeds Review