discover-data

Automatically discover data pipeline and ETL skills when working with ETL, data pipelines, streaming, batch processing, data validation, or pipeline orchestration. Activates for data development tasks.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "discover-data" with this command: npx skills add rand/cc-polymath/rand-cc-polymath-discover-data

Data Skills Discovery

Provides automatic access to comprehensive data skills.

When This Skill Activates

This skill auto-activates when you're working with:

  • ETL
  • data pipelines
  • batch processing
  • stream processing
  • data validation
  • orchestration
  • Airflow
  • timely dataflow
  • differential dataflow
  • streaming aggregations
  • windowing
  • real-time analytics

Available Skills

Quick Reference

The Data category contains 9 skills:

  1. batch-processing - Orchestrating complex data pipelines with dependencies
  2. data-validation - Validating data schema before processing
  3. dataflow-coordination - Coordination patterns for distributed dataflow systems
  4. differential-dataflow - Differential computation for incremental updates and efficient joins
  5. etl-patterns - Designing data extraction from multiple sources
  6. pipeline-orchestration - Coordinating complex multi-step data workflows
  7. stream-processing - Processing real-time event streams (Kafka, Flink)
  8. streaming-aggregations - Windowing, sessionization, time-series aggregation
  9. timely-dataflow - Low-latency streaming computation with progress tracking

Load Full Category Details

For complete descriptions and workflows:

Read ../data/INDEX.md

This loads the full Data category index with:

  • Detailed skill descriptions
  • Usage triggers for each skill
  • Common workflow combinations
  • Cross-references to related skills

Load Specific Skills

Load individual skills as needed:

Traditional ETL/Batch

Read ../data/batch-processing.md Read ../data/data-validation.md Read ../data/etl-patterns.md Read ../data/pipeline-orchestration.md

Stream Processing

Read ../data/stream-processing.md Read ../data/streaming-aggregations.md

Advanced Dataflow Systems

Read ../data/timely-dataflow.md Read ../data/differential-dataflow.md Read ../data/dataflow-coordination.md

Common Workflow Combinations

Real-Time Analytics Pipeline

Load these skills together:

Read ../data/stream-processing.md # Kafka setup Read ../data/streaming-aggregations.md # Windowing patterns Read ../data/dataflow-coordination.md # Coordination

Incremental Computation System

Load these skills together:

Read ../data/timely-dataflow.md # Foundation Read ../data/differential-dataflow.md # Incremental updates Read ../data/dataflow-coordination.md # Distributed coordination

Hybrid Batch + Stream

Load these skills together:

Read ../data/batch-processing.md # Batch jobs Read ../data/stream-processing.md # Stream processing Read ../data/pipeline-orchestration.md # Overall coordination

Progressive Loading

This gateway skill enables progressive loading:

  • Level 1: Gateway loads automatically (you're here now)
  • Level 2: Load category INDEX.md for full overview
  • Level 3: Load specific skills as needed

Usage Instructions

  1. Auto-activation: This skill loads automatically when Claude Code detects data work
  2. Browse skills: Run Read ../data/INDEX.md for full category overview
  3. Load specific skills: Use bash commands above to load individual skills

Next Steps: Run Read ../data/INDEX.md to see full category details.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

anti-slop

No summary provided by upstream source.

Repository SourceNeeds Review
120-rand
General

elegant-design

No summary provided by upstream source.

Repository SourceNeeds Review
70-rand
General

discover-frontend

No summary provided by upstream source.

Repository SourceNeeds Review
47-rand
Research

discover-research

No summary provided by upstream source.

Repository SourceNeeds Review
42-rand