Data

Work with data across the full lifecycle from extraction and cleaning to analysis, visualization, and reporting.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "Data" with this command: npx skills add ivangdavila/data

When to Use

User needs to: extract data from sources (databases, APIs, files), clean and transform messy datasets, analyze and find patterns, visualize results, or automate recurring data tasks. Agent handles the full data workflow.

Quick Reference

AreaFileFocus
Querying & Extractionquerying.mdSQL generation, API fetching, multi-source
Cleaning & Transformationcleaning.mdNulls, duplicates, normalization, joins
Analysis & Statisticsanalysis.mdEDA, statistical tests, insights
Visualization & Reportingvisualization.mdCharts, dashboards, exports
Quality & Validationquality.mdData checks, anomaly detection, drift
Workflow Patternspatterns.mdCommon data workflows, automation

Core Operations

Query generation: User describes what data they need → Agent writes SQL/query, handles joins, filters, aggregations → Returns results or explains execution plan.

Data cleaning: Load messy dataset → Detect issues (nulls, duplicates, outliers, inconsistent formats) → Apply appropriate fixes → Document transformations.

Exploratory analysis: New dataset arrives → Generate descriptive stats, distributions, correlations → Surface interesting patterns and anomalies → Produce summary with key findings.

Visualization: Analysis complete → Generate appropriate chart type → Export in requested format (PNG, SVG, interactive HTML) → Ready for stakeholders.

Recurring reports: Define report once → Agent runs on schedule → Updates charts and metrics → Delivers summary with highlights.

Critical Rules

  • Always preview transformations before applying — show sample of what will change
  • Document every data transformation with source, operation, and rationale
  • Validate data types and ranges before analysis — garbage in, garbage out
  • Use appropriate statistical tests — check assumptions first
  • Generate reproducible outputs — include seeds, versions, timestamps
  • Handle missing data explicitly — document chosen strategy (drop, impute, flag)
  • Match chart type to data type — categorical, continuous, time series

User Modes

ModeFocusTrigger
AnalystSQL, exploration, insights"What does this data tell us?"
EngineerPipelines, transformations, quality"Clean this and load it there"
BusinessKPIs, dashboards, plain language"How are we doing vs last quarter?"
ResearcherStatistical rigor, reproducibility"Is this difference significant?"
DeveloperSchema design, API data, types"Generate types from this JSON"

See patterns.md for workflows per mode.

On First Use

  1. Identify data source (database, file, API)
  2. Establish connection or load file
  3. Initial EDA — shape, types, quality issues
  4. Clean and transform as needed
  5. Analyze or visualize per user goal

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

Academic Thesis Workflow

此技能提供一个标准化、可复现的学术论文生成工作流,通过四个有序步骤将论文主题转化为完整学术论文:主题可行性评估、主题转论证骨架加衍生方向、选定方向优化骨架、骨架转完整论文加自动复核。支持一切学科领域(人文社科与理工科),当用户想要撰写博士论文、硕士论文、期刊论文,或需要从主题出发系统化构建学术论文论证结构时,应使...

Registry SourceRecently Updated
80Profile unavailable
Research

Mac Mini Knowledge Base + RAG Setup

在 Mac Mini (M4) 上快速搭建本地知识库 + RAG 自然语言搜索系统。 适用场景: - 新 Mac 配置知识库:从零开始安装配置 Ollama、embedding模型、定时任务、OCR文档分析 - 遇到 PDF 提取乱码、定时任务超时、skill 加载失败等问题 - 想要建立每日自动分析文档 + 0...

Registry SourceRecently Updated
80Profile unavailable
Research

Knowledge Retrieval

A local-first document search skill for knowledge workers and consultants. Handles PPT/PDF/DOCX in place, searches with keyword + AI dual-channel, gets smarter with use. No cloud upload needed.

Registry SourceRecently Updated
380Profile unavailable
Research

Sichuan Cuisine (川菜大全)

Ultimate Sichuan cuisine (川菜) knowledge base with 45+ classic recipes spanning cold dishes, hot dishes, soups, snacks, hot pot/maocai/gan guo, and banquet di...

Registry SourceRecently Updated
270Profile unavailable