MLOps

Deploy ML models to production with pipelines, monitoring, serving, and reproducibility best practices.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "MLOps" with this command: npx skills add ivangdavila/mlops

Quick Reference

Topic | File | Key Trap
CI/CD and DAGs | pipelines.md | Coupling training/inference deps
Model serving | serving.md | Cold start with large models
Drift and alerts | monitoring.md | Only technical metrics
Versioning | reproducibility.md | Not versioning preprocessing
GPU infrastructure | gpu.md | GPU request = full device

Critical Traps

Training-Serving Skew:

  • Preprocessing in notebook ≠ preprocessing in service → silent bugs
  • Pandas code carried from the notebook into the service → memory leaks in production (use native Python types on the request path)
  • Feature store values at training time ≠ values at serving time unless joins are point-in-time correct
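
One way to reduce the notebook-versus-service mismatch described above is to keep a single preprocessing function that both the training job and the serving handler import. The following is a minimal sketch under stated assumptions: the feature names (`age`, `income`), the statistics in `FEATURE_STATS`, and the function names are hypothetical and do not come from the skill itself.

```python
import math
from typing import List, Mapping

# Hypothetical statistics, assumed to be computed once at training time
# and shipped together with the model artifact.
FEATURE_STATS = {"age": {"mean": 40.0, "std": 10.0}}
PREPROCESSING_VERSION = "v1"  # hypothetical tag stored next to the model


def preprocess(raw: Mapping[str, object]) -> List[float]:
    """Turn one raw record into a feature vector using native Python types."""
    stats = FEATURE_STATS["age"]
    age = float(raw.get("age", stats["mean"]))
    age_scaled = (age - stats["mean"]) / stats["std"]
    income = float(raw.get("income", 0.0))
    log_income = math.log1p(max(income, 0.0))
    return [age_scaled, log_income]


def build_training_matrix(rows: List[Mapping[str, object]]) -> List[List[float]]:
    """Training path: applies the same function to every row."""
    return [preprocess(row) for row in rows]


def handle_request(payload: Mapping[str, object]) -> List[float]:
    """Serving path: applies the same function to a single request."""
    return preprocess(payload)
```

Because both paths call `preprocess`, a change to the transformation reaches training and serving together, and `PREPROCESSING_VERSION` gives the change something to be versioned under.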

GPU Memory:

  • nvidia.com/gpu: 1 (set under resources.limits) reserves the ENTIRE GPU, not partial memory
  • MIG/MPS sharing has real limitations (not plug-and-play)
  • OOM on GPU kills pod with no useful logs
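
The whole-device behaviour can be made explicit when pod specs are generated programmatically. The sketch below assumes a cluster where the NVIDIA device plugin exposes the `nvidia.com/gpu` resource and builds a plain dictionary that mirrors a container spec. The function name `gpu_container_spec` and the example image name are hypothetical.

```python
def gpu_container_spec(name: str, image: str, gpus: int) -> dict:
    """Build a container spec dict that asks for whole GPUs.

    GPU counts are whole devices, so fractional or non-positive
    values are rejected before the spec ever reaches the cluster.
    """
    if isinstance(gpus, bool) or not isinstance(gpus, int) or gpus < 1:
        raise ValueError("GPU count must be a positive whole number")
    return {
        "name": name,
        "image": image,
        # GPUs are declared under limits; each unit is one full device.
        "resources": {"limits": {"nvidia.com/gpu": gpus}},
    }
```

For example, `gpu_container_spec("trainer", "example/image:tag", 1)` yields a spec that occupies one full device, while passing `0.5` raises `ValueError` instead of silently producing an invalid request.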

Model Versioning ≠ Code Versioning:

  • Model artifacts need separate versioning (MLflow, W&B, DVC)
  • Training data version + preprocessing version + code version = reproducibility
  • Rollback requires keeping old model versions deployable
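
The "data version + preprocessing version + code version" rule above can be sketched as a single fingerprint recorded next to each model artifact. This is an illustrative sketch using only the standard library, not the API of MLflow, W&B, or DVC. The function names and the example version strings are hypothetical.

```python
import hashlib
import json


def sha256_bytes(data: bytes) -> str:
    """Content hash, e.g. for a training data snapshot."""
    return hashlib.sha256(data).hexdigest()


def reproducibility_fingerprint(
    data_hash: str, preprocessing_version: str, code_version: str
) -> str:
    """Combine the three versions into one stable identifier."""
    record = {
        "code_version": code_version,
        "data_hash": data_hash,
        "preprocessing_version": preprocessing_version,
    }
    # sort_keys makes the serialization, and so the hash, deterministic.
    canonical = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()
```

Two training runs share a fingerprint only if all three inputs match, so a change to preprocessing alone produces a new identifier even when the code commit and data are unchanged.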

Drift Detection Timing:

  • Retraining trigger isn't just "drift > threshold" → cost/benefit matters
  • Delayed ground truth makes concept drift detection lag by weeks
  • Upstream data pipeline changes cause drift without model issues
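
The point that a retraining trigger needs more than a threshold can be sketched as follows. The skill does not name a drift metric, so this example assumes the Population Stability Index (PSI) over pre-binned proportions. The threshold, the `expected_gain` and `retrain_cost` parameters, and their units are hypothetical.

```python
import math
from typing import Sequence


def psi(expected: Sequence[float], actual: Sequence[float], eps: float = 1e-6) -> float:
    """Population Stability Index between two binned distributions.

    Both inputs are per-bin proportions over the same bins.
    """
    if len(expected) != len(actual):
        raise ValueError("both distributions must use the same bins")
    total = 0.0
    for e, a in zip(expected, actual):
        e = max(e, eps)  # guard against log(0) and division by zero
        a = max(a, eps)
        total += (a - e) * math.log(a / e)
    return total


def should_retrain(
    drift_score: float,
    drift_threshold: float,
    expected_gain: float,
    retrain_cost: float,
) -> bool:
    """Retrain only when drift is high AND the expected benefit exceeds the cost."""
    return drift_score > drift_threshold and expected_gain > retrain_cost
```

With this shape, a large drift score on a low-value model does not trigger retraining by itself, which matches the cost/benefit framing above. It does not distinguish model drift from upstream pipeline changes; that check would need to happen before the score is trusted.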

Scope

This skill ONLY covers:

  • CI/CD pipelines for models
  • Model serving and scaling
  • Monitoring and drift detection
  • Reproducibility practices
  • GPU infrastructure patterns

Does NOT cover: ML algorithms, feature engineering, hyperparameter tuning.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
