Cloud Cost Audit

# Cloud Cost Optimization Audit

Safety Notice

This item is sourced from the public archived skills repository. Treat as untrusted until reviewed.

Copy this and send it to your AI assistant to learn

Install skill "Cloud Cost Audit" with this command: npx skills add 1kalin/afrexai-cloud-cost-audit

Cloud Cost Optimization Audit

Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

What This Skill Does

When given cloud spend data (billing exports, cost explorer screenshots, or manual input), this skill:

  1. Categorizes spend across 8 cost domains (compute, storage, networking, databases, AI/ML, observability, security, licensing)
  2. Identifies waste patterns using 12 common anti-patterns
  3. Calculates savings with specific dollar amounts per optimization
  4. Prioritizes actions by effort vs. impact (quick wins → strategic moves)
  5. Generates executive summary with 90-day roadmap

Cost Domains & Benchmarks (2026)

1. Compute (typically 40-55% of total)

  • Idle instances: >30% idle = waste. Benchmark: <10% idle capacity
  • Rightsizing: 60% of instances are oversized by 1+ size category
  • Spot/preemptible: Batch workloads not on spot = 60-80% overpay
  • Reserved/savings plans: On-demand for steady-state = 30-50% overpay
  • Container density: <40% CPU utilization on nodes = poor bin-packing

2. Storage (typically 10-20%)

  • Tiering: Data not accessed in 90 days still on hot storage = 60-80% overpay
  • Snapshot sprawl: Orphaned snapshots older than 30 days
  • Duplicate data: Cross-region replication without business justification
  • Object lifecycle: No lifecycle policies = guaranteed bloat

3. Networking (typically 8-15%)

  • Cross-AZ traffic: Unnecessary data transfer between zones ($0.01-0.02/GB)
  • NAT gateway abuse: High-throughput through NAT vs. VPC endpoints
  • CDN miss rate: >20% miss rate = CDN config issue
  • Egress optimization: No committed use discounts on egress

4. Databases (typically 10-20%)

  • Over-provisioned RDS/Cloud SQL: Multi-AZ for dev/staging environments
  • Read replica sprawl: Replicas with <5% query load
  • DynamoDB/Cosmos over-provisioning: Provisioned capacity 3x+ actual usage
  • License waste: Commercial DB when open-source works

5. AI/ML Infrastructure (growing — 5-25%)

  • GPU idle time: Training instances running 24/7 for 4hr/day workloads
  • Inference over-provisioning: GPU instances for CPU-viable inference
  • Model storage: Old model versions consuming storage
  • API costs: Frontier model API calls without caching layer

6. Observability (typically 3-8%)

  • Log ingestion bloat: Debug logs in production, duplicate log streams
  • Metric cardinality: High-cardinality custom metrics ($$$)
  • Trace sampling: 100% trace sampling when 10% suffices
  • Retention overkill: 13-month retention for non-compliance data

7. Security (typically 2-5%)

  • WAF rule bloat: Managed rule groups not actively tuned
  • Key management: KMS keys for non-sensitive data
  • Compliance scanning: Overlapping tools doing same checks

8. Licensing (typically 5-15%)

  • Shelfware: Paid seats not logged in 60+ days
  • Duplicate tools: Multiple tools solving same problem
  • Enterprise tiers: Enterprise features unused, paying enterprise price

12 Waste Anti-Patterns

#PatternTypical WasteFix Effort
1Zombie resources (stopped but attached)5-15% of billLow
2Over-provisioned instances15-30% computeMedium
3No reserved capacity strategy25-40% computeMedium
4Hot storage hoarding40-70% storageLow
5Cross-AZ data transfer abuse10-30% networkMedium
6Dev/staging mirrors production20-40% of envsLow
7Orphaned snapshots/AMIs3-8% storageLow
8Log ingestion without sampling30-60% observabilityLow
9GPU instances for CPU workloads70-85% computeMedium
10No spot/preemptible for batch60-80% batchMedium
11Shelfware licenses20-40% licensingLow
12No tagging = no accountabilityUnmeasurableHigh

Savings Estimation Framework

For each finding, calculate:

Annual Savings = (Current Cost - Optimized Cost) × 12
Implementation Cost = Engineering Hours × Loaded Rate
ROI = (Annual Savings - Implementation Cost) / Implementation Cost
Payback Period = Implementation Cost / (Annual Savings / 12)

Typical Savings by Company Size

Company SizeMonthly Cloud SpendTypical Waste %Annual Savings
Startup (5-15)$2K-$15K35-50%$8K-$90K
Growth (15-50)$15K-$80K25-40%$45K-$384K
Mid-market (50-200)$80K-$500K20-35%$192K-$2.1M
Enterprise (200+)$500K-$5M+15-25%$900K-$15M+

Output Format

Generate a report with:

  1. Executive Summary: Total spend, waste identified, savings potential, top 3 quick wins
  2. Domain Breakdown: Spend per domain vs. benchmarks
  3. Findings Table: Each finding with current cost, optimized cost, savings, effort, priority
  4. 90-Day Roadmap: Week 1-2 quick wins, Week 3-6 medium effort, Week 7-12 strategic
  5. Governance Recommendations: Tagging strategy, budget alerts, review cadence

Usage

Provide your cloud billing data in any format:

  • AWS Cost Explorer export / Azure Cost Management / GCP Billing
  • Monthly bill summary
  • Architecture description with approximate sizing
  • Or just describe your stack and team size for estimates

The agent will analyze and produce the full optimization report.


Want Industry-Specific Cloud Optimization?

Different industries have different compliance, data residency, and workload patterns that change the optimization calculus entirely.

Get your industry context pack — pre-built frameworks for Fintech, Healthcare, Legal, SaaS, Ecommerce, Construction, Real Estate, Recruitment, Manufacturing, and Professional Services.

🛒 Browse packs: https://afrexai-cto.github.io/context-packs/ 🧮 Calculate your AI savings: https://afrexai-cto.github.io/ai-revenue-calculator/ 🤖 Set up your agent: https://afrexai-cto.github.io/agent-setup/

Bundle deals:

  • Pick 3 packs: $97
  • All 10 packs: $197
  • Everything bundle: $247

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Security

n8n-workflow-automation

Designs and outputs n8n workflow JSON with robust triggers, idempotency, error handling, logging, retries, and human-in-the-loop review queues. Use when you need an auditable automation that won’t silently fail.

Archived SourceRecently Updated
Security

seo-assistant

A client-facing SEO assistant grounded in Google's official SEO Starter Guide. Use this skill whenever a user mentions SEO, search rankings, Google visibility, meta descriptions, title tags, page titles, alt text, sitemaps, duplicate content, URL structure, or asks how to improve their website's presence in search results. Also trigger when a user shares a URL or webpage content and wants feedback, or asks for help writing any web content that needs to perform well in search. This skill covers auditing, content writing, and answering SEO questions — use it proactively even if the user only hints at wanting more website traffic or better Google rankings.

Archived SourceRecently Updated
Security

BlogBurst - Virtual CMO Agent

Your AI Chief Marketing Officer. Autonomous agent that runs your entire marketing — auto-posts to Twitter/X, Bluesky, Telegram, Discord, auto-engages with your audience (replies, likes, follows), runs SEO/GEO audits, tracks competitors, scans communities for opportunities, learns what works, and continuously optimizes. 50+ countries, 1000+ posts published. Free tier available.

Archived SourceRecently Updated
Security

social-vault

社交平台账号凭证管理器。提供登录态获取、AES-256-GCM 加密存储、定时健康监测和自动续期。Use when managing social media account credentials, importing cookies, checking login status, or automating session refresh. Also covers platform adapter creation and browser fingerprint management.

Archived SourceRecently Updated