agentic-kpi-tracking

Agentic KPI Tracking Skill

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.


Install skill "agentic-kpi-tracking" with this command: npx skills add melodic-software/claude-code-plugins/melodic-software-claude-code-plugins-agentic-kpi-tracking

Guides the measurement and tracking of agentic coding KPIs to assess readiness for Zero-Touch Engineering (ZTE).

When to Use

  • Measuring agentic workflow effectiveness

  • Tracking progress toward ZTE

  • Analyzing success patterns

  • Identifying improvement areas

Core KPIs

Summary Metrics

Metric | Calculation | Target
Current Streak | Consecutive successes (Attempts <= 2) | Higher is better
Longest Streak | Best consecutive success run | Track improvement
Average Presence | Mean attempts across all runs | Target: 1
Total Plan Size | Sum of all plan sizes | Track scaling
Total Diff Size | Sum of all changes (added + removed) | Track throughput

Per-Run Metrics

Metric | Source | Meaning
Attempts | Count of plan/patch runs | 1 = perfect, higher = retries
Plan Size | Lines in plan file | Task complexity
Diff Size | Lines added + removed | Change magnitude
Files Changed | Number of files modified | Change scope

Calculation Methods

Attempts Count

Only count workflow restarts:

    # Only plan and patch workflows count as attempts.
    attempts_incrementing = ["adw_plan_iso", "adw_patch_iso"]
    attempts = sum(1 for workflow in all_adws if workflow in attempts_incrementing)

Build, test, and review workflows don't increment the count; only plan and patch runs do.
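As a minimal runnable sketch of the counting rule above: the `count_attempts` name is borrowed from the pseudocode in Step 2 of the Tracking Workflow, and the build, test, and review workflow names in the sample history are hypothetical placeholders, not confirmed identifiers.

```python
# Workflow names that count as an attempt (taken from the list above).
ATTEMPT_WORKFLOWS = {"adw_plan_iso", "adw_patch_iso"}


def count_attempts(all_adws):
    """Return how many plan/patch workflows appear in a run's history."""
    return sum(1 for workflow in all_adws if workflow in ATTEMPT_WORKFLOWS)


# Hypothetical run history: one plan, then a follow-up patch.
# The build/test/review names are illustrative assumptions.
history = [
    "adw_plan_iso",
    "adw_build_iso",
    "adw_test_iso",
    "adw_patch_iso",
    "adw_review_iso",
]
print(count_attempts(history))  # 2
```

A run that needed only its initial plan would score 1, which the Per-Run Metrics table treats as a perfect run.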

Streak Calculation

    # Assumes runs are ordered oldest first, so walk backwards from the newest.
    current_streak = 0
    for run in reversed(runs):
        if run.attempts <= 2:
            current_streak += 1
        else:
            break
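The Longest Streak metric has no calculation listed, so here is one possible sketch alongside the current streak. It assumes runs are given as a plain list of attempt counts ordered oldest first, and that "success" means at most 2 attempts, as in the Summary Metrics table. The function names are illustrative.

```python
SUCCESS_MAX_ATTEMPTS = 2  # success threshold from the Summary Metrics table


def current_streak(attempts_per_run):
    """Count trailing successes; input is ordered oldest first."""
    streak = 0
    for attempts in reversed(attempts_per_run):
        if attempts > SUCCESS_MAX_ATTEMPTS:
            break
        streak += 1
    return streak


def longest_streak(attempts_per_run):
    """Return the length of the best consecutive run of successes."""
    best = running = 0
    for attempts in attempts_per_run:
        running = running + 1 if attempts <= SUCCESS_MAX_ATTEMPTS else 0
        best = max(best, running)
    return best


runs = [1, 2, 1, 3, 1, 1]  # the fourth run needed 3 attempts
print(current_streak(runs), longest_streak(runs))  # 2 3
```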

Diff Statistics

git diff origin/main --shortstat

Output: X files changed, Y insertions(+), Z deletions(-)
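The shortstat line can be turned into numbers with a small parser. This is a sketch, and `parse_shortstat` is a hypothetical helper name. Git omits the insertions or deletions clause when its count is zero and uses singular forms for a count of one, so each part is matched independently.

```python
import re


def parse_shortstat(line):
    """Extract (files, insertions, deletions) from a --shortstat line."""

    def grab(pattern):
        match = re.search(pattern, line)
        return int(match.group(1)) if match else 0

    files = grab(r"(\d+) files? changed")
    added = grab(r"(\d+) insertions?\(\+\)")
    removed = grab(r"(\d+) deletions?\(-\)")
    return files, added, removed


files, added, removed = parse_shortstat(
    " 4 files changed, 67 insertions(+), 23 deletions(-)"
)
print(files, added + removed)  # 4 90
```

The Diff Size metric is then `added + removed`, and Files Changed is the first value.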

KPI File Format

Store in app_docs/agentic_kpis.md or equivalent:

Agentic KPIs

Summary

Metric | Value
Current Streak | 5
Longest Streak | 12
Average Presence | 1.3
Total Plan Size | 450 lines
Total Diff Size | 2,340 lines

Detail

Date | ADW ID | Issue | Class | Attempts | Plan Size | Diff +/- | Files
2024-01-15 | abc123 | #45 | /bug | 1 | 35 | +45/-12 | 3
2024-01-14 | def456 | #44 | /feature | 2 | 85 | +120/-30 | 8
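One way to produce a Detail row is sketched below. The `RunRecord` type and its field names are hypothetical, and the pipe-delimited output assumes the KPI file is a Markdown table. The sample values are illustrative.

```python
from dataclasses import dataclass


@dataclass
class RunRecord:
    """One row of the Detail table (field names are assumptions)."""
    date: str
    adw_id: str
    issue: int
    issue_class: str
    attempts: int
    plan_size: int
    added: int
    removed: int
    files: int


def detail_row(run):
    """Render a run as a Markdown table row in the column order above."""
    cells = [
        run.date,
        run.adw_id,
        f"#{run.issue}",
        run.issue_class,
        str(run.attempts),
        str(run.plan_size),
        f"+{run.added}/-{run.removed}",
        str(run.files),
    ]
    return "| " + " | ".join(cells) + " |"


row = detail_row(RunRecord("2024-01-16", "abc123", 45, "/bug", 1, 45, 67, 23, 4))
print(row)  # | 2024-01-16 | abc123 | #45 | /bug | 1 | 45 | +67/-23 | 4 |
```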

Tracking Workflow

Step 1: Gather Current Run Data

From state or git:

  • ADW ID

  • Issue number

  • Issue classification

  • Plan file path

  • All workflows run (for attempts)

Step 2: Calculate Metrics

    attempts = count_attempts(all_adws)
    plan_size = wc_lines(plan_file)
    diff_stats = parse_git_diff()
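The `wc_lines` helper is not defined in this skill. A minimal sketch, assuming the plan is a UTF-8 text file and that Plan Size means its raw line count (blank lines included), could look like this:

```python
import tempfile
from pathlib import Path


def wc_lines(plan_file):
    """Count lines in the plan file, roughly like `wc -l`."""
    with open(plan_file, encoding="utf-8") as handle:
        return sum(1 for _ in handle)


# Demonstrate on a throwaway plan file (the contents are made up).
with tempfile.TemporaryDirectory() as tmp:
    plan = Path(tmp) / "plan.md"
    plan.write_text("# Plan\n\n- step one\n- step two\n", encoding="utf-8")
    plan_size = wc_lines(plan)

print(plan_size)  # 4
```

Unlike `wc -l`, this also counts a final line that lacks a trailing newline.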

Step 3: Update Detail Table

Add new row with current run data.

Step 4: Recalculate Summary

Update all summary metrics based on full detail table.
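The non-streak summary values can be recomputed from the detail rows roughly as follows. The dictionary keys are hypothetical, and rounding Average Presence to two decimals is an assumption based on the sample output format.

```python
from statistics import mean


def summarize(rows):
    """Recompute summary metrics from all detail rows (at least one)."""
    return {
        "average_presence": round(mean([r["attempts"] for r in rows]), 2),
        "total_plan_size": sum(r["plan_size"] for r in rows),
        "total_diff_size": sum(r["added"] + r["removed"] for r in rows),
    }


# Illustrative rows only; the keys are assumptions, not a defined schema.
rows = [
    {"attempts": 2, "plan_size": 85, "added": 120, "removed": 30},
    {"attempts": 1, "plan_size": 35, "added": 45, "removed": 12},
]
summary = summarize(rows)
print(summary["average_presence"], summary["total_plan_size"], summary["total_diff_size"])
# 1.5 120 207
```

Recomputing from the full table each time, rather than adjusting the previous summary incrementally, keeps the summary consistent if a row is ever corrected.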

Step 5: Analyze Trends

  • Is streak increasing?

  • Is average presence decreasing?

  • Are plan sizes growing (handling bigger tasks)?

ZTE Readiness Indicators

Based on KPIs, assess ZTE readiness:

Indicator | Threshold | Status
Current Streak | >= 5 | Ready to try ZTE
Average Presence | <= 1.5 | Good efficiency
Recent Failures | 0 in last 10 | High confidence
Plan Size Trend | Increasing | Scaling up
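The indicators can be checked mechanically, as in the sketch below. It rests on several assumptions: the streak threshold is read as "at least 5", a failure is a run with more than 2 attempts, and "Increasing" is interpreted as the newer half of runs having a larger mean plan size than the older half. The function name and result keys are hypothetical.

```python
def zte_readiness(attempts, plan_sizes):
    """Evaluate the indicators; inputs are per-run values, oldest first.

    Requires at least one run.
    """
    streak = 0
    for count in reversed(attempts):
        if count > 2:  # assumed definition of a failed run
            break
        streak += 1

    half = len(plan_sizes) // 2
    older, newer = plan_sizes[:half], plan_sizes[half:]
    growing = bool(older) and sum(newer) / len(newer) > sum(older) / len(older)

    return {
        "current_streak_ok": streak >= 5,
        "average_presence_ok": sum(attempts) / len(attempts) <= 1.5,
        "no_recent_failures": all(count <= 2 for count in attempts[-10:]),
        "plan_size_growing": growing,
    }


report = zte_readiness([3, 1, 1, 2, 1, 1], [20, 30, 40, 50, 60, 70])
for name, passed in report.items():
    print(f"{name}: {'yes' if passed else 'no'}")
```

In this sample the streak, average, and trend checks pass, but the failed first run still falls within the last 10 runs, so the recent-failures check does not.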

Key Memory References

  • @agentic-kpis.md - KPI definitions from Lesson 002

  • @zte-progression.md - How KPIs relate to ZTE levels

  • @zte-confidence-building.md - Using KPIs for confidence

Output Format

Provide KPI update:

KPI Update

Run: {adw_id}
Issue: #{issue_number} ({issue_class})

This Run

  • Attempts: 1
  • Plan Size: 45 lines
  • Diff: +67/-23 (4 files)

Updated Summary

  • Current Streak: 6 (was 5)
  • Longest Streak: 12 (unchanged)
  • Average Presence: 1.28 (improved)

Analysis

[Trend observations and recommendations]

Anti-Patterns

  • Gaming metrics (easy tasks only)

  • Ignoring failures (not counting retries)

  • Not tracking consistently

  • Celebrating streaks over actual delivery

Version History

  • v1.0.0 (2025-12-26): Initial release

Last Updated

Date: 2025-12-26
Model: claude-opus-4-5-20251101

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
