DOPPLER Debug Skill

Use this skill when generation fails, outputs drift, or Node/browser parity breaks. Pair it with doppler-bench when investigating performance regressions.

Mandatory Style Guides

Read these before non-trivial debug-flow, parity, or harness-contract changes:

docs/style/general-style-guide.md
docs/style/javascript-style-guide.md
docs/style/config-style-guide.md
docs/style/command-interface-design-guide.md
docs/style/harness-style-guide.md

Developer Guide Routing

When debugging turns into extension work, also open:

docs/developer-guides/README.md

Common routes:

new manifest/runtime knob or missing contract field: docs/developer-guides/07-manifest-runtime-field.md
command-surface parity fix that becomes a new command or contract surface: docs/developer-guides/12-command-surface.md
missing kernel or runtime path: docs/developer-guides/11-wgsl-kernel.md
attention-path bug that requires a new implementation: docs/developer-guides/13-attention-variant.md
cache/layout bug that requires a new layout strategy: docs/developer-guides/15-kvcache-layout.md
family-level onboarding gap: docs/developer-guides/composite-model-family.md or docs/developer-guides/composite-pipeline-family.md

Execution Plane Contract

Contract fields and tunables are declared in JSON (runtime and harness contract); do not substitute behavior in place in code.
JS coordinates deterministic execution: resolve/validate config, dispatch pipelines, and collect artifacts.
WGSL executes resolved kernels; any policy branch belongs to config/rule selection before dispatch.
For parity checks, command intent must match: unknown/mismatched intent is a failure, not an alternate path.
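
The parity rule above can be pictured as a guard that runs before any dispatch. This is a minimal sketch under stated assumptions: assertIntentParity, the command object shape, and the use of "investigate" and "calibrate" as the full set of known intents are all hypothetical, not DOPPLER's actual API.

```javascript
// Hypothetical guard; function name, object shape, and the intent set are assumptions.
const KNOWN_INTENTS = new Set(["investigate", "calibrate"]);

function assertIntentParity(nodeCommand, browserCommand) {
  const pairs = [["node", nodeCommand], ["browser", browserCommand]];
  for (const [surface, command] of pairs) {
    if (!KNOWN_INTENTS.has(command.intent)) {
      // Unknown intent is a failure, never a silent fallback path.
      throw new Error(`${surface}: unknown intent ${JSON.stringify(command.intent)}`);
    }
  }
  if (nodeCommand.intent !== browserCommand.intent) {
    throw new Error(
      `intent mismatch: node=${nodeCommand.intent} browser=${browserCommand.intent}`
    );
  }
  return nodeCommand.intent;
}

// Matching intents pass through; anything else throws before dispatch.
console.log(assertIntentParity({ intent: "investigate" }, { intent: "investigate" }));
```

The point of throwing rather than returning a default is that a mismatched run never produces artifacts that look comparable when they are not.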

Fast Triage

# Primary debug run (auto surface = node-first transport; browser fallback only when node transport is unavailable)
npm run debug -- --config '{"request":{"modelId":"MODEL_ID","runtimePreset":"modes/debug"},"run":{"surface":"auto"}}' --json

# Verify pass/fail with inference suite
npm run verify:model -- --config '{"request":{"suite":"inference","modelId":"MODEL_ID","runtimePreset":"modes/debug"},"run":{"surface":"auto"}}' --json

# Force browser relay for mobile/WebGPU parity checks
npm run debug -- --config '{"request":{"modelId":"MODEL_ID","runtimePreset":"diagnostics/debug-logits"},"run":{"surface":"browser","browser":{"channel":"chrome","console":true}}}' --json
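
The "auto" surface rule noted in the first command (node-first, browser only when node transport is unavailable) can be sketched as a small pure function. resolveSurface and its boolean parameter are hypothetical names for illustration; how DOPPLER actually detects transport availability is not shown here.

```javascript
// Hypothetical resolver; only the "auto" and "browser" values come from the commands above.
function resolveSurface(requested, nodeTransportAvailable) {
  if (requested === "browser") {
    return "browser"; // explicit browser request is honored as-is
  }
  if (requested === "auto") {
    // Node first; browser is a fallback only when node transport is unavailable.
    return nodeTransportAvailable ? "node" : "browser";
  }
  throw new Error(`unknown surface: ${requested}`);
}

console.log(resolveSurface("auto", true));
console.log(resolveSurface("auto", false));
```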

Runtime Overrides (Config-First)

Use runtime JSON patches instead of ad-hoc flags:

npm run debug -- \
  --config '{"request":{"modelId":"MODEL_ID"},"run":{"surface":"auto"}}' \
  --runtime-config '{"shared":{"tooling":{"intent":"investigate"},"debug":{"trace":{"enabled":true,"categories":["attn","ffn"],"maxDecodeSteps":2}}},"inference":{"batching":{"maxTokens":8},"sampling":{"temperature":0}}}' \
  --json
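
When the patch grows, hand-quoting JSON on the command line gets error-prone. One option is to build the arguments from plain objects, as in the sketch below. buildDebugArgs is a hypothetical helper; only the flags (--config, --runtime-config, --json) and the field names are taken from the command above.

```javascript
// Hypothetical helper: assembles the argv for `npm run debug` from plain objects.
function buildDebugArgs(modelId, runtimePatch) {
  const config = { request: { modelId }, run: { surface: "auto" } };
  return [
    "run", "debug", "--",
    "--config", JSON.stringify(config),
    "--runtime-config", JSON.stringify(runtimePatch),
    "--json",
  ];
}

// Same patch as the command above, expressed as an object.
const runtimePatch = {
  shared: {
    tooling: { intent: "investigate" },
    debug: { trace: { enabled: true, categories: ["attn", "ffn"], maxDecodeSteps: 2 } },
  },
  inference: { batching: { maxTokens: 8 }, sampling: { temperature: 0 } },
};

const debugArgs = buildDebugArgs("MODEL_ID", runtimePatch);
console.log(debugArgs);
```

Passing the array to Node's child_process.spawn("npm", debugArgs) avoids shell quoting entirely, since each JSON string travels as a single argument.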

Perf-Focused Investigation

# Investigate-mode profile run (trace/profiler enabled by preset)
npm run debug -- --config '{"request":{"modelId":"MODEL_ID","runtimePreset":"experiments/gemma3-profile"},"run":{"surface":"auto"}}' --json

# Fast readback sensitivity checks
npm run bench -- --config '{"request":{"modelId":"MODEL_ID","runtimePreset":"experiments/gemma3-investigate-readback-r1","cacheMode":"warm"},"run":{"surface":"browser"}}' --json
npm run bench -- --config '{"request":{"modelId":"MODEL_ID","runtimePreset":"experiments/gemma3-investigate-readback-r8","cacheMode":"warm"},"run":{"surface":"browser"}}' --json

# Direct override for decode cadence tuning
npm run bench -- \
  --config '{"request":{"modelId":"MODEL_ID","cacheMode":"warm"},"run":{"surface":"browser"}}' \
  --runtime-config '{"shared":{"tooling":{"intent":"investigate"}},"inference":{"batching":{"batchSize":4,"readbackInterval":4,"stopCheckMode":"per-token","maxTokens":128},"sampling":{"temperature":0}}}' \
  --json

Notes:

runtime.shared.tooling.intent="calibrate" forbids trace/profiler instrumentation.
Set runtime.shared.tooling.intent="investigate" for profiling/tracing runs.
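
A pre-flight check for the intent rule might look like the sketch below. findIntentViolations is a hypothetical helper, and the profiler field path (shared.debug.profiler.enabled) is an assumption; only the trace path appears in the commands in this document.

```javascript
// Hypothetical pre-flight check on a runtime patch; not DOPPLER's own validator.
function findIntentViolations(runtime) {
  const shared = runtime.shared ?? {};
  const intent = shared.tooling?.intent;
  const traceOn = shared.debug?.trace?.enabled === true;
  const profilerOn = shared.debug?.profiler?.enabled === true; // assumed field path

  const violations = [];
  if (intent === "calibrate" && traceOn) {
    violations.push("trace enabled while intent is calibrate");
  }
  if (intent === "calibrate" && profilerOn) {
    violations.push("profiler enabled while intent is calibrate");
  }
  return violations;
}

const badPatch = {
  shared: { tooling: { intent: "calibrate" }, debug: { trace: { enabled: true } } },
};
console.log(findIntentViolations(badPatch));
```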

Cache and Surface Control

# Cold browser run (wipe OPFS cache before launch)
npm run debug -- --config '{"request":{"modelId":"MODEL_ID","cacheMode":"cold"},"run":{"surface":"browser"}}' --json

# Warm browser run (reuse OPFS cache)
npm run debug -- --config '{"request":{"modelId":"MODEL_ID","cacheMode":"warm"},"run":{"surface":"browser"}}' --json

What to Inspect in Results

result.metrics.modelLoadMs, result.metrics.firstTokenMs
result.metrics.prefillTokensPerSecTtft (preferred) and result.metrics.prefillTokensPerSec
result.metrics.decodeTokensPerSec
result.metrics.gpu (if available)
result.memoryStats
result.deviceInfo
result.reportInfo (report backend/path)
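
To pull the fields above out of a --json run, a small summarizer can help. This sketch assumes the result object has already been parsed from the command output; summarizeResult is a hypothetical helper and the sample values are placeholders, not measurements.

```javascript
// Hypothetical summarizer over the result fields listed above.
function summarizeResult(result) {
  const metrics = result.metrics ?? {};
  return {
    modelLoadMs: metrics.modelLoadMs ?? null,
    firstTokenMs: metrics.firstTokenMs ?? null,
    // Prefer the TTFT-based prefill rate; fall back to the plain one.
    prefillTokensPerSec:
      metrics.prefillTokensPerSecTtft ?? metrics.prefillTokensPerSec ?? null,
    decodeTokensPerSec: metrics.decodeTokensPerSec ?? null,
    hasGpuMetrics: metrics.gpu != null,
    hasMemoryStats: result.memoryStats != null,
  };
}

// Placeholder input for illustration only.
const sampleResult = {
  metrics: { modelLoadMs: 1000, firstTokenMs: 200, prefillTokensPerSec: 50, decodeTokensPerSec: 10 },
  memoryStats: {},
};
console.log(summarizeResult(sampleResult));
```

Missing fields come back as null rather than undefined, so a diff between two runs shows absent metrics explicitly.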

Canonical Files

tools/doppler-cli.js
src/tooling/command-api.js
src/tooling/node-command-runner.js
src/tooling/node-browser-command-runner.js
src/inference/browser-harness.js
src/config/presets/runtime/modes/debug.json
docs/developer-guides/README.md

Related Skills

doppler-bench for perf regression quantification
doppler-convert when conversion integrity is suspected
