Lean 4 Theorem Proving

Use this skill whenever you're editing Lean 4 proofs, debugging Lean builds, formalizing mathematics in Lean, or learning Lean 4 concepts. It prioritizes LSP-based inspection and mathlib search, with scripted primitives for sorry analysis, axiom checking, and error parsing.

Core Principles

Search before prove. Many mathematical facts already exist in mathlib. Search exhaustively before writing tactics.

Build incrementally. Lean's type checker is your test suite—if it compiles with no sorries and standard axioms only, the proof is sound.

Respect scope. Follow the user's preference: fill one sorry, its transitive dependencies, all sorries in a file, or everything. Ask if unclear.

Never change statements or add axioms without explicit permission. Theorem/lemma statements, type signatures, and docstrings are off-limits unless the user requests changes. Inline comments may be adjusted; docstrings may not (they're part of the API). Custom axioms require explicit approval—if a proof seems to need one, stop and discuss.

Commands

Command Purpose

/lean4:formalize

Turn informal math into Lean statements

/lean4:prove

Guided cycle-by-cycle theorem proving

/lean4:autoprove

Autonomous multi-cycle proving with stop rules

/lean4:checkpoint

Verified save point (build + axiom check + commit)

/lean4:review

Quality audit (--mode=batch or --mode=stuck )

/lean4:refactor

Strategy-level proof simplification

/lean4:golf

Optimize proofs for brevity

/lean4:learn

Interactive teaching and mathlib exploration

/lean4:doctor

Plugin troubleshooting and migration help

Which Command?

Situation Command

Draft a Lean statement from an informal claim /lean4:formalize

Filling sorries (interactive) /lean4:prove

Filling sorries (unattended) /lean4:autoprove

Verified save point /lean4:checkpoint

Quality check (read-only) /lean4:review

Simplify proof strategies (mathlib leverage, helpers) /lean4:refactor

Optimizing compiled proofs /lean4:golf

New to this project / exploring /lean4:learn --mode=repo

Navigating mathlib for a topic /lean4:learn --mode=mathlib

Something not working /lean4:doctor

Formalize + prove end-to-end /lean4:autoprove --formalize=auto --source=... --claim-select=first --formalize-out=...

Typical Workflow

/lean4:formalize Turn informal math into Lean statements (optional entry) ↓ /lean4:prove Guided cycle-by-cycle proving (asks before each cycle) /lean4:autoprove Autonomous multi-cycle proving (runs with stop rules) ↓ /lean4:refactor Simplify proof strategies (optional, or --dry-run to preview) ↓ /lean4:golf Optimize proofs for tactic-level brevity (optional) ↓ /lean4:checkpoint Create verified save point

Use /lean4:learn at any point to explore repo structure or navigate mathlib. Use /lean4:formalize standalone or via --formalize=auto on autoprove for end-to-end source-to-proof.

Notes:

/lean4:prove asks before each cycle; /lean4:autoprove loops autonomously with hard stop conditions
Both trigger /lean4:review at configured intervals (--review-every )
When reviews run (via --review-every ), they act as gates: review → replan → continue. In prove, replan requires user approval; in autoprove, replan auto-continues
Review supports --mode=batch (default) or --mode=stuck (triage); review is always read-only
--formalize=auto on autoprove wraps formalize+prove in a single command (source → claims → skeletons → proofs)
If you hit environment issues, run /lean4:doctor to diagnose

LSP Tools (Preferred)

Sub-second feedback and search tools (LeanSearch, Loogle, LeanFinder) via Lean LSP MCP:

lean_goal(file, line) # See exact goal lean_hover_info(file, line, col) # Understand types lean_local_search("keyword") # Fast local + mathlib (unlimited) lean_leanfinder("goal or query") # Semantic, goal-aware (10/30s) lean_leansearch("natural language") # Semantic search (3/30s) lean_loogle("?a → ?b → _") # Type-pattern (unlimited if local mode) lean_hammer_premise(file, line, col) # Premise suggestions for simp/aesop/grind (3/30s) lean_state_search(file, line, col) # Goal-conditioned lemma search (3/30s) lean_multi_attempt(file, line, snippets=[...]) # Test multiple tactics

Core Primitives

Script Purpose Output

sorry_analyzer.py

Find sorries with context text (default), json, markdown, summary

check_axioms_inline.sh

Check for non-standard axioms text

smart_search.sh

Multi-source mathlib search text

find_golfable.py

Detect optimization patterns JSON

find_usages.sh

Find declaration usages text

Usage: Invoked by commands automatically. See references/ for details.

Invocation contract: Never run bare script names. Always use:

Python: ${LEAN4_PYTHON_BIN:-python3} "$LEAN4_SCRIPTS/script.py" ...
Shell: bash "$LEAN4_SCRIPTS/script.sh" ...
Report-only calls: add --report-only to sorry_analyzer.py , check_axioms_inline.sh , unused_declarations.sh — suppresses exit 1 on findings; real errors still exit 1. Do not use in gate commands like /lean4:checkpoint .
Keep stderr visible for Lean scripts (no /dev/null redirection), so real errors are not hidden.

If $LEAN4_SCRIPTS is unset or missing, run /lean4:doctor and stay LSP-only until resolved.

Automation

/lean4:prove and /lean4:autoprove handle most tasks:

prove — guided, asks before each cycle. Ideal for interactive sessions.
autoprove — autonomous, loops with hard stop rules. Ideal for unattended runs.

Both share the same cycle engine (plan → work → checkpoint → review → replan → continue/stop) and follow the LSP-first protocol: LSP tools are normative for discovery and search; script fallback only when LSP is unavailable or exhausted. Compiler-guided repair is escalation-only — not the first response to build errors. For complex proofs, they may delegate to internal workflows for deep sorry-filling (with snapshot, rollback, and scope budgets), proof repair, or axiom elimination. You don't invoke these directly.

Skill-Only Behavior

When editing .lean files without invoking a command, the skill runs one bounded pass:

Read the goal or error via lean_goal /lean_diagnostic_messages
Search mathlib with up to 2 LSP tools (e.g. lean_local_search

lean_leanfinder /lean_leansearch /lean_loogle )

Try the Automation Tactics cascade
Validate with lean_diagnostic_messages (no project-gate lake build in this mode)
No looping, no deep escalation, no multi-cycle behavior, no commits
End with suggestions:

Use /lean4:prove for guided cycle-by-cycle help. Use /lean4:autoprove for autonomous cycles with stop safeguards.

Quality Gate

A proof is complete when:

lake build passes
Zero sorries in agreed scope
Only standard axioms (propext , Classical.choice , Quot.sound )
No statement changes without permission

Verification ladder: lean_diagnostic_messages(file) per-edit → lake env lean <path/to/File.lean> file gate (run from project root) → lake build project gate only. See cycle-engine: Build Target Policy.

Common Fixes

See compilation-errors for error-by-error guidance (type mismatch, unknown identifier, failed to synthesize, timeout, etc.).

Type Class Patterns

-- Local instance for this proof block haveI : MeasurableSpace Ω := inferInstance letI : Fintype α := ⟨...⟩

-- Scoped instances (affects current section) open scoped Topology MeasureTheory

Order matters: provide outer structures before inner ones.

Automation Tactics

Try in order (stop on first success): rfl → simp → ring → linarith → nlinarith → omega → exact? → apply? → grind → aesop

Note: exact? /apply? query mathlib (slow). grind and aesop are powerful but may timeout. See grind-tactic for interactive workflows, annotation strategy, and simproc escalation.

Troubleshooting

If LSP tools aren't responding, scripts provide fallback for all operations. If environment variables (LEAN4_SCRIPTS , LEAN4_REFS ) are missing, run /lean4:doctor to diagnose.

Script environment check:

echo "$LEAN4_SCRIPTS" ls -l "$LEAN4_SCRIPTS/sorry_analyzer.py"

One-pass discovery for troubleshooting (human-readable default text):

${LEAN4_PYTHON_BIN:-python3} "$LEAN4_SCRIPTS/sorry_analyzer.py" . --report-only

Structured output (optional): --format=json

Counts only (optional): --format=summary

Cold start / fresh worktree:

Fresh worktree or after lake clean ? Prime the cache in that worktree before the first real build.
Use the project's cache command: lake cache get on newer Lake, or lake exe cache get where the project still uses the mathlib cache executable.
If Lean LSP is cold or timing out on first use, run one lake build to bootstrap the workspace.
After bootstrap, return to the normal verification ladder: lean_diagnostic_messages(file) → lake env lean <path/to/File.lean> (from project root) → lake build only at checkpoint/final gate.
Do not symlink another worktree's .lake/build ; use Lake cache/artifact mechanisms instead.

References

Cycle Engine: cycle-engine — shared prove/autoprove logic (stuck, deep mode, falsification, safety)

LSP Tools: lean-lsp-server (quick start), lean-lsp-tools-api (full API — grep ^## for tool names)

Search: mathlib-guide (read when searching for existing lemmas), lean-phrasebook (math→Lean translations)

Errors: compilation-errors (read first for any build error), instance-pollution (typeclass conflicts — grep ## Sub- for patterns), compiler-guided-repair (escalation-only repair — not first-pass)

Tactics: tactics-reference (tactic lookup — grep ^### TacticName ), grind-tactic (SMT-style automation — when simp can't close), simp-reference (simp hygiene + custom simprocs), tactic-patterns, calc-patterns

Proof Development: proof-templates, proof-refactoring (28K — grep by topic), proof-simplification (strategy-level: mathlib search, congr lemmas, helper extraction), sorry-filling

Optimization: proof-golfing (includes safety rules, bounded LSP lemma replacement, bulk rewrites, anti-patterns; escalates to axiom-eliminator), proof-golfing-patterns, performance-optimization (grep by symptom), profiling-workflows (diagnose slow builds/proofs)

Domain: domain-patterns (25K — grep ## Area ), measure-theory (28K), axiom-elimination

Style: mathlib-style, verso-docs (Verso doc comment roles and fixups)

Custom Syntax: lean4-custom-syntax (read when building notations, macros, elaborators, or DSLs), metaprogramming-patterns (MetaM/TacticM API — composable blocks, elaborators), scaffold-dsl (copy-paste DSL template), json-patterns (json% syntax + ToJson)

Quality: linter-authoring (project-specific linter rules), ffi-patterns (C/ObjC bindings via Lake)

Workflows: agent-workflows, subagent-workflows, command-examples, learn-pathways (intent taxonomy, game tracks, source handling)

Internals: review-hook-schema

lean4

Safety Notice

Copy this and send it to your AI assistant to learn

One-pass discovery for troubleshooting (human-readable default text):

Structured output (optional): --format=json

Counts only (optional): --format=summary

Source Transparency

Related Skills

lean4-theorem-proving

lean4-memories

image-gen