# Audit Context Building Skill

## Overview

This skill implements Trail of Bits' audit context building methodology for the agent-studio framework. The core principle: never form conclusions about code without reading it line by line first. The skill systematically builds understanding from the ground up, tracking every assumption, invariant, and data flow explicitly.

Source repository: https://github.com/trailofbits/skills
License: CC-BY-SA-4.0
Methodology: First Principles + 5 Whys + 5 Hows at micro scale
## When to Use

- Before security audits, to build deep codebase understanding
- When analyzing unfamiliar codebases for architectural review
- When debugging complex cross-function interactions
- When verifying correctness of critical code paths (auth, crypto, state machines)
- When preparing for threat modeling with concrete code evidence
- When onboarding to a new codebase section that handles sensitive operations
## Iron Laws

- NEVER form conclusions without line-by-line evidence — every claim about code behavior MUST be backed by specific line references; if you have not read the code, you do not know what it does.
- NEVER trust comments over actual code — comments describe intent; code describes behavior. When they conflict, the code is authoritative; always verify what comments claim.
- NEVER skip error handling paths — error paths frequently contain security-relevant behavior (fallback auth, leaked stack traces, privilege bypass) that is invisible to happy-path analysis.
- ALWAYS map cross-function call flows before analyzing individual functions — isolated function analysis misses inter-function trust assumptions; understand the full call chain first.
- ALWAYS record unverified assumptions explicitly — an unverified assumption is an unexamined risk; mark every assumption with [UNVERIFIED] and track it until confirmed or disproven.
## Anti-Hallucination Rules

- Never assume what a function does based on its name alone
- Never trust comments over actual code behavior
- Never skip error handling paths — they often contain security-relevant behavior
- Never extrapolate behavior from one code path to another without verification
- Always note when you have NOT read a dependency, and mark assumptions as unverified
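The assumption-tracking rule above can be made concrete with a small ledger. This is a hypothetical sketch (not part of the skill's API): every assumption starts UNVERIFIED and is resolved only with line-level evidence.

```typescript
// Minimal assumption ledger: a hypothetical helper for tracking [UNVERIFIED] claims.
type AssumptionStatus = "UNVERIFIED" | "CONFIRMED" | "DISPROVEN";

interface Assumption {
  claim: string;          // what is being assumed
  location: string;       // file:line where the assumption is made
  status: AssumptionStatus;
  evidence?: string;      // file:line that confirms or disproves it
}

const ledger: Assumption[] = [];

// Record an assumption at the moment it is encountered; it starts UNVERIFIED.
function record(claim: string, location: string): Assumption {
  const a: Assumption = { claim, location, status: "UNVERIFIED" };
  ledger.push(a);
  return a;
}

// Resolve only with concrete evidence (a line reference), never from memory.
function resolve(
  a: Assumption,
  status: Exclude<AssumptionStatus, "UNVERIFIED">,
  evidence: string
): void {
  a.status = status;
  a.evidence = evidence;
}

// Usage (locations are illustrative):
const bodyParsed = record("req.body is parsed JSON", "src/auth.ts:45");
resolve(bodyParsed, "CONFIRMED", "app.ts:12 (express.json middleware)");

const unresolved = ledger.filter((a) => a.status === "UNVERIFIED");
console.log(unresolved.length); // 0 — every open entry is an unexamined risk
```

Any entry still UNVERIFIED at report time goes straight into the "Unverified Assumptions" section of the final context report.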
## Phase 1: Initial Reconnaissance

Goal: Map the surface area before diving deep.

### Steps

- Enumerate entry points: find all public APIs, CLI commands, HTTP handlers, and event listeners
- Map the module graph: identify imports, exports, and dependency relationships
- Identify trust boundaries: where does external input enter? Where do privilege changes occur?
- Catalog data stores: databases, files, caches, environment variables, secrets
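Entry-point enumeration can often be bootstrapped mechanically. A minimal sketch, assuming Express-style route registrations (the source text and route names below are illustrative, not from a real codebase):

```typescript
// Sketch: scan source text for Express-style route registrations.
// Real reconnaissance would walk the file tree; this shows the matching step only.
const source = `
app.post('/api/login', authenticateUser);
app.get('/api/users/:id', getUser);
`;

// Matches app.get/post/put/delete('<path>' ...
const routeRe = /app\.(get|post|put|delete)\(\s*'([^']+)'/g;

const entryPoints: { method: string; path: string }[] = [];
let m: RegExpExecArray | null;
while ((m = routeRe.exec(source)) !== null) {
  entryPoints.push({ method: m[1].toUpperCase(), path: m[2] });
}

console.log(entryPoints);
// [ { method: 'POST', path: '/api/login' }, { method: 'GET', path: '/api/users/:id' } ]
```

A grep like this only seeds the list; the Iron Laws still require reading each handler line by line before claiming anything about it.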
### Output Format

**Reconnaissance Report**

**Entry Points**

- `path/to/file.ts:42` — HTTP handler `POST /api/login`
- `path/to/file.ts:89` — HTTP handler `GET /api/users/:id`

**Trust Boundaries**

- External input at: [list locations]
- Privilege escalation at: [list locations]
- Serialization/deserialization at: [list locations]

**Data Stores**

- Database: [type, access patterns]
- File system: [paths, permissions]
- Environment: [variables accessed]
## Phase 2: Deep Analysis (Line-by-Line)

Goal: Build a precise mental model of each critical code path.

### The Analysis Loop

For each function/method under analysis:

- Read every line. No skipping.
- For each line, ask:
  - What state does this line depend on?
  - What state does this line modify?
  - What can go wrong here? (error paths)
  - What assumptions does this line make about its inputs?
  - Is the assumption validated upstream?
- Track findings in a structured note:

**Function:** `authenticateUser(req, res)` at `src/auth.ts:45-92`

**Line-by-Line Notes**

- L45-48: Extracts `email` and `password` from `req.body`. Assumption: body is parsed JSON. Verified: yes, middleware at `app.ts:12`.
- L50: Queries DB for user by email. Assumption: email is sanitized. Verified: no — raw string interpolation. FINDING: SQL injection risk.
- L55-60: Compares password hash. Uses `bcrypt.compare()`. OK: timing-safe comparison.
- L62: Creates JWT token. Assumption: secret is strong. Unverified: need to check env config.

**Invariants**

- User must exist in the DB before authentication succeeds
- Password comparison is timing-safe (bcrypt)
- JWT secret strength is unverified

**Assumptions (Unverified)**

- Email input is sanitized before the DB query
- JWT secret is cryptographically random
- Session duration is bounded

**Call Flow**

`authenticateUser()` → `findUserByEmail()` → `bcrypt.compare()` → `jwt.sign()`
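The SQL injection finding noted above can be sketched concretely. Both builder functions here are hypothetical helpers for illustration; the point is the difference between interpolating input into SQL text and passing it as a parameter:

```typescript
// FINDING pattern: attacker-controlled input lands directly in the SQL text.
function buildQueryUnsafe(email: string): string {
  return `SELECT * FROM users WHERE email = '${email}'`;
}

// Remediation pattern: a placeholder keeps the input out of the SQL text entirely
// (shown in node-postgres $1 style; the exact placeholder syntax is driver-specific).
function buildQueryParameterized(email: string): { text: string; values: string[] } {
  return { text: "SELECT * FROM users WHERE email = $1", values: [email] };
}

const payload = "' OR '1'='1";
console.log(buildQueryUnsafe(payload));
// SELECT * FROM users WHERE email = '' OR '1'='1'  <- injected condition in the SQL
console.log(buildQueryParameterized(payload).text);
// SELECT * FROM users WHERE email = $1             <- SQL text is constant
```

During line-by-line analysis, the question to ask at each query site is which of these two shapes the code actually has, backed by the specific line reference.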
## Phase 3: Cross-Function Flow Analysis

Goal: Trace data and control flow across function boundaries.

### Steps

- Select a critical data flow (e.g., user input to database query)
- Trace forward: follow the data from the entry point through every transformation
- At each boundary, document:
  - What validation occurs?
  - What transformation occurs?
  - Is the data type preserved or changed?
  - Are there implicit type coercions?
- Build the flow diagram:

**Flow: User Login Input to Database**

- `req.body` (raw JSON) → Express body parser
- `{ email, password }` (destructured) → `authenticateUser()`
- `email` (string, UNVALIDATED) → `findUserByEmail(email)` ← RISK
- `email` → SQL query template literal ← FINDING: injection
- Result → `user` object (or null)
- `password` + `user.passwordHash` → `bcrypt.compare()` ← OK
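The UNVALIDATED edge in the flow above is where a boundary check belongs. A minimal sketch, assuming a simple email-shape check (the validator and its regex are illustrative, not the skill's prescribed implementation):

```typescript
// Sketch: validation at the trust boundary, before data crosses into the query layer.
function validateEmail(raw: unknown): string {
  // Reject non-strings and anything that does not look like local@domain.tld.
  if (typeof raw !== "string" || !/^[^@\s]+@[^@\s]+\.[^@\s]+$/.test(raw)) {
    throw new Error("invalid email");
  }
  return raw;
}

// The injection payload from the flow diagram never reaches findUserByEmail:
let rejected = false;
try {
  validateEmail("' OR '1'='1");
} catch {
  rejected = true;
}
console.log(rejected); // true

console.log(validateEmail("alice@example.com")); // alice@example.com — passes through unchanged
```

Validation narrows the attack surface but does not replace parameterized queries; during flow analysis, record both the validation point and the query construction as separate boundary facts.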
## Phase 4: 5 Whys at Micro Scale

Apply 5 Whys to each finding or anomaly discovered:

**Finding: SQL injection in `findUserByEmail`**

- Why is there SQL injection? → Email is concatenated into the query string
- Why is it concatenated? → Developer used template literals instead of parameterized queries
- Why no parameterized query? → The ORM wrapper doesn't enforce parameterization
- Why no input validation? → No validation middleware for this route
- Why no middleware? → Route was added without security review
## Phase 5: 5 Hows at Micro Scale

Apply 5 Hows to verify implementation correctness:

**Verification: JWT Token Generation**

- How is the token created? → `jwt.sign(payload, secret, options)`
- How is the secret managed? → `process.env.JWT_SECRET`
- How is the secret rotated? → No rotation mechanism found
- How is token expiry enforced? → `expiresIn: '24h'` in options
- How is token revocation handled? → No revocation mechanism found
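What `expiresIn: '24h'` amounts to at verification time is a check against the token's `exp` claim. A minimal sketch of that check alone, assuming standard JWT semantics (`exp` in seconds since the Unix epoch); real verification must also validate the signature, e.g. via `jwt.verify`, which is omitted here:

```typescript
// Sketch: manual expiry check on decoded JWT claims (signature check omitted).
interface Claims {
  sub: string;
  exp: number; // expiry, seconds since Unix epoch (RFC 7519 "exp" claim)
}

function isExpired(claims: Claims, nowMs: number = Date.now()): boolean {
  // JWT "exp" is in seconds; compare against milliseconds consistently.
  return nowMs >= claims.exp * 1000;
}

const nowSec = Math.floor(Date.now() / 1000);
const fresh: Claims = { sub: "user-1", exp: nowSec + 24 * 60 * 60 }; // expiresIn: '24h'
const stale: Claims = { sub: "user-2", exp: nowSec - 60 };           // expired a minute ago

console.log(isExpired(fresh)); // false
console.log(isExpired(stale)); // true
```

Expiry bounds token lifetime but is not revocation: a stolen token stays valid until `exp`, which is why the missing revocation mechanism above is a distinct finding.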
## Output: Context Report

The final output is a structured context report:

**Audit Context Report: [Component Name]**

**Summary**

- Files analyzed: N
- Functions analyzed: N
- Findings: N (Critical: X, High: Y, Medium: Z)
- Unverified assumptions: N

**Mental Model**

[High-level description of how the component works, backed by line references]

**Findings**

[Each finding with line references, 5 Whys analysis, severity]

**Invariants**

[All tracked invariants with verification status]

**Unverified Assumptions**

[All assumptions that require further investigation]

**Call Flow Maps**

[All traced data/control flows]

**Recommendations**

[Prioritized list of actions based on findings]
## Integration with Agent-Studio

### Recommended Workflow

1. Invoke the audit-context-building skill first for deep analysis
2. Feed findings into security-architect for threat modeling
3. Use the variant-analysis skill to find similar patterns
4. Use the static-analysis skill for automated confirmation

### Complementary Skills

| Skill | Relationship |
| --- | --- |
| security-architect | Consumes context reports for threat modeling |
| variant-analysis | Finds pattern variants across the codebase |
| static-analysis | Automated confirmation of manual findings |
| differential-review | Reviews fixes for completeness |
| code-analyzer | Provides complexity metrics for prioritization |
## Anti-Patterns

| Anti-Pattern | Why It Fails | Correct Approach |
| --- | --- | --- |
| Skipping to conclusions from function names | Names describe intent, not behavior; leads to false findings | Read the code line by line before forming conclusions |
| Trusting comments without reading code | Comments are often wrong, stale, or misleading | Treat comments as hypotheses to verify against actual code |
| Skipping error paths in analysis | Security bugs often live in error handlers, not happy paths | Explicitly trace all error branches with equal rigor |
| Analyzing functions before mapping call flows | Misses cross-function trust assumptions and data flow | Map the module/call graph in Phase 1 before deep analysis |
| Leaving assumptions untracked | Unverified assumptions silently become false findings | Mark every assumption [UNVERIFIED] until confirmed |
## Memory Protocol

- Before starting: read existing audit context from `.claude/context/reports/backend/` for prior analysis of the same codebase area.
- During analysis: write incremental findings to the context report file as you discover them; do not wait until the end.
- After completion: record key findings and methodology notes to `.claude/context/memory/learnings.md` for future audit sessions.