# seq

## Overview

Mine `~/.codex/sessions/` JSONL and `~/.codex/memories/` files quickly and consistently with the Zig `seq` CLI. Explicit `$seq` requests and artifact-forensics questions are `seq`-first. Focus on skill usage, format compliance, token counts, tool-call/tooling forensics, memory-file mining, and session-backed proof.
## Trigger Cues

- Explicit `$seq` asks.
- Questions that ask to verify prior session output using artifacts ("use $seq to find it" / "what did that session actually say").
- Artifact-forensics questions about provenance, trace proof, or "show me the artifact that proves it".
- Questions about Codex memory artifacts or provenance ("what's in MEMORY.md", "which rollout produced this memory", "is this memory stale", "how do memory_summary.md and rollout_summaries relate").
- Orchestration ledger forensics from session traces (Orchestration Ledger, `spawn_agents_on_csv`, wave CSVs, concurrency counts).
- Concurrency math validation (`max_concurrency`, effective fanout, occurrences of peak parallelism, planned rows vs actual parallelism).
## Local-First Routing Ladder

- Stay in `seq` first for explicit `$seq` asks and artifact-forensics work; do not jump to generic shell search unless `seq` cannot cover the need.
- Preferred entrypoint: `seq artifact-search`.
- Use this ladder:
  - `artifact-search`.
  - Specialized follow-ups that already match the artifact shape:
    - `find-session` for prompt-to-session lookup.
    - `session-prompts` for "what did that session actually say" and transcript proof.
    - `session-tooling` for shell/tool summaries.
    - `orchestration-concurrency` for ledger/concurrency proof.
    - `query --spec ...` against `memory_files` for memory artifact inventory/routing.
    - `opencode-prompts` / `opencode-events` only when the literal gate is satisfied.
  - `query-diagnose` when a `seq query` run needs lifecycle debugging or deterministic next actions.
  - Generic `seq query` only when no specialized command covers the question.
- Treat `query-diagnose` as a diagnostic tool for `seq query`, not the default artifact-discovery entrypoint.
## Memory Artifact Model

- Treat `~/.codex/memories` as a file-backed memory workspace, not an opaque store.
- Use root artifacts deliberately:
  - `memory_summary.md`: compact routing/index layer. Use it first when the question is broad or you need to decide where to dig.
  - `MEMORY.md`: durable handbook/registry. Use it first when the question is about reusable guidance, prior decisions, or task-family history.
  - `raw_memories.md`: merged stage-1 raw memories. Use it for recency and inventory, especially when tracing newly added or removed memory inputs.
- Treat `rollout_summaries/*.md` as per-rollout markdown summaries, not JSONL logs.
  - Expect fields like `thread_id`, `updated_at`, `cwd`, `rollout_path`, and sometimes `git_branch`.
  - Use `rollout_path` to jump to the original session JSONL under `~/.codex/sessions` when you need raw evidence.
- Treat `skills/` under the memory root as optional memory-derived helper assets.
## Memory Mining Workflow

- Use `seq` first for inventory, routing, and timestamp/category analysis; do not pretend it replaces reading the target markdown files.
- Start broad, then go deeper:
  - Inventory `memory_files` to see what categories and files exist.
  - If the question is navigational or "what do we know?", route through `memory_summary.md`.
  - If the question is durable/procedural, route through `MEMORY.md`.
  - If you need provenance for one memory block, inspect the relevant `rollout_summaries/*.md` file.
  - If you need raw session evidence, follow `rollout_path` into `~/.codex/sessions/...jsonl`; use `seq` session/orchestration commands only when they accept the handle you actually have, otherwise inspect the file directly.
- Prefer `MEMORY.md` / `memory_summary.md` over `raw_memories.md` when both cover the topic; `raw_memories.md` is a lower-level consolidation input, not the highest-level answer.
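The provenance hop in the workflow above can be sketched in shell. The `rollout_path:` line format below is an assumption for illustration only; check the real `rollout_summaries/*.md` files for the exact field shape before relying on it.

```sh
# Hypothetical rollout-summary line; real files live under
# ~/.codex/memories/rollout_summaries/ and may format this field differently.
summary_line='rollout_path: /Users/tk/.codex/sessions/2026/03/rollout-abc.jsonl'

# Strip the field label to recover the session JSONL path for raw evidence.
rollout_path="${summary_line#rollout_path: }"
echo "$rollout_path"
```

The recovered path is then the handle you pass to file-level inspection or to `seq` commands that accept a direct `--path`.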
## Opencode Explicitness Gate (Hard)

- Only run opencode research when the request text includes the literal word "opencode".
- Treat these as opencode research and block them without the literal cue:
  - `seq opencode-prompts`
  - `seq opencode-events`
  - `seq query` specs targeting the `opencode_prompts` or `opencode_events` datasets
- For mixed searches without a literal opencode cue, hard-skip the opencode branch and continue with non-opencode `seq` datasets only.
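A minimal sketch of the gate as a shell guard; the `request` variable is illustrative, standing in for however the surrounding tooling carries the request text.

```sh
# Example request text without the literal cue.
request="find my recent prompts across sessions"

# Hard gate: only the literal word "opencode" unlocks the opencode branch.
case "$request" in
  *opencode*) branch="opencode-allowed" ;;
  *)          branch="opencode-skipped" ;;
esac
echo "$branch"
```

A substring match is deliberately strict here: paraphrases like "open code" or "prompt history" do not satisfy the gate.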
## Zig CLI Iteration Repos

When iterating on the Zig-backed `seq` helper CLI path, use these two repos:

- skills-zig (`/Users/tk/workspace/tk/skills-zig`): source for the `seq` Zig binary, build/test wiring, and release tags.
- homebrew-tap (`/Users/tk/workspace/tk/homebrew-tap`): Homebrew formula updates/checksum bumps for released `seq` binaries.
## Quick Start

```sh
run_seq() {
  install_seq_direct() {
    local repo="${SKILLS_ZIG_REPO:-$HOME/workspace/tk/skills-zig}"
    if ! command -v zig >/dev/null 2>&1; then
      echo "zig not found. Install Zig from https://ziglang.org/download/ and retry." >&2
      return 1
    fi
    if [ ! -d "$repo" ]; then
      echo "skills-zig repo not found at $repo." >&2
      echo "clone it with: git clone https://github.com/tkersey/skills-zig \"$repo\"" >&2
      return 1
    fi
    if ! (cd "$repo" && zig build -Doptimize=ReleaseSafe); then
      echo "direct Zig build failed in $repo." >&2
      return 1
    fi
    if [ ! -x "$repo/zig-out/bin/seq" ]; then
      echo "direct Zig build did not produce $repo/zig-out/bin/seq." >&2
      return 1
    fi
    mkdir -p "$HOME/.local/bin"
    install -m 0755 "$repo/zig-out/bin/seq" "$HOME/.local/bin/seq"
  }

  local os="$(uname -s)"
  if command -v seq >/dev/null 2>&1 && seq --help 2>&1 | grep -q "skills-rank"; then
    seq "$@"
    return
  fi

  if [ "$os" = "Darwin" ]; then
    if ! command -v brew >/dev/null 2>&1; then
      echo "homebrew is required on macOS: https://brew.sh/" >&2
      return 1
    fi
    if ! brew install tkersey/tap/seq; then
      echo "brew install tkersey/tap/seq failed." >&2
      return 1
    fi
  elif ! (command -v seq >/dev/null 2>&1 && seq --help 2>&1 | grep -q "skills-rank"); then
    if ! install_seq_direct; then
      return 1
    fi
  fi

  if command -v seq >/dev/null 2>&1 && seq --help 2>&1 | grep -q "skills-rank"; then
    seq "$@"
    return
  fi

  echo "no compatible seq binary found after install attempt." >&2
  if [ "$os" = "Darwin" ]; then
    echo "expected install path: brew install tkersey/tap/seq" >&2
  else
    echo "expected direct path: SKILLS_ZIG_REPO=<skills-zig-path> zig build -Doptimize=ReleaseSafe" >&2
  fi
  return 1
}
```

```sh
run_seq datasets --root ~/.codex/sessions
```

Commands below use `seq` directly for brevity. Use `run_seq` when you want brew-aware bootstrap with binary-only execution.
## Query (JSON Spec)

Run flexible mining via `query` with a small JSON spec (inline or `@spec.json`).

List datasets:

```sh
seq datasets --root ~/.codex/sessions
```

Show dataset fields/params:

```sh
seq dataset-schema --dataset token_deltas --root ~/.codex/sessions
```

Examples:

Rank skill usage:

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"skill_mentions","group_by":["skill"],"metrics":[{"op":"count","as":"count"}],"sort":["-count"],"limit":20,"format":"table"}'
```

Daily token totals (from `token_count` events):

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"token_deltas","group_by":["day"],"metrics":[{"op":"sum","field":"delta_total_tokens","as":"tokens"}],"sort":["day"],"format":"table"}'
```

Top sessions by total tokens:

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"token_sessions","select":["path","total_total_tokens"],"sort":["-total_total_tokens"],"limit":10,"format":"table"}'
```

Rank tool calls:

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"tool_calls","group_by":["tool"],"metrics":[{"op":"count","as":"count"}],"sort":["-count"],"limit":20,"format":"table"}'
```

Inspect lifecycle-enriched tool invocations:

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"tool_invocations","select":["timestamp","tool_name","command_text","workdir","wall_time_ms","exit_code","running_state"],"sort":["-timestamp"],"limit":20,"format":"table"}'
```

Flatten tool-call argument leaves:

```sh
seq query --root ~/.codex/sessions --spec \
  '{"dataset":"tool_call_args","where":[{"field":"tool_name","op":"eq","value":"exec_command"}],"select":["timestamp","tool_name","arg_path","value_kind","value_text","value_number","value_bool"],"sort":["-timestamp"],"limit":20,"format":"table"}'
```

Memory files by category:

```sh
seq query --spec \
  '{"dataset":"memory_files","group_by":["category"],"metrics":[{"op":"count","as":"count"}],"sort":["-count"],"format":"table"}'
```

Opencode prompts with source override:

```sh
seq query --spec \
  '{"dataset":"opencode_prompts","params":{"source":"db","opencode_db_path":"~/.local/share/opencode/opencode.db"},"where":[{"field":"part_types","op":"contains","value":"file"}],"select":["session_slug","prompt_text","part_types"],"sort":["-time_created_epoch_ms"],"format":"jsonl"}'
```
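Inline specs get unwieldy as they grow; the `@spec.json` form accepts the same JSON from a file. A sketch of that round trip (the `/tmp/skills-top.json` path is arbitrary, and the final `seq` invocation is shown as a comment because it needs the installed binary):

```sh
# Write a spec once, then reuse it with the @file form.
cat > /tmp/skills-top.json <<'EOF'
{"dataset":"skill_mentions","group_by":["skill"],"metrics":[{"op":"count","as":"count"}],"sort":["-count"],"limit":5,"format":"table"}
EOF

# Then run it:
#   seq query --root ~/.codex/sessions --spec @/tmp/skills-top.json

# Sanity-check what was written.
cat /tmp/skills-top.json
```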
## Ready-made specs

Prebuilt specs live in `specs/`.

```sh
seq query --root ~/.codex/sessions --spec @specs/skills-rank.json
seq query --root ~/.codex/sessions --spec @specs/tools-rank.json
seq query --root ~/.codex/sessions --spec @specs/tokens-top-days.json
seq query --root ~/.codex/sessions --spec @specs/tokens-top-sessions.json
seq query --root ~/.codex/sessions --spec @specs/tk-trend-week.json
```
## Spec reference

Top-level keys:

- `dataset` (string, required)
- `params` (object, optional; dataset-specific)
- `where` (list of predicates, optional)
- `group_by` (list of field names, optional)
- `metrics` (list of aggregations, optional; defaults to `count` when grouping)
- `select` (list of field names, optional; for non-grouped queries)
- `sort` (list of field names; prefix with `-` for descending)
- `limit` (int, optional)
- `format` (`table` | `json` | `csv` | `jsonl`; default: `table` when grouped, else `jsonl`)

Where predicate shape:

```json
{"field":"day","op":"eq","value":"2026-02-05"}
```

Supported `where.op`:

- `eq`, `neq`
- `gt`, `gte`, `lt`, `lte` (numeric-ish compare)
- `in`, `nin` (value is a JSON list)
- `contains` (substring)
- `contains_any` (value is a JSON list of substrings; optional `case_insensitive: true`)
- `regex` (regex-like matching with `^`, `$`, and alternation `|`; optional `case_insensitive: true`)
- `regex_any` (value is a JSON list of regex-like strings; OR semantics)
- `exists`, `not_exists`

Metrics shape (grouped queries):

```json
{"op":"sum","field":"delta_total_tokens","as":"tokens"}
```

Supported `metrics.op`:

- `count`
- `sum`, `min`, `max`, `avg`
- `count_distinct`
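Putting the reference together, here is a composite spec that uses only keys and ops documented above; the dataset and field names come from the earlier `token_deltas` examples, and the specific `day` values are illustrative.

```json
{
  "dataset": "token_deltas",
  "where": [{"field": "day", "op": "in", "value": ["2026-02-04", "2026-02-05"]}],
  "group_by": ["day"],
  "metrics": [{"op": "sum", "field": "delta_total_tokens", "as": "tokens"}],
  "sort": ["-tokens"],
  "format": "table"
}
```

`in` is used rather than `gte`/`lt` range bounds because the comparison ops are documented as numeric-ish; exact-match membership on `day` strings is the safe variant.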
## Tasks

- Rank skill usage

```sh
seq skills-rank --root ~/.codex/sessions
```

Common options:

- `--format json|csv`
- `--max 20`
- `--since 2026-01-01T00:00:00Z`
- `--until 2026-03-01T00:00:00Z`

- Trend a skill over time

```sh
seq skill-trend --root ~/.codex/sessions --skill tk --bucket week
```

- Report on a specific skill

```sh
seq skill-report --root ~/.codex/sessions --skill tk
```

Another example:

```sh
seq skill-report --root ~/.codex/sessions --skill fix \
  --since 2026-03-01T00:00:00Z
```

- Role breakdown by skill

```sh
seq role-breakdown --root ~/.codex/sessions --format table
```

- Audit section compliance

```sh
seq section-audit --root ~/.codex/sessions \
  --sections "Contract,Invariants,Creative Frame"
```

- Export occurrences

```sh
seq occurrence-export --root ~/.codex/sessions --format jsonl --output occurrences.jsonl
```

- Bundle a report

```sh
seq report-bundle --root ~/.codex/sessions --top 20
```

- Token usage summary

```sh
seq token-usage --root ~/.codex/sessions --top 10
```

- Reproducible perf harness

Run stable workloads with fixed warmup/sample counts and optional baseline comparison.

```sh
zig build bench -Doptimize=ReleaseFast -- --config perf/frozen/workload_config.json
```
- Find sessions by prompt text

```sh
seq find-session --root ~/.codex/sessions --prompt "adapter=auto" --limit 20
```

Constrain to a session window:

```sh
seq find-session --root ~/.codex/sessions \
  --prompt "adapter=auto" \
  --since 2026-03-01T00:00:00Z \
  --until 2026-03-05T00:00:00Z \
  --limit 20
```
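`find-session` returns `session_id` and `path`, and the typical follow-up feeds that id into `session-prompts`. A dependency-light sketch of the hand-off; the output row below is hypothetical, standing in for real `find-session` output, and the extraction assumes the id appears as a JSON `"session_id"` string field.

```sh
# Hypothetical find-session output row (shape assumed for illustration).
row='{"session_id":"sess-1234","path":"/Users/tk/.codex/sessions/rollout.jsonl"}'

# Extract session_id without jq, via shell parameter expansion.
sid="${row#*\"session_id\":\"}"
sid="${sid%%\"*}"
echo "$sid"

# Then:
#   seq session-prompts --root ~/.codex/sessions --session-id "$sid" \
#     --roles user,assistant --strip-skill-blocks --limit 100 --format jsonl
```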
- List prompts/messages for one session

```sh
seq session-prompts --root ~/.codex/sessions --session-id <session_id> \
  --roles user,assistant --strip-skill-blocks --limit 100 --format jsonl
```

Constrain to a time window within that session:

```sh
seq session-prompts --root ~/.codex/sessions --session-id <session_id> \
  --roles user,assistant --since 2026-03-01T00:00:00Z --until 2026-03-05T00:00:00Z \
  --strip-skill-blocks --limit 100 --format jsonl
```

Current session from the active Codex thread:

```sh
seq session-prompts --root ~/.codex/sessions --current \
  --roles user,assistant --strip-skill-blocks --limit 100 --format jsonl
```

- Session-backed tooling diagnostics

Summarize tool/shell activity for one session:

```sh
seq session-tooling --root ~/.codex/sessions --session-id <session_id> \
  --since 2026-03-01T00:00:00Z --until 2026-03-05T00:00:00Z \
  --summary --group-by executable --format table
```

Inspect `seq query` lifecycle health for one session:

```sh
seq query-diagnose --root ~/.codex/sessions --session-id <session_id> \
  --since 2026-03-01T00:00:00Z --until 2026-03-05T00:00:00Z \
  --threshold-ms 10000 --next-actions --format json
```
- Cue vs invoked discovery-skill rate

```sh
seq routing-gap --root ~/.codex/sessions \
  --cue-spec @cue-spec.json \
  --discovery-skills grill-me,prove-it,complexity-mitigator,invariant-ace,tk \
  --format table
```

- Orchestration concurrency summary

```sh
seq orchestration-concurrency --root ~/.codex/sessions --session-id <session_id> --format table
```

Or target one JSONL directly:

```sh
seq orchestration-concurrency --path /absolute/path/to/rollout.jsonl --format json
```

Assert the floor gate for non-trivial runs:

```sh
seq orchestration-concurrency --path /absolute/path/to/rollout.jsonl \
  --floor-threshold 3 --fail-on-floor --format table
```

Assert mesh-truth for claim eligibility:

```sh
seq orchestration-concurrency --path /absolute/path/to/rollout.jsonl \
  --fail-on-mesh-truth --format table
```
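The effective-fanout math this command reports (`min(max_concurrency, csv_rows)`, per the notes below) can be checked by hand; the values here are illustrative.

```sh
# Illustrative ledger values: configured cap and planned CSV rows.
max_concurrency=8
csv_rows=5

# Effective fanout is capped by both the configured limit and the planned rows.
effective=$(( max_concurrency < csv_rows ? max_concurrency : csv_rows ))
echo "$effective"
```

Comparing this hand-computed number against the command's `effective_peak` output is a quick cross-check when validating concurrency claims.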
- Query Opencode prompt history

Only run this section when the user request explicitly includes "opencode".

```sh
seq opencode-prompts --limit 20 --format jsonl
```

Filter and project:

```sh
seq opencode-prompts \
  --source db \
  --contains "grill me" \
  --mode normal \
  --part-type file \
  --select session_slug,message_id,prompt_text,part_types \
  --sort -time_created_epoch_ms \
  --format table
```

Hybrid with `--spec` (CLI convenience flags override conflicting spec values; `$plan` is single-quoted so the shell passes it literally):

```sh
seq opencode-prompts --spec @opencode-spec.json --contains '$plan' --limit 10
```

- Query Opencode message/part events

Only run this section when the user request explicitly includes "opencode".

```sh
seq opencode-events --limit 50 --format jsonl
```

Filter event rows:

```sh
seq opencode-events \
  --source db \
  --role assistant \
  --tool shell \
  --status completed \
  --select session_slug,message_id,event_index,part_type,tool_name,tool_status,text \
  --sort -time_created_epoch_ms \
  --format table
```

- Inventory Codex memory artifacts

```sh
seq query --spec \
  '{"dataset":"memory_files","group_by":["category"],"metrics":[{"op":"count","as":"count"}],"sort":["-count"],"format":"table"}'
```

List root artifacts explicitly:

```sh
seq query --spec \
  '{"dataset":"memory_files","where":[{"field":"category","op":"eq","value":"root"}],"select":["relative_path","size_bytes","modified_at"],"sort":["relative_path"],"format":"table"}'
```

- Find recent rollout summaries behind memory updates

```sh
seq query --spec \
  '{"dataset":"memory_files","where":[{"field":"category","op":"eq","value":"rollout_summaries"}],"select":["relative_path","modified_at","size_bytes"],"sort":["-modified_at"],"limit":20,"format":"table"}'
```

Then open the matching `rollout_summaries/*.md` file and use its `rollout_path` to jump into the original session JSONL when deeper proof is required.
## Notes

- Default root: `~/.codex/sessions`.
- `memory_files` defaults to `~/.codex/memories` and accepts `params.memory_root` and `params.include_preview`.
- `memory_files` exposes `path`, `relative_path`, `name`, `category`, `extension`, `size_bytes`, `modified_at`, and `preview`.
- Current memory-file categories are `root`, `rollout_summaries`, and `skills`; `root` may also include non-canonical files, so do not assume every root file is part of the memory contract.
- `rollout_summaries/*.md` are markdown summaries; the original JSONL evidence lives at the `rollout_path` referenced inside those files.
- `memory_files` is best for inventory and routing; when the answer depends on markdown body content, use `seq` to find the file and then read that specific file directly.
- Opencode datasets (`opencode_prompts`, `opencode_events`) default to `source=auto`, which resolves DB-first (`$HOME/.local/share/opencode/opencode.db`) with JSONL fallback (`$HOME/.local/state/opencode/prompt-history.jsonl`).
- Opencode params: `params.source`, `params.opencode_db_path`, `params.opencode_path`, `params.include_raw`; `opencode_prompts` also supports `params.include_summary_fallback`.
- Skill names are inferred from `${CODEX_HOME:-$HOME/.codex}/skills` by default, and from `${CLAUDE_HOME:-$HOME/.claude}/skills` when needed.
- Runtime bootstrap policy: require the Zig `seq` binary, default to Homebrew install on macOS, and fall back to direct Zig install from skills-zig on non-macOS.
- Add `--output <path>` to write results to a file.
- `query` auto-projects only referenced dataset fields (`where`, `group_by`, `metrics.field`, `select`, and non-grouped `sort`) to reduce scan overhead.
- `query.params` is parsed and routed into dataset collectors for dataset-specific source overrides.
- Session-tooling datasets include `tool_invocations` and `tool_call_args` in addition to `tool_calls`.
- `tool_calls` exposes raw argument/input text plus command/workdir fields: `arguments_text`, `input_text`, `command_text`, `primary_executable`, `workdir`, `parse_error`.
- `tool_invocations` adds lifecycle fields such as `end_timestamp`, `pty_session_id`, `wall_time_ms`, `exit_code`, `running_state`, and `unresolved`.
- `tool_call_args` flattens JSON argument/input leaves with `payload_source`, `arg_path`, `value_kind`, `value_text`, `value_number`, `value_bool`, `is_null`, and `array_index`.
- `find-session` returns `session_id` and `path`; use these to target follow-on query or resume workflows.
- `find-session`, `session-prompts`, `session-tooling`, and `query-diagnose` accept ISO-8601 `--since`/`--until` filters for session-backed narrowing.
- Prefer the routing ladder: `artifact-search`, then specialized follow-ups, then `query-diagnose`, and generic `query` only when needed.
- `session-prompts` defaults to `--roles user`; set `--roles user,assistant` to include both sides of a conversation.
- `session-prompts --current` resolves the current session from `CODEX_THREAD_ID` and fails closed if that env var is unavailable.
- `session-prompts` deduplicates mirrored duplicate rows by default; pass `--no-dedupe-exact` to keep all duplicates.
- Typical flow: run `find-session`, then pass the returned `session_id` into `session-prompts --session-id <id>`.
- `session-tooling` summarizes per-invocation shell/tool behavior and can aggregate via `--summary --group-by executable|command|tool`.
- `query-diagnose` emits per-query diagnostics from rollout JSONL and can suggest deterministic follow-up commands with `--next-actions`.
- `orchestration-concurrency` reports both the configured fanout (`max_concurrency`) and the effective fanout (`min(max_concurrency, csv_rows)`), plus how many times each maximum occurred.
- It also emits `effective_peak`, `spawn_substrate`, `mesh_truth_verdict`, direct-lane counters (`spawn_agent_calls`, `wait_calls`, `close_agent_calls`), `serialized_wait_ratio`, and floor-gate fields (`floor_threshold`, `floor_applicable`, `floor_result`).
- For sessions without `spawn_agents_on_csv`, it returns a structured row with `mesh_truth_verdict=false` and `spawn_substrate=spawn_agent|none` rather than failing.
- Use `--fail-on-mesh-truth` to enforce fail-closed preflight for `$mesh` claims.
## Resources

- `seq` binary: CLI for ranking skills, auditing sections, querying datasets, and summarizing token usage.
- `zig build bench -Doptimize=ReleaseFast -- --config perf/frozen/workload_config.json`: frozen-workload performance runner.