Running in CI

First Steps — Read Context

When triggered by a comment or issue, read the full context before responding. The prompt provides a URL — extract the PR/issue number from it.

For PRs:

gh pr view <number> --json title,body,comments,reviews,state,statusCheckRollup gh pr diff <number> gh pr checks <number>

For issues:

gh issue view <number> --json title,body,comments,state

Read the triggering comment, the PR/issue description, the diff (for PRs), and recent comments to understand the full conversation before taking action.

Security

NEVER run commands that could expose secrets (env , printenv , set , export , cat /echo on config files containing credentials). NEVER include environment variables, API keys, tokens, or credentials in responses or comments.

PR Creation

When asked to create a PR, use gh pr create directly.

Before creating a branch or PR, check for existing work:

gh pr list --state open --json number,title,headRefName --jq '.[] | "#(.number) [(.headRefName)]: (.title)"' git branch -r --list 'origin/fix/*'

If an existing PR addresses the same problem, work on that PR instead.

Pushing to PR Branches

Always use git push without specifying a remote — gh pr checkout configures tracking to the correct remote, including for fork PRs. Specifying origin

explicitly can push to the wrong place.

If pushing fails (fork PR with edits disabled), fall back to posting code snippets in a comment. Don't reference commit SHAs from temporary branches — post code inline.

CI Monitoring

After pushing, wait for CI before reporting completion.

Use run_in_background: true for the polling loop so it does not block the session. When the background task completes you will be notified — check the result and take any follow-up action (dismiss approval, post analysis) at that point.

Run with Bash tool's run_in_background: true

for i in $(seq 1 10); do sleep 60 if ! gh pr checks <number> --required 2>&1 | grep -q 'pending|queued|in_progress'; then gh pr checks <number> --required exit 0 fi done echo "CI still running after 10 minutes" exit 1

Poll gh pr checks <number> --required every 60 seconds until all required checks complete (up to ~10 minutes). Ignore non-required checks (benchmarks).
If a required check fails, diagnose with gh run view <run-id> --log-failed , fix, commit, push, repeat.
After required checks pass, also poll codecov/patch via gh pr checks <number> (without --required ) — this repo treats it as mandatory despite being marked non-required. Fix coverage gaps if it fails.
Report completion only after all required checks and codecov/patch pass.

Never report "done" before CI passes — CI runs on Linux, Windows, and macOS. Avoid gh run watch and gh pr checks --watch — both can hang indefinitely.

Before dismissing local test failures as "pre-existing", check main branch CI:

gh api "repos/{owner}/{repo}/actions/runs?branch=main&status=completed&per_page=3"
--jq '.workflow_runs[] | {conclusion, created_at: .created_at}'

If you cannot verify, say "I haven't confirmed whether these failures are pre-existing."

Replying to Comments

Reply in context rather than creating new top-level comments:

Inline review comments (#discussion_r ): Reply in the review thread:

cat > /tmp/reply.md << 'EOF' Your response here EOF gh api repos/{owner}/{repo}/pulls/{number}/comments/{comment_id}/replies
-F body=@/tmp/reply.md

Conversation comments (#issuecomment- ): Post a regular comment (GitHub doesn't support threading).

Comment Formatting

Keep comments concise. Put supporting detail inside <details> tags — the reader should get the gist without expanding. Don't collapse content that is the answer (e.g., a requested analysis).

<details><summary>Detailed findings (6 files)</summary>

...details here...

</details>

Always use markdown links for files, issues, PRs, and docs. Prefer permalinks (commit SHA URLs) over branch-based links for line references — line numbers shift and blob/main/...#L42 links go stale.

Files: link to GitHub (blob/main/... for file-level, blob/<sha>/...#L42 for lines)
Issues/PRs: #123 shorthand
External: text format

Don't add job links or footers — claude-code-action adds these automatically.

Shell Quoting

Shell expansion corrupts $ and ! in arguments (bash history expansion mangles ! in double-quoted strings). Always use a temp file for comment bodies and shell-sensitive arguments:

Comments — ALWAYS use a file

cat > /tmp/comment.md << 'EOF' Fixed — the format! macro needed its arguments on separate lines. EOF gh pr comment 1286 -F /tmp/comment.md

Review replies — ALWAYS use a file

cat > /tmp/reply.md << 'EOF' Good catch! Updated to use assert_eq! instead. EOF gh api repos/{owner}/{repo}/pulls/{number}/comments/{id}/replies
-F body=@/tmp/reply.md

GraphQL — write query to a file

cat > /tmp/query.graphql << 'GRAPHQL' query($owner: String!, $repo: String!) { ... } GRAPHQL gh api graphql -F query=@/tmp/query.graphql -f owner="$OWNER"

jq with ! — use a file

cat > /tmp/jq_filter << 'EOF' select(.status != "COMPLETED") EOF gh api ... --jq "$(cat /tmp/jq_filter)"

Use << 'EOF' (single-quoted delimiter) to prevent expansion. For file-based bodies: gh pr comment uses -F /tmp/file (path directly), while gh api

uses -F body=@/tmp/file (field assignment with @ prefix).

Keeping PR Titles and Descriptions Current

When revising code after review feedback, update the title and description if the approach changed:

gh api repos/{owner}/{repo}/pulls/{number} -X PATCH
-f title="new title" -F body=@/tmp/updated-body.md

Atomic PRs

Split unrelated changes into separate PRs — one concern per PR. If one change could be reverted without affecting the other, they belong in separate PRs.

Investigating Other CI Runs

The primary evidence for diagnosing bot behavior is the session log artifact — not console output (show_full_output defaults to false ).

gh run download <run-id> -n claude-session-logs -D /tmp/session-logs-<run-id>

The artifact contains JSONL files (path like -home-runner-work-worktrunk-worktrunk/<session-id>.jsonl ). Each line has a type field (user , assistant , system ).

Skills loaded

jq -r 'select(.type == "assistant") | .message.content[]? | select(.type == "tool_use" and .name == "Skill") | .input.skill' <FILE>.jsonl

Tool calls

Assistant reasoning

jq -r 'select(.type == "assistant") | .message.content[]? | select(.type == "text") | .text' <FILE>.jsonl

Find the right run among multiple workflows:

gh api 'repos/{owner}/{repo}/actions/runs?per_page=30'
--jq '.workflow_runs[] | select(.name | startswith("claude-")) | {id, name, event, head_branch, created_at, conclusion}'

Check for artifacts before downloading:

gh api repos/{owner}/{repo}/actions/runs/<run-id>/artifacts
--jq '.artifacts[] | {name, size_in_bytes}'

Review-response runs triggered by pull_request_review or pull_request_review_comment events sometimes produce no artifact when the session is very short.

Grounded Analysis

CI runs are not interactive — every claim must be grounded in evidence. The user can't ask follow-up questions; treat every response as your final answer.

Read logs, code, and API data before drawing conclusions. Show evidence: cite log lines, file paths, commit SHAs. Trace causation — if two things co-occur, find the mechanism rather than saying "this may be related." Never claim a failure is "pre-existing" without checking main branch CI history. Distinguish what you verified from what you inferred.

Tone

Raise observations, don't assign work. Never create checklists or task lists for the PR author.

PR Review Comments

For review comments on specific lines ([Comment on path:line] ), read that file and examine the code at that line before answering.

running-in-ci

Safety Notice

Copy this and send it to your AI assistant to learn

Run with Bash tool's run_in_background: true

Comments — ALWAYS use a file

Review replies — ALWAYS use a file

GraphQL — write query to a file

jq with ! — use a file

Skills loaded

Tool calls

Assistant reasoning

Source Transparency

Related Skills

worktrunk

release

writing-user-outputs