Gemini Smart Search

Use this skill when Gemini should be the search backend, but gateway-level web_search config is too static or too disruptive to change.

Purpose

This skill is a script-backed search workflow, not a gateway tool override.

It exists to provide:

dynamic Gemini model selection
quota-aware fallback
a separate Gemini API key path if desired
structured JSON output
no gateway restart requirement for model changes

Modes

Model routing is split into two layers:

display chain: human-facing preferred model family labels
candidate API ids: the actual model ids to probe, especially for 3.x preview-era models

Current display chains:

cheap
- Prefer gemini-2.5-flash-lite
- Then gemini-3.1-flash-lite
- Then gemini-2.5-flash
balanced
- Prefer gemini-2.5-flash
- Then gemini-3-flash
- Then gemini-2.5-flash-lite
deep
- Prefer gemini-3-flash
- Then gemini-2.5-flash
- Then gemini-3.1-flash-lite

For 3.x models, do not assume the UI label is the raw API id. Probe candidate ids such as preview-suffixed names when needed.

Invocation

Run the Python script or the shell wrapper via exec and request JSON output.

Python is now the canonical entrypoint because it also loads repo-local .env.local when present. The shell wrapper remains a convenience layer.

Primary example (preferred):

python3 skills/gemini-smart-search/scripts/gemini_smart_search.py \
  --query "BoundaryML context engineering" \
  --mode cheap \
  --json

Wrapper example (convenience only):

bash skills/gemini-smart-search/scripts/gemini_smart_search.sh \
  --query "BoundaryML context engineering" \
  --mode cheap \
  --json

python -m gemini_smart_search may work when run from the scripts/ directory, but it is not a supported interface for agents right now. Do not depend on it.

Output contract

Expect JSON with at least:

ok
query
mode
model_used
fallback_chain
display_chain
answer
citations
error
escalation

Notes:

model_used is the actual probed API model id (for example gemini-3-flash-preview), not the human-facing display label.
Citation URLs may initially be Google/Vertex grounding redirect URLs instead of canonical source URLs; treat that as a known current limitation.
With --json, supported runtime paths should return structured JSON on both success and error. Invalid CLI arguments now also return JSON when --json is present.

API key policy

The script should prefer a dedicated key path for this skill, then fall back to the standard Gemini key.

Required key resolution order:

SMART_SEARCH_GEMINI_API_KEY (primary declared env)
GEMINI_API_KEY (compatibility fallback)

If neither key is present, the agent must explicitly ask the human for a Gemini API key before claiming setup is complete.

Do not store the key in tracked repository files. Prefer a repo-local, gitignored file such as .env.local.

See references/config.md.

When to use this skill instead of built-in `web_search`

Use this skill when:

you want Gemini-only search
you want to test or isolate quota pools
you want model routing without touching gateway config
you want predictable JSON output for downstream orchestration

Do not use this skill when:

a normal built-in web_search is sufficient
you need non-Gemini providers
you only need to fetch and read a known URL (web_fetch)
you need logged-in or JS-heavy page interaction (browser)

Fallback policy

Fallback only for errors like:

quota exceeded / 429
model unavailable
transient upstream failure

Do not silently fallback on obvious local/script bugs or invalid arguments.

References

references/config.md — environment variables and design notes
references/qa-test-plan.md — focused QA scope for v1 behavior and release gates
references/qa-results-2026-03-12.md — CLI-oriented QA outcomes from the current release cycle
references/agent-qa-cases.md — adversarial agent-style misuse review
references/model-id-recon.md — verified callable Gemini model IDs and mapping notes
references/escalation-design.md — when to return a GitHub issue URL for human escalation
references/release-checklist.md — artifact release checklist with current completion status
references/development-goals-v0.1.1.md — next small version scope and artifact policy
references/release-notes-v0.1.1.md — release notes for the current artifact
assets/example-output.json — expected response shape
scripts/smoke_test.sh — non-destructive local smoke checks for the scaffold
scripts/prepare_artifact.sh — deterministic clean artifact export helper

Status

Python implementation is now wired for a first real version:

direct Gemini API call path
Google Search grounding enabled
mode-based model routing
Python-side repo-local .env.local loading
fallback across Gemini Flash-Lite / Flash variants for retryable upstream errors
structured JSON output for orchestration

This is still intentionally minimal: it does not yet expose advanced tuning flags, caching, or richer citation post-processing.

Before publishing an artifact, consult references/release-checklist.md, review references/development-goals-v0.1.1.md, run scripts/prepare_artifact.sh, and publish the release note in references/release-notes-v0.1.1.md alongside the artifact.