Gemini Exploration Patterns

🚨 MANDATORY: Invoke gemini-cli-docs First

STOP - Before providing ANY response about Gemini exploration:

INVOKE gemini-cli-docs skill
QUERY for the specific exploration/context topic
BASE all responses EXCLUSIVELY on official documentation loaded

Overview

This skill provides strategic guidance for leveraging Gemini CLI's large context window for codebase exploration. It covers when to delegate exploration to Gemini, which model to use, and how to structure outputs for Claude to consume.

When to Use This Skill

Keywords: explore codebase, analyze architecture, large context, token limit, gemini exploration, codebase analysis, when to use gemini, model selection

Use this skill when:

Deciding whether to explore with Claude or Gemini
Planning a large codebase analysis
Choosing between Flash and Pro models
Structuring exploration output for cross-CLI consumption
Optimizing exploration for cost vs quality

Token Threshold Decision Matrix

Codebase Size Tokens Recommended Agent Rationale

Small <50K Claude native Claude's tools are faster

Medium 50K-500K Gemini Flash Good balance of speed/cost

Large 500K-1M Gemini Flash + chunking Stay within Flash limits

Very Large 1M-2M Gemini Pro Need extended context

Massive

2M Gemini Pro + progressive Multi-pass exploration

Token Estimation

Quick estimation: 1 token ~ 4 characters

chars=$(find . -name ".ts" -o -name ".py" | xargs wc -c | tail -1 | awk '{print $1}') tokens=$((chars / 4)) echo "Estimated tokens: $tokens"

Decision Rule

IF estimated_tokens < 50,000: USE Claude's native Explore agent

ELIF estimated_tokens < 1,000,000: USE Gemini Flash via /gemini-explore

ELIF estimated_tokens < 2,000,000: USE Gemini Pro via /gemini-explore --pro

ELSE: USE Progressive exploration (chunk by module)

Model Selection Guide

Gemini Flash (gemini-2.5-flash)

Context: Large (exact limits set by Google, check current API docs) Cost: Lower Speed: Faster

Best for:

Bulk file analysis
Pattern detection across codebase
Dependency mapping
Initial exploration passes
Log file analysis
Documentation generation

Gemini Pro (gemini-2.5-pro)

Context: Very large (exact limits set by Google, check current API docs) Cost: Higher Speed: Slower

Best for:

Complex architectural reasoning
Security-critical analysis
Nuanced code quality assessment
Very large codebases (>1M tokens)
Tasks requiring deep understanding

Exploration Strategies

Strategy 1: Full Codebase Sweep

Best for: Understanding overall architecture

Collect all source files

find . -type f ( -name ".ts" -o -name ".tsx" )
-not -path "/node_modules/"
-not -path "/.git/"
| xargs cat | gemini "Analyze architecture" --output-format json

Strategy 2: Module-by-Module

Best for: Very large codebases (>2M tokens)

Explore each top-level module separately

for dir in src//; do echo "=== Exploring $dir ===" find "$dir" -name ".ts" | xargs cat | gemini "Analyze this module" --output-format json done

Strategy 3: Entry-Point Focused

Best for: Understanding execution flow

Focus on entry points and their dependencies

cat package.json src/index.ts src/main.ts | gemini "Analyze entry points and startup flow" --output-format json

Strategy 4: Dependency-First

Best for: Understanding relationships

Package manifests + import statements

find . -name "package.json" -o -name "requirements.txt" -o -name "go.mod" | xargs cat grep -r "^import|^from" src/ | head -1000

Strategy 5: Progressive Depth

Best for: Iterative understanding

Pass 1: File tree + READMEs only
Pass 2: Package manifests + configs
Pass 3: Entry points + main modules
Pass 4: Deep dive on specific areas

Output Format Standards

All Gemini exploration outputs should follow this format for Claude consumption:

YAML Frontmatter (Required)

generated-by: gemini-cli model: gemini-2.5-flash timestamp: 2025-11-30T12:00:00Z tokens: 150000 scope: architecture|dependencies|patterns|all

Machine-Readable Summary (Required)

{ "type": "exploration", "scope": "architecture", "tokens_used": 150000, "model": "gemini-2.5-flash", "key_findings": [ "Uses Clean Architecture pattern", "React frontend with Express backend", "PostgreSQL database with Prisma ORM" ], "files_analyzed": 245, "entry_points": ["src/index.ts", "src/server.ts"] }

Human-Readable Content (Required)

Structured markdown with clear sections:

Overview: 2-3 sentence summary
Architecture: Directory structure, patterns
Key Components: Core modules and responsibilities
Dependencies: External and internal
Patterns: Conventions and style
Recommendations: What to read first, areas of concern

Recommendations for Claude (Required)

Specific, actionable guidance:

Recommendations for Claude

Files to Read First

src/index.ts - Main entry point
src/config/index.ts - Configuration patterns
CLAUDE.md - Project conventions

Patterns to Follow

Use dependency injection for services
Follow the existing error handling pattern in src/errors/

Areas of Concern

Complex state management in src/store/ - read carefully
Database migrations in prisma/migrations/ - check before schema changes

File Filtering Patterns

Include Patterns

Source code

-name ".ts" -o -name ".tsx" -o -name ".js" -o -name ".jsx" -name ".py" -o -name ".go" -o -name ".rs" -o -name ".java"

Configuration

-name ".json" -o -name ".yaml" -o -name ".yml" -o -name ".toml"

Documentation

-name ".md" -o -name "README"

Exclude Patterns

-not -path "/node_modules/" -not -path "/.git/" -not -path "/dist/" -not -path "/build/" -not -path "/pycache/" -not -path "/.next/" -not -path "/coverage/" -not -path "/.cache/"

Cost Optimization

Reduce Token Usage

Filter aggressively: Only include relevant file types
Limit file count: Use head -500 for file lists
Truncate large files: Cap individual files at reasonable sizes
Exclude generated code: dist/, build/, vendor/

Batch Efficiently

Bad: Many small calls

for file in *.ts; do gemini "analyze $file"; done

Good: One large call

cat *.ts | gemini "analyze all files"

Related Skills

gemini-delegation-patterns
When to delegate any task to Gemini
gemini-token-optimization
Cost optimization strategies
gemini-cli-execution
CLI invocation patterns
gemini-workspace-bridge
Artifact storage and exchange

Related Commands

/gemini-explore
Execute exploration with standard output
/gemini-plan
Generate implementation plans

Keyword Registry

Topic Keywords

Token limits context window , token limit , large context

Model selection flash vs pro , which model , model routing

Exploration explore codebase , analyze architecture , understand code

Cost reduce tokens , optimize cost , batch calls

Output exploration format , cross-cli artifact , claude readable

Test Scenarios

Scenario 1: Token Threshold Decision

Query: "Should I use Claude or Gemini to explore this codebase?" Expected Behavior:

Skill activates on "explore codebase" or "large context"
Provides token threshold decision matrix Success Criteria: User receives clear guidance based on codebase size

Scenario 2: Model Selection

Query: "Should I use Flash or Pro for codebase analysis?" Expected Behavior:

Skill activates on "flash vs pro" or "which model"
Provides model comparison and use cases Success Criteria: User receives model recommendation with rationale

Scenario 3: Exploration Strategy

Query: "How do I analyze a very large codebase with Gemini?" Expected Behavior:

Skill activates on "very large" or "analyze architecture"
Provides progressive exploration strategy Success Criteria: User receives module-by-module or chunking approach

Version History

v1.1.0 (2025-12-01): Added MANDATORY section, Test Scenarios, Version History
v1.0.0 (2025-11-25): Initial release

gemini-exploration-patterns

Safety Notice

Copy this and send it to your AI assistant to learn