rusty-page-indexer

High-performance semantic indexing and retrieval of local PDF and Markdown files with multi-repo support.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "rusty-page-indexer" with this command: npx skills add algiras/rusty-pageindex/algiras-rusty-pageindex-rusty-page-indexer

Rusty Page Indexer Skill

This skill allows agents to index and query local documentation using rusty-page-indexer. Unlike standard vector databases, it focuses on Structural Context, preserving the document's hierarchical tree for more logical retrieval.

Why Structure Matters

Standard RAG often slices documents into arbitrary chunks, losing context. rusty-page-indexer maintains a Folder → File → Section hierarchy.

  • Logical Retrieval: Instead of just matching a sentence, you match a logical section (e.g., an entire API endpoint description).
  • Project Awareness: The LLM understands where a file lives (e.g., src/auth) giving it contextual cues about the code's purpose.

Key Capabilities

  • Multi-repo indexing: Index multiple projects independently
  • Cross-repo search: Query across all indexed repositories
  • Parallel processing: Fast indexing with Rayon
  • Unified tree structure: Folder -> File -> Section hierarchy
  • Model flexibility: Works with OpenAI, Ollama, and OpenAI-compatible APIs

Commands

1. Install Tool

Install the rusty-page-indexer binary (Linux/macOS).

curl -fsSL https://raw.githubusercontent.com/Algiras/rusty-pageindex/main/install.sh | bash

2. Configure Authentication

OpenAI (remote):

rusty-page-indexer auth --api-key "sk-..." --model "gpt-4o-mini"

Ollama (local):

rusty-page-indexer auth --api-key "ollama" --api-base "http://localhost:11434/v1" --model "llama3.2"

Other OpenAI-compatible APIs:

# Groq
rusty-page-indexer auth --api-key "your-key" --api-base "https://api.groq.com/openai/v1" --model "llama3-70b-8192"

# Together AI
rusty-page-indexer auth --api-key "your-key" --api-base "https://api.together.xyz/v1" --model "meta-llama/Llama-3-70b-chat-hf"

3. Index Documents

Recursively index a directory of documents (.md, .pdf).

# Fast index (no LLM calls, instant)
rusty-page-indexer index <path>

# With LLM-generated summaries (higher quality, slower)
rusty-page-indexer index <path> --enrich

# Two-Phase Strategy (Recommended)
# 1. Index the whole folder quickly for structure
rusty-page-indexer index ./projects
# 2. Enrich only the critical "Holy Grail" docs (specs/manuals)
rusty-page-indexer index ./projects/docs/architecture_spec.md --enrich

[!TIP] Use --enrich sparingly on large folders. Index generally first, then enrich specific key files for deep reasoning.


### 4. List Indexed Repositories
View all indexed repositories with their sizes and paths.

```bash
rusty-page-indexer list

5. Query Index

Perform semantic search over indexed documents.

# Search all repositories
rusty-page-indexer query "How does authentication work"

# Path-Scoped Query (Search only specific component/folder)
# Useful for multi-repo work or filtering to 'backend' only.
rusty-page-indexer query "auth" --path backend

# Override model for this query (using argument separator)
rusty-page-indexer query "Explain API" -- --model gpt-4o

6. Clean Up Indices

Remove indexed repositories.

# Remove specific index
rusty-page-indexer clean <repo-name>

# Remove all indices
rusty-page-indexer clean --all

7. Check Status

Verify installation and configuration.

rusty-page-indexer info

Model Compatibility

OpenAI Models (Remote)

ModelCostRecommended For
gpt-4o-mini$General use (fast & cheap)
gpt-4o$$$Complex queries
gpt-4-turbo$$Large documents
gpt-3.5-turbocentsBudget option

Local Models (Ollama)

ModelSizeNotes
llama3.23BRecommended for local use
gemma34BHighly efficient reasoning model ⭐
mistral:7b7BFast and capable
qwen2.5:7b7BReliable

Troubleshooting

The tool provides detailed error messages:

ErrorMeaningFix
Model not foundInvalid model nameUse rusty-page-indexer config --model gpt-4o-mini
Invalid API keyWrong or expired keyRe-run auth with valid key
Quota exceededOut of creditsAdd billing at platform.openai.com or use Ollama
Rate limitToo many requestsWait 30-60s and retry
Connection refusedOllama not runningRun ollama serve
Invalid childrenCorrupted stateNuke & Pave: rm -rf ~/.rusty-page-indexer/manifest.json ~/.rusty-page-indexer/indices && rusty-page-indexer info

Typical Workflow

# 1. Install (if needed)
curl -fsSL https://raw.githubusercontent.com/Algiras/rusty-pageindex/main/install.sh | bash

# 2. Configure for Ollama (free, local)
rusty-page-indexer auth --api-key "ollama" --api-base "http://localhost:11434/v1" --model "llama3.2"

# 3. Index your project
rusty-page-indexer index ./docs --enrich

# 4. Search
rusty-page-indexer query "how does authentication work"

# 5. View indexed repos
rusty-page-indexer list

Storage Location

All data is stored in ~/.rusty-page-indexer/:

  • config.toml - API credentials and default model
  • manifest.json - Index registry
  • indices/ - Tree index files

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

rknowledge

No summary provided by upstream source.

Repository SourceNeeds Review
General

image-gen

Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate image", "配图", "create picture", "draw", "visualize", "generate an image".

Archived SourceRecently Updated
General

explainer

Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video", "explain this as a video", "tutorial video", "introduce X (video)", "解释一下XX(视频形式)".

Archived SourceRecently Updated
General

asr

Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".

Archived SourceRecently Updated