rag-systems

Build RAG systems - embeddings, vector stores, chunking, and retrieval optimization

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "rag-systems" with this command: npx skills add pluginagentmarketplace/custom-plugin-ai-agents/pluginagentmarketplace-custom-plugin-ai-agents-rag-systems

RAG Systems

Build Retrieval-Augmented Generation systems for grounded responses.

When to Use This Skill

Invoke this skill when:

  • Building Q&A over custom documents
  • Implementing semantic search
  • Setting up vector databases
  • Optimizing retrieval quality

Parameter Schema

ParameterTypeRequiredDescriptionDefault
taskstringYesRAG goal-
vector_dbenumNopinecone, weaviate, chroma, pgvectorchroma
embedding_modelstringNoEmbedding modeltext-embedding-3-small
chunk_sizeintNoChunk size in chars1000

Quick Start

from langchain_openai import OpenAIEmbeddings
from langchain_chroma import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

# 1. Split documents
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
chunks = splitter.split_documents(documents)

# 2. Create vector store
embeddings = OpenAIEmbeddings(model="text-embedding-3-small")
vectorstore = Chroma.from_documents(chunks, embeddings)

# 3. Retrieve
docs = vectorstore.similarity_search("query", k=5)

Chunking Strategy

Content TypeSizeOverlapRationale
Technical docs500-800100Preserve code
Legal docs1000-1500200Keep clauses
Q&A/FAQ200-40050Atomic answers

Embedding Costs

ModelCost/1M tokens
text-embedding-3-small$0.02
text-embedding-3-large$0.13
Cohere embed-v3$0.10

Troubleshooting

IssueSolution
Irrelevant resultsImprove chunking, add reranking
Missing contextIncrease k, use parent retriever
HallucinationsAdd "only use context" prompt
Slow retrievalAdd caching, reduce k

Best Practices

  • Always include source attribution
  • Use hybrid search (dense + BM25)
  • Implement reranking for quality
  • Evaluate with RAGAS metrics

Related Skills

  • llm-integration - LLM for generation
  • agent-memory - Memory retrieval
  • ai-agent-basics - Agentic RAG

References

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

agent-memory

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

multi-agent

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

llm-integration

No summary provided by upstream source.

Repository SourceNeeds Review