RAG Research Skill

Use this skill when users ask about RAG (Retrieval-Augmented Generation), semantic search, document indexing, embeddings, vector databases, or chunking strategies. This skill provides best practices for working with the rag-research plugin and optimizing document retrieval.

When to Use

Trigger this skill when users:

•Ask about RAG, embeddings, or semantic search concepts
•Want to optimize their document indexing strategy
•Need help with chunking or retrieval quality
•Ask "how does rag-research work?" or "how to improve search results?"
•Troubleshoot poor search results or missing information

Core Concepts

Document Indexing Pipeline

•Load Document: Extract text from PDF, Markdown, or Text files
•Chunk Text: Split into overlapping segments (default: 512 chars, 50 overlap)
•Generate Embeddings: Convert chunks to vectors using FastEmbed (BAAI/bge-small-en-v1.5)
•Store in Qdrant: Persist vectors with metadata for retrieval

Embedding Models

The plugin uses FastEmbed with ONNX Runtime for efficient CPU inference:

Model	Dimensions	Speed	Quality	Use Case
`BAAI/bge-small-en-v1.5`	384	Fast	Good	Default, general use
`BAAI/bge-base-en-v1.5`	768	Medium	Better	Higher accuracy needs
`BAAI/bge-large-en-v1.5`	1024	Slow	Best	Maximum quality

Chunking Strategies

Chunk size affects retrieval quality:

•Smaller chunks (256-512): More precise, may lose context
•Larger chunks (1024+): More context, may dilute relevance
•Overlap (10-20%): Prevents information loss at boundaries

Recommend: Start with defaults (512/50), adjust based on results.

Search Quality Tips

•Use specific queries: "Mistral OCR API configuration" > "OCR"
•Check coverage: Run /rag-research:list to verify documents are indexed
•Increase limit: Use --limit 20 for comprehensive research
•Review scores: Scores > 0.7 are highly relevant, < 0.5 may be tangential

Troubleshooting

Poor Search Results

•Check if document is indexed: /rag-research:list --filter "keyword"
•Re-index with different chunking: Adjust CHUNK_SIZE in .env
•Use more specific queries: Add domain-specific terms
•Verify embeddings: Check model compatibility

PDF Extraction Issues

•Enable Mistral OCR: Set MISTRAL_API_KEY in .env for scanned PDFs
•Fallback to pypdf: Use --no-ocr flag for text-based PDFs
•Check file permissions: Ensure PDF is readable

Database Issues

•Reset database: rm -rf ~/.rag-research and re-index
•Check disk space: Qdrant needs space for vectors
•Verify installation: uv run rag-research stats

Configuration Reference

See references/configuration.md for detailed settings documentation.

Examples

See references/examples.md for common usage patterns.