Skool RAG Pipeline
Goal
Query Skool community content using a RAG (Retrieval-Augmented Generation) pipeline with vector search and reranking.
Scripts
- •
./scripts/skool_rag_prepare.py- Prepare content for indexing - •
./scripts/skool_rag_index.py- Index content in Pinecone - •
./scripts/skool_rag_query.py- Query the knowledge base
Pipeline
1. Prepare Content
bash
python3 ./scripts/skool_rag_prepare.py --community makerschool
Scrapes and chunks community content.
2. Index in Pinecone
bash
python3 ./scripts/skool_rag_index.py --input .tmp/skool_chunks.json
Creates OpenAI embeddings and stores in Pinecone.
3. Query
bash
python3 ./scripts/skool_rag_query.py --query "How do I get my first client?"
Pipeline:
- •OpenAI embeddings for query
- •Pinecone vector search
- •Cohere reranking
- •Claude response generation
Environment
code
PINECONE_API_KEY=your_key OPENAI_API_KEY=your_key COHERE_API_KEY=your_key ANTHROPIC_API_KEY=your_key