Production RAG Pipeline
In-memory RAG pipeline that demonstrates chunking, hybrid retrieval, caching, and answer assembly -- no external services required.
Focus: Ingestion, hybrid retrieval, caching, and grounded answers
- Chunking with configurable size and overlap
- Hybrid scoring that blends lexical overlap and cosine similarity
- Query-level caching with hit tracking