LightRAG: Simple and Fast Retrieval-Augmented Generation
Paper: arXiv 2410.05779
Production-grade Retrieval-Augmented Generation with hybrid retrieval, graph-based reasoning, and rigorous evaluation
A 3-tier RAG system that goes far beyond basic "vector search → LLM" tutorials:
| Tier | What It Does | How It's Different |
|---|---|---|
| Tier 1: Basic | Dense vector search → LLM | Baseline (what tutorials teach) |
| Tier 2: Hybrid | BM25 + Dense + RRF fusion + Cross-encoder reranking → LLM | Production-grade retrieval |
| Tier 3: Graph | LightRAG knowledge graph + multi-hop reasoning → LLM | Research-grade, multi-hop Q&A |
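Tier 2's fusion step needs no tuned weights, which is why RRF is a safe default. A minimal pure-Python sketch of Reciprocal Rank Fusion (the function name and example doc IDs are illustrative, not the project's actual API):

```python
def rrf_fuse(rankings, k=60):
    """Merge several ranked lists with Reciprocal Rank Fusion.

    rankings: list of ranked doc-id lists (best first).
    k: damping constant; 60 is the value from the original RRF paper.
    Each doc scores sum(1 / (k + rank)) over the lists it appears in.
    """
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)

# Example: BM25 and dense retrieval disagree on order; RRF rewards
# documents both retrievers surface.
bm25_hits = ["d3", "d1", "d7"]
dense_hits = ["d1", "d3", "d9"]
print(rrf_fuse([bm25_hits, dense_hits]))  # d1 and d3 outrank d7 and d9
```

Because RRF only looks at ranks, not raw scores, it is robust to the incomparable score scales of BM25 and cosine similarity.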
All three tiers are evaluated with RAGAS metrics to prove the improvements aren't just theoretical.
| Metric | Tier 1 (Basic) | Tier 2 (Hybrid) | Tier 3 (Graph) |
|---|---|---|---|
| Faithfulness | ~X.XX | ~X.XX | ~X.XX |
| Answer Relevancy | ~X.XX | ~X.XX | ~X.XX |
| Context Recall | ~X.XX | ~X.XX | ~X.XX |
| Context Precision | ~X.XX | ~X.XX | ~X.XX |
(Fill in after running evaluation; these numbers go in your resume!)
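To produce those numbers, RAGAS consumes a table of questions, generated answers, retrieved contexts, and references. A sketch of the expected shape, using stdlib only; the column names follow common RAGAS releases and the actual `evaluate` call (shown as a comment) needs the `ragas` package plus an LLM key:

```python
# Sketch: assembling the evaluation set RAGAS consumes.
# Column names ("question", "answer", "contexts", "ground_truth") follow
# common RAGAS versions; verify against your installed release.
eval_rows = {
    "question": ["What does RRF stand for?"],
    "answer": ["Reciprocal Rank Fusion."],  # the tier output under test
    "contexts": [["RRF (Reciprocal Rank Fusion) merges ranked lists."]],
    "ground_truth": ["Reciprocal Rank Fusion"],
}

# With ragas installed and an LLM key set, evaluation looks roughly like:
#   from datasets import Dataset
#   from ragas import evaluate
#   from ragas.metrics import faithfulness, answer_relevancy
#   scores = evaluate(Dataset.from_dict(eval_rows),
#                     metrics=[faithfulness, answer_relevancy])

# Sanity-check the table shape before spending LLM calls
assert all(len(v) == len(eval_rows["question"]) for v in eval_rows.values())
```

Note that `contexts` is a list of lists: each question carries all chunks retrieved for it.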
```
Advanced RAG Pipeline

Documents
 │
 ├─▶ Chunking (recursive, 512 tokens, 50 overlap)
 │
 ├─▶ TIER 1: Dense Retrieval
 │    └── BGE-small embeddings → FAISS index → Top-K
 │
 ├─▶ TIER 2: Hybrid Retrieval
 │    ├── BM25 (sparse) ──┐
 │    ├── BGE (dense) ────┼─▶ RRF Fusion → Cross-encoder → Top-K
 │    └── Reciprocal Rank Fusion
 │
 └─▶ TIER 3: Graph Retrieval (LightRAG)
      ├── Entity Extraction → Knowledge Graph
      ├── Local queries (specific entities)
      ├── Global queries (abstract themes)
      └── Hybrid mode (best of both)

Query
 │
 ├─▶ Retrieve relevant contexts (any tier)
 ├─▶ Rerank with cross-encoder
 ├─▶ Generate answer with LLM (Groq API / HF Inference)
 └─▶ Evaluate with RAGAS (faithfulness, relevancy, recall)

Gradio Interface
 ├── Chat tab (ask questions)
 ├── Upload tab (add documents)
 ├── Compare tab (side-by-side tier comparison)
 └── Eval tab (RAGAS scores)
```
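The chunking step windows each document at 512 tokens with a 50-token overlap so no fact is split cleanly across a boundary. A simplified sketch using whitespace words as stand-in tokens (the pipeline's recursive splitter works on real tokenizer counts, but the windowing logic is the same):

```python
def chunk_tokens(text, size=512, overlap=50):
    """Split text into overlapping chunks.

    Whitespace words stand in for tokens here; a production splitter
    would count tokenizer tokens, but the sliding window is identical.
    """
    tokens = text.split()
    step = size - overlap  # each advance leaves `overlap` tokens shared
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(" ".join(tokens[start:start + size]))
        if start + size >= len(tokens):
            break
    return chunks

# A 1100-"token" document yields 3 chunks: 0-511, 462-973, 924-1099
doc = " ".join(f"tok{i}" for i in range(1100))
chunks = chunk_tokens(doc)
print(len(chunks))  # 3
```

The 50-token overlap means the last 50 tokens of one chunk reappear at the start of the next, which keeps sentences that straddle a boundary retrievable from at least one chunk.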
| Component | Tool | Why |
|---|---|---|
| Dense Embeddings | BAAI/bge-small-en-v1.5 (33MB) | Best quality/size ratio, CPU-fast |
| Sparse Retrieval | rank_bm25 | Classic term-matching, complements dense |
| Fusion | Reciprocal Rank Fusion (RRF) | No tuning needed, robust across domains |
| Reranker | cross-encoder/ms-marco-MiniLM-L6-v2 | Best CPU reranker (74.3 NDCG@10) |
| Graph RAG | LightRAG (34K GitHub stars) | Entity-relationship graphs for multi-hop |
| LLM | Groq API (free, Llama 3.3 70B) | Zero cost, fast, high quality |
| Evaluation | RAGAS | Standard RAG evaluation framework |
| Frontend | Gradio → HF Spaces | Free deployment, no GPU needed |
| Vector Store | FAISS (CPU) | Fast, no server needed |
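For intuition on the sparse side of Tier 2, here is a compact pure-Python sketch of the BM25 scoring that `rank_bm25`'s `BM25Okapi` implements (this version skips the library's IDF floor and precomputation; `k1=1.5, b=0.75` are common defaults):

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized doc against a tokenized query with BM25."""
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N          # average doc length
    df = Counter(t for d in docs for t in set(d))  # document frequency
    scores = []
    for doc in docs:
        tf = Counter(doc)
        s = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log((N - df[term] + 0.5) / (df[term] + 0.5) + 1)
            # Term-frequency saturation + length normalization
            norm = tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(doc) / avgdl))
            s += idf * norm
        scores.append(s)
    return scores

docs = [["hybrid", "retrieval", "fuses", "results"],
        ["dense", "vectors", "capture", "semantics"],
        ["bm25", "rewards", "exact", "term", "matches"]]
print(bm25_scores(["exact", "term", "matches"], docs))  # last doc wins
```

This is exactly the behavior that complements dense retrieval: BM25 scores zero on documents with no lexical overlap, even when they are semantically related, which is why the pipeline fuses both signals.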
```
project2_advanced_rag/
├── README.md               # This file
├── requirements.txt        # Dependencies
├── rag_engine.py           # Core RAG engine (all 3 tiers)
├── evaluation.py           # RAGAS evaluation pipeline
├── app.py                  # Gradio web interface
├── ingest_sample_data.py   # Download & index sample documents
├── config.py               # Configuration (API keys, model names)
└── sample_data/            # Sample documents for demo
    └── README.md
```
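A plausible sketch of what `config.py` centralizes, inferred from the tech-stack table; the constant names, the Groq model id, and `TOP_K` are assumptions, not the actual file contents:

```python
import os

# Hypothetical config.py, inferred from the tech-stack table above.
GROQ_API_KEY = os.environ.get("GROQ_API_KEY", "")

LLM_MODEL = "llama-3.3-70b-versatile"  # Groq model id; check their current catalog
EMBED_MODEL = "BAAI/bge-small-en-v1.5"
RERANK_MODEL = "cross-encoder/ms-marco-MiniLM-L6-v2"

CHUNK_SIZE = 512     # tokens per chunk (from the pipeline diagram)
CHUNK_OVERLAP = 50   # tokens shared between adjacent chunks
TOP_K = 5            # assumed retrieval depth, not stated in this README
```

Reading the key from the environment keeps secrets out of the repo and matches the HF Spaces secrets workflow below.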
```bash
# 1. Install dependencies
pip install -r requirements.txt

# 2. Set up an API key (free)
#    Go to https://console.groq.com → create an API key
export GROQ_API_KEY="your-key-here"

# 3. Index sample documents
python ingest_sample_data.py

# 4. Launch the app
python app.py
# Opens at http://localhost:7860
```
```bash
# 1. Create a new Space on huggingface.co
# 2. Upload all files
# 3. Add GROQ_API_KEY to the Space secrets
# 4. It deploys automatically!
```
| Service | What For | Link |
|---|---|---|
| Groq | LLM (Llama 3.3 70B) | console.groq.com |
| HuggingFace | Embeddings (optional, runs locally) | huggingface.co/settings/tokens |