Spaces:

joelg
/

discover_rag

Sleeping

App Files Files Community

discover_rag / README.md

joelg

README

5fffa7e about 1 month ago

preview code

raw

history blame contribute delete

2 kB

A newer version of the Gradio SDK is available: 5.49.1

Upgrade

metadata

title: RAG Pedagogical Demo
emoji: 🎓
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

🎓 RAG Pedagogical Demo

An interactive educational application to learn about Retrieval Augmented Generation (RAG) systems.

What is RAG?

Retrieval Augmented Generation (RAG) combines information retrieval with language generation to create more accurate and grounded AI responses. Instead of relying solely on a language model's training data, RAG systems:

Retrieve relevant information from a document corpus
Augment the query with this retrieved context
Generate an answer based on both the query and the retrieved information

Features

📚 Upload your own PDFs or use the default corpus
🔧 Configure retrieval parameters: embedding models, chunk size, top-k, similarity threshold
🤖 Configure generation parameters: LLM selection, temperature, max tokens
📊 Visualize the process: see retrieved chunks, similarity scores, and prompts
🌍 Bilingual interface: English and French

How to Use

Corpus Tab: Upload a PDF or use the default corpus about RAG
Retrieval Tab: Choose embedding model and retrieval parameters
Generation Tab: Select language model and generation settings
Query Tab: Ask questions and see how RAG works!

Educational Value

This demo helps you understand:

How documents are processed and chunked
How semantic search retrieves relevant information
How context is provided to language models
How different parameters affect the results

Perfect for students, educators, and anyone curious about modern AI systems!

Technology

Framework: Gradio
Embeddings: Sentence Transformers
Vector Store: FAISS
LLMs: HuggingFace Inference API
Infrastructure: HuggingFace ZeroGPU

Note: This application runs on ZeroGPU. Initial requests may take longer as models are loaded.