discover_rag / README.md
joelg's picture
README
5fffa7e

A newer version of the Gradio SDK is available: 5.49.1

Upgrade
metadata
title: RAG Pedagogical Demo
emoji: πŸŽ“
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

πŸŽ“ RAG Pedagogical Demo

An interactive educational application to learn about Retrieval Augmented Generation (RAG) systems.

What is RAG?

Retrieval Augmented Generation (RAG) combines information retrieval with language generation to create more accurate and grounded AI responses. Instead of relying solely on a language model's training data, RAG systems:

  1. Retrieve relevant information from a document corpus
  2. Augment the query with this retrieved context
  3. Generate an answer based on both the query and the retrieved information

Features

  • πŸ“š Upload your own PDFs or use the default corpus
  • πŸ”§ Configure retrieval parameters: embedding models, chunk size, top-k, similarity threshold
  • πŸ€– Configure generation parameters: LLM selection, temperature, max tokens
  • πŸ“Š Visualize the process: see retrieved chunks, similarity scores, and prompts
  • 🌍 Bilingual interface: English and French

How to Use

  1. Corpus Tab: Upload a PDF or use the default corpus about RAG
  2. Retrieval Tab: Choose embedding model and retrieval parameters
  3. Generation Tab: Select language model and generation settings
  4. Query Tab: Ask questions and see how RAG works!

Educational Value

This demo helps you understand:

  • How documents are processed and chunked
  • How semantic search retrieves relevant information
  • How context is provided to language models
  • How different parameters affect the results

Perfect for students, educators, and anyone curious about modern AI systems!

Technology

  • Framework: Gradio
  • Embeddings: Sentence Transformers
  • Vector Store: FAISS
  • LLMs: HuggingFace Inference API
  • Infrastructure: HuggingFace ZeroGPU

Note: This application runs on ZeroGPU. Initial requests may take longer as models are loaded.