RajatChaudhari commited on
Commit
a4f89e5
·
verified ·
1 Parent(s): e8811f5

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +4 -3
app.py CHANGED
@@ -68,9 +68,10 @@ if __name__ == "__main__":
68
  <img src="https://superagi.com/wp-content/uploads/2023/10/Introduction-to-RAGA-Retrieval-Augmented-Generation-and-Actions-1200x600.png.webp" width=100%>
69
  <br>
70
  Demo using TinyLlama, a chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T. This space demonstrate application of RAG on a small model and its effectiveness, I used it because of the space constraint. The current space runs on mere <b>2GB of RAM</b>, hence there is some delay in generating output. Test this to your hearts content and let me know your thoughts, I will keep updating this space with tiny improvements on architecture and design
71
-
72
- update1: This space now does not create a faiss index on build, it uses a locally saved faiss index
73
- update2: This space now uses google/gemma-1.1-2b-it model to generate output, reduces the response time to 1/3rd
 
74
  """
75
  article = """<p style='text-align: center'>
76
  <ul>You can ask questions like -
 
68
  <img src="https://superagi.com/wp-content/uploads/2023/10/Introduction-to-RAGA-Retrieval-Augmented-Generation-and-Actions-1200x600.png.webp" width=100%>
69
  <br>
70
  Demo using TinyLlama, a chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T. This space demonstrate application of RAG on a small model and its effectiveness, I used it because of the space constraint. The current space runs on mere <b>2GB of RAM</b>, hence there is some delay in generating output. Test this to your hearts content and let me know your thoughts, I will keep updating this space with tiny improvements on architecture and design
71
+ <ul>
72
+ <li>update1: This space now does not create a faiss index on build, it uses a locally saved faiss index</li>
73
+ <li>update2: This space now uses google/gemma-1.1-2b-it model to generate output, reduces the response time to 1/3rd</li>
74
+ </ul>
75
  """
76
  article = """<p style='text-align: center'>
77
  <ul>You can ask questions like -