Entz committed
Commit 307a717
1 Parent(s): e97d234

Upload 5 files

Files changed (5)
  1. app.py +63 -53
  2. qa.db +0 -0
  3. requirements.txt +6 -4
  4. tab1_intro.txt +23 -0
  5. tab2_pe.txt +24 -0
app.py CHANGED
@@ -1,23 +1,56 @@
 import streamlit as st
 import pandas as pd
 import sqlite3
+from llama_index.core import StorageContext, load_index_from_storage
 from llama_index.llms.ollama import Ollama
+from llama_index.embeddings.huggingface import HuggingFaceEmbedding
+from llama_index.core import PromptTemplate
+import os
+
+version = 2.2
 
 # Initialize the SQLite3 database
 conn = sqlite3.connect('qa.db')
 c = conn.cursor()
-c.execute('CREATE TABLE IF NOT EXISTS qa (question TEXT, answer TEXT)')
+# Update the table creation to include the version column
+c.execute('CREATE TABLE IF NOT EXISTS qa (question TEXT, answer TEXT, version REAL)')
 conn.commit()
 
-
-# Initialize the Ollama object
-@st.cache_resource ### This caches the model loading function
-def load_ollama_model():
-    return Ollama(model="mistral", request_timeout=30.0)
+# Read the LLM Model Description from a file
+def read_description_from_file(file_path):
+    with open(file_path, 'r') as file:
+        return file.read()
+
+# Define the folder containing the saved index
+INDEX_OUTPUT_PATH = "./output_index"
+
+# Ensure the output directory exists
+if not os.path.exists(INDEX_OUTPUT_PATH):
+    raise ValueError(f"Index directory {INDEX_OUTPUT_PATH} does not exist")
+
+# Setup LLM and embedding model
+llm = Ollama(model="llama3", request_timeout=120.0)
+embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-large-en-v1.5", trust_remote_code=True)
+
+# To load the index later, set up the storage context
+storage_context = StorageContext.from_defaults(persist_dir=INDEX_OUTPUT_PATH)
+loaded_index = load_index_from_storage(embed_model=embed_model, storage_context=storage_context)
+
+# Define a query engine (assuming it needs the LLM and embedding model)
+query_engine = loaded_index.as_query_engine(llm=llm, embed_model=embed_model)
+
+# Customise prompt template
+# Read the prompt template from a file
+qa_prompt_tmpl_str = read_description_from_file("tab2_pe.txt")
+qa_prompt_tmpl = PromptTemplate(qa_prompt_tmpl_str)
+
+query_engine.update_prompts(
+    {"response_synthesizer:text_qa_template": qa_prompt_tmpl}
+)
 
 # Save the question and answer to the SQLite3 database
-def save_to_db(question, answer):
-    c.execute('INSERT INTO qa (question, answer) VALUES (?, ?)', (question, answer))
+def save_to_db(question, answer, version):
+    c.execute('INSERT INTO qa (question, answer, version) VALUES (?, ?, ?)', (question, answer, version))
     conn.commit()
 
 # Fetch all data from the SQLite3 database
@@ -32,56 +65,35 @@ def main():
 
     with tab1:
         st.subheader("LLM Model Description")
-        st.write("""
-        Welcome to our pilot program that aims to test and evaluate the capabilities of the Mistral 7B model in understanding and providing information about the Wandsworth Council. This application leverages the **Ollama** framework, specifically utilizing the advanced 'mistral' architecture to handle question and answer tasks.
-
-        ### What is the Mistral 7B Model?
-        The Mistral 7B model is a sophisticated language model designed to understand and generate human-like text. It is part of a new generation of models developed to handle complex queries with a high degree of accuracy. This model is trained on a diverse dataset, enabling it to provide insightful and accurate answers to a wide range of questions.
-
-        ### About Ollama
-        **Ollama** is a cutting-edge framework that facilitates the deployment and use of large language models (LLMs). By integrating the Mistral 7B model within the Ollama framework, this application can efficiently process and respond to user queries.
-
-        ### How Does It Work?
-        - **Ask a Question**: Users can input their questions regarding the Wandsworth Council in the "Ask a Question" tab. The Mistral 7B model will process the query and generate a relevant response based on its extensive training data.
-        - **View Q&A History**: All user queries and the corresponding answers are stored in a database, allowing us to analyze the performance of the model over time. Users can view the history of questions and answers in the "View Q&A History" tab.
-
-        ### Objectives
-        The primary goal of this pilot program is to assess how well the Mistral 7B model can handle queries related to local government information, specifically focusing on the Wandsworth Council. By collecting and analyzing user queries, we aim to:
-        - Identify strengths and weaknesses in the model’s responses.
-        - Improve the model’s accuracy and reliability in providing information about local council services.
-        - Explore the potential of LLMs in supporting local government communications and citizen engagement.
-
-        We encourage users to ask diverse and challenging questions to help us test the limits of the Mistral 7B model’s knowledge and understanding. Your participation and feedback are invaluable to the success of this program.
-
-        Thank you for being part of this exciting journey to explore the capabilities of advanced language models in serving local communities!
-
-        Last Update: 23rd June 2024
-        """)
+        description = read_description_from_file("tab1_intro.txt")
+        st.write(description)
 
     with tab2:
         st.subheader("Ask a Question")
         question = st.text_input("Enter your question:")
         if st.button("Get Answer"):
            if question:
-                # each time when it run tab2, this load_ollama_model won't be reload, thx to st.cache_resource
-                llm = load_ollama_model()
-                response = llm.complete(question)
-
-                ### # Print response for debugging
-                ### st.write("Debug: Response object")
-                ### st.write(response)
-
-                # Try to extract the generated text
                 try:
-                    answer = response.text
-                except AttributeError as e:
-                    st.error(f"Error extracting text from response: {e}")
-                    answer = "Sorry, could not generate an answer."
-
-                st.write(f"**Answer:** {answer}")
-
-                # Save question and answer to database
-                save_to_db(question, answer)
+                    response = query_engine.query(question)
+
+                    # Try to extract the generated text
+                    try:
+                        # Extract the text from the response object (assuming it has a `text` attribute or method)
+                        if hasattr(response, 'text'):
+                            answer = response.text
+                        else:
+                            answer = str(response)
+
+                    except AttributeError as e:
+                        st.error(f"Error extracting text from response: {e}")
+                        answer = "Sorry, could not generate an answer."
+
+                    st.write(f"**Answer:** {answer}")
+
+                    # Save question and answer to database
+                    save_to_db(question, answer, version)
+                except Exception as e:
+                    st.error(f"An error occurred: {e}")
            else:
                st.warning("Please enter a question")
 
@@ -89,12 +101,10 @@ def main():
         st.subheader("View Q&A History")
         qa_data = fetch_from_db()
         if qa_data:
-            df = pd.DataFrame(qa_data, columns=["Question", "Answer"])
+            df = pd.DataFrame(qa_data, columns=["Question", "Answer", "Version"])
            st.dataframe(df)
        else:
            st.write("No data available")
 
 if __name__ == "__main__":
-    debug = True # Set to False to disable debugging
-
     main()
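A note on the refactor above: the previous version loaded the model inside a function decorated with `@st.cache_resource`, while this version builds the Ollama LLM, the HuggingFace embedding model, and the index at module level, so Streamlit re-executes that setup on every rerun of the script. Below is a minimal sketch of how the heavyweight setup could be cached again under the new structure; the helper name `load_query_engine` is illustrative and not part of this commit.

```python
import streamlit as st
from llama_index.core import StorageContext, load_index_from_storage
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.ollama import Ollama

INDEX_OUTPUT_PATH = "./output_index"

@st.cache_resource  # built once per process, reused across Streamlit reruns
def load_query_engine():
    # Illustrative helper: mirrors the module-level setup in app.py above.
    llm = Ollama(model="llama3", request_timeout=120.0)
    embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-large-en-v1.5")
    storage_context = StorageContext.from_defaults(persist_dir=INDEX_OUTPUT_PATH)
    index = load_index_from_storage(storage_context, embed_model=embed_model)
    return index.as_query_engine(llm=llm, embed_model=embed_model)
```

Inside `main()`, `query_engine = load_query_engine()` would then return the cached engine instead of rebuilding the models and index on each interaction.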
qa.db CHANGED
Binary files a/qa.db and b/qa.db differ
 
requirements.txt CHANGED
@@ -1,4 +1,6 @@
-Streamlit
-pandas
-llama-index
-llama-index-llms-ollama
+streamlit==1.36.0
+pandas==2.2.2
+llama_index==0.10.50
+transformers==4.41.2
+llama_index.llms.ollama
+llama_index.embeddings.huggingface
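For reference, this file installs with `pip install -r requirements.txt`. pip normalizes requirement names (PEP 503 treats `.`, `_`, and `-` as equivalent), so the dotted entries `llama_index.llms.ollama` and `llama_index.embeddings.huggingface` should resolve to the same PyPI distributions as the hyphenated `llama-index-llms-ollama` the old file used.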
tab1_intro.txt ADDED
@@ -0,0 +1,23 @@
+Welcome to our pilot program that aims to test and evaluate the capabilities of the Mistral 7B model in understanding and providing information about the Wandsworth Council. This application leverages the **Ollama** framework, specifically utilizing the advanced 'mistral' architecture to handle question and answer tasks.
+
+### What is the Mistral 7B Model?
+The Mistral 7B model is a sophisticated language model designed to understand and generate human-like text. It is part of a new generation of models developed to handle complex queries with a high degree of accuracy. This model is trained on a diverse dataset, enabling it to provide insightful and accurate answers to a wide range of questions.
+
+### About Ollama
+**Ollama** is a cutting-edge framework that facilitates the deployment and use of large language models (LLMs). By integrating the Mistral 7B model within the Ollama framework, this application can efficiently process and respond to user queries.
+
+### How Does It Work?
+- **Ask a Question**: Users can input their questions regarding the Wandsworth Council in the "Ask a Question" tab. The Mistral 7B model will process the query and generate a relevant response based on its extensive training data.
+- **View Q&A History**: All user queries and the corresponding answers are stored in a database, allowing us to analyze the performance of the model over time. Users can view the history of questions and answers in the "View Q&A History" tab.
+
+### Objectives
+The primary goal of this pilot program is to assess how well the Mistral 7B model can handle queries related to local government information, specifically focusing on the Wandsworth Council. By collecting and analyzing user queries, we aim to:
+- Identify strengths and weaknesses in the model’s responses.
+- Improve the model’s accuracy and reliability in providing information about local council services.
+- Explore the potential of LLMs in supporting local government communications and citizen engagement.
+
+We encourage users to ask diverse and challenging questions to help us test the limits of the Mistral 7B model’s knowledge and understanding. Your participation and feedback are invaluable to the success of this program.
+
+Thank you for being part of this exciting journey to explore the capabilities of advanced language models in serving local communities!
+
+Last Update: 23rd June 2024
tab2_pe.txt ADDED
@@ -0,0 +1,24 @@
+Context information is below.
+---------------------
+{context_str}
+---------------------
+You work as support for residents' queries about council tax in Wandsworth.
+Even though you're a senior with more than 30 years' experience, you double-check all the facts in the documentation.
+Our documentation is absolutely up to date, so you can fully rely on it when answering questions (you don't need to check the actual content on the council webpage).
+Your work is very important for the team's success.
+You need to ensure that you provide the best possible support: answering all the questions, making no assumptions,
+and sharing only the factual content.
+Be practical; there is no need to be creative, just try your best to solve the customer's problem.
+You are not allowed to share the details of the pdf, md, or Word documents.
+You are not allowed to use short links, like bit.ly links or the like.
+You are not allowed to leave a blank or empty link, e.g. http://, https://, or 'www.example.com'.
+You are only allowed to share world wide web links starting with https://www.wandsworth.gov.uk.
+If your content is from the RAG, or from the md or pdf files, and you are going to quote the source, please only quote the source URL link at the top of each file, e.g. "source: https://www.wandsworth.gov.uk/council-tax/council-tax-discounts-exemptions-and-reductions/apply-for-a-council-tax-discount/#impaired" at the top of the file "disabled_impaired.md".
+Do not invite people to see any internal files, e.g. 'see [disabled_impaired.md]', 'pdf', 'md', etc.
+Do not show the RAG structure; do not tell the folder or directory structure.
+Do not tell anything about the documents in the RAG system. If any query asks about the data or information source, say it is from the official webpage, or just quote the source URL link at the top of each file, e.g. "source: https://www.wandsworth.gov.uk/council-tax/council-tax-discounts-exemptions-and-reductions/apply-for-a-council-tax-discount/#impaired" at the top of the file "disabled_impaired.md".
+Format each reply as an email. Try to make the reply more helpful by including the URL address.
+If the sender's name is unknown, just use 'resident'.
+Sign off with my name, Lorentz, Data Scientist.
+Query: {query_str}
+Answer:
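For context on the two curly-brace fields: `{context_str}` and `{query_str}` are the standard variables of LlamaIndex's text QA template, which the `query_engine.update_prompts` call in app.py swaps this file in for; at query time the response synthesizer substitutes the retrieved chunks and the user's question. A small sketch for previewing the rendered prompt outside the app; the two filler values are invented for illustration.

```python
from llama_index.core import PromptTemplate

# Load the same template file the app reads.
qa_prompt_tmpl = PromptTemplate(open("tab2_pe.txt").read())

# Render it the way the response synthesizer would; both values below
# are made-up stand-ins for real retrieved context and a real question.
print(qa_prompt_tmpl.format(
    context_str="source: https://www.wandsworth.gov.uk/council-tax/ (example snippet)",
    query_str="How do I apply for a single-person council tax discount?",
))
```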