sgawtho committed on
Commit cd4a2a8 • 1 Parent(s): 965b831

added Dockerfile and Chainlit Python files

Files changed (6)
  1. Dockerfile +11 -0
  2. README.md +14 -10
  3. app.py +79 -0
  4. chainlit.md +15 -0
  5. public/image1.png +0 -0
  6. requirements.txt +5 -0
Dockerfile ADDED
@@ -0,0 +1,11 @@
+ FROM python:3.11
+ RUN useradd -m -u 1000 user
+ USER user
+ ENV HOME=/home/user \
+     PATH=/home/user/.local/bin:$PATH
+ WORKDIR $HOME/app
+ # Install dependencies first so this layer is cached across source changes
+ COPY --chown=user ./requirements.txt $HOME/app/requirements.txt
+ RUN pip install -r requirements.txt
+ COPY --chown=user . $HOME/app
+ CMD ["chainlit", "run", "app.py", "--port", "7860"]
README.md CHANGED
@@ -1,10 +1,14 @@
- ---
- title: Raqa Pinecone
- emoji: 🌖
- colorFrom: red
- colorTo: gray
- sdk: docker
- pinned: false
- ---
-
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+ # ChatWithPinecone 🌲
+
+ This is a conversational application that integrates Pinecone, OpenAI embeddings, and the Chainlit framework. Here's an overview of its key components and functionality:
+
+ An index in Pinecone was created to store and retrieve vectorized data. OpenAI embeddings convert text to vectors so that it can be stored and searched in the Pinecone index.
+
+ The following diagram shows the flow of data in the application:
+
+ 1. The user enters a message in the chatbot.
+ 2. The message is passed to the conversational retrieval chain.
+ 3. The user's prompt is vectorized using OpenAI embeddings, and the Pinecone index returns the three documents most similar to the prompt.
+ 4. An LLM composes an answer from those three documents and returns it to the user.
+
+ ![Flow Diagram](./public/image1.png)
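Note: the one-off script that populated the `langchain-demo` index is not part of this commit. Below is a minimal sketch of how it could look with the pinned `pinecone-client==2.2.1` and the same LangChain imports used in `app.py`; the source file name, chunk sizes, and distance metric are assumptions:

```python
# Hypothetical indexing script -- not included in this commit.
import os
import pinecone
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores.pinecone import Pinecone

pinecone.init(
    api_key=os.environ.get("PINECONE_API_KEY"),
    environment=os.environ.get("PINECONE_ENV"),
)

# text-embedding-ada-002 (the OpenAIEmbeddings default) returns 1536-dim vectors
if "langchain-demo" not in pinecone.list_indexes():
    pinecone.create_index("langchain-demo", dimension=1536, metric="cosine")

# king_lear.txt is an assumed local file; chunk sizes are illustrative
with open("king_lear.txt") as f:
    chunks = RecursiveCharacterTextSplitter(
        chunk_size=1000, chunk_overlap=100
    ).split_text(f.read())

# Embed each chunk with OpenAI and upsert the vectors into the index
Pinecone.from_texts(chunks, OpenAIEmbeddings(), index_name="langchain-demo")
```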
app.py ADDED
@@ -0,0 +1,79 @@
+ import os
+ from typing import List
+ from langchain.embeddings.openai import OpenAIEmbeddings
+ from langchain.vectorstores.pinecone import Pinecone
+ from langchain.chains import ConversationalRetrievalChain
+ from langchain.chat_models import ChatOpenAI
+ from langchain.memory import ChatMessageHistory, ConversationBufferMemory
+ from langchain.docstore.document import Document
+ import pinecone
+ import chainlit as cl
+
+ pinecone.init(
+     api_key=os.environ.get("PINECONE_API_KEY"),
+     environment=os.environ.get("PINECONE_ENV"),
+ )
+
+
+ index_name = "langchain-demo"
+ embeddings = OpenAIEmbeddings()
+
+ welcome_message = "Welcome to the Chainlit Pinecone demo! Ask anything about Shakespeare's King Lear, whose vectorized documents are stored in Pinecone."
+
+
+ @cl.on_chat_start
+ async def start():
+     await cl.Message(content=welcome_message).send()
+     docsearch = Pinecone.from_existing_index(
+         index_name=index_name, embedding=embeddings
+     )
+
+     message_history = ChatMessageHistory()
+
+     memory = ConversationBufferMemory(
+         memory_key="chat_history",
+         output_key="answer",
+         chat_memory=message_history,
+         return_messages=True,
+     )
+
+     chain = ConversationalRetrievalChain.from_llm(
+         ChatOpenAI(
+             model_name="gpt-3.5-turbo",
+             temperature=0,
+             streaming=True),
+         chain_type="stuff",
+         retriever=docsearch.as_retriever(search_kwargs={'k': 3}),  # return at most the three documents with the highest similarity scores
+         memory=memory,
+         return_source_documents=True,
+     )
+     cl.user_session.set("chain", chain)
+
+
+ @cl.on_message
+ async def main(message: cl.Message):
+     chain = cl.user_session.get("chain")  # type: ConversationalRetrievalChain
+
+     cb = cl.AsyncLangchainCallbackHandler()
+
+     res = await chain.acall(message.content, callbacks=[cb])
+     answer = res["answer"]
+     source_documents = res["source_documents"]  # type: List[Document]
+
+     text_elements = []  # type: List[cl.Text]
+
+     if source_documents:
+         for source_idx, source_doc in enumerate(source_documents):
+             source_name = f"source_{source_idx}"
+             # Create the text element referenced in the message
+             text_elements.append(
+                 cl.Text(content=source_doc.page_content, name=source_name)
+             )
+         source_names = [text_el.name for text_el in text_elements]
+
+         if source_names:
+             answer += f"\nSources: {', '.join(source_names)}"
+         else:
+             answer += "\nNo sources found"
+
+     await cl.Message(content=answer, elements=text_elements).send()
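Note: outside of Chainlit, the retrieval the chain performs on every message can be reproduced directly against the index. A minimal sketch of the top-3 similarity search (the `k=3` set on the retriever above), assuming the same environment variables; the example question is invented:

```python
# Hedged sketch of the standalone retrieval step behind the chain.
import os
import pinecone
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.vectorstores.pinecone import Pinecone

pinecone.init(
    api_key=os.environ.get("PINECONE_API_KEY"),
    environment=os.environ.get("PINECONE_ENV"),
)
docsearch = Pinecone.from_existing_index(
    index_name="langchain-demo", embedding=OpenAIEmbeddings()
)

# The question is embedded with OpenAI, then Pinecone returns the three
# nearest chunks -- the same k=3 the chain's retriever uses.
for doc in docsearch.similarity_search("Why does Lear divide his kingdom?", k=3):
    print(doc.page_content[:80])
```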
chainlit.md ADDED
@@ -0,0 +1,15 @@
+ ### ChatWithPinecone 🌲
+
+ This is a conversational application that integrates Pinecone, OpenAI embeddings, and the Chainlit framework. Here's an overview of its key components and functionality:
+
+ An index in Pinecone was created to store and retrieve vectorized data. OpenAI embeddings convert text to vectors so that it can be stored and searched in the Pinecone index.
+
+ The following diagram shows the flow of data in the application:
+
+ 1. The user enters a message in the chatbot.
+ 2. The message is passed to the conversational retrieval chain.
+ 3. The user's prompt is vectorized using OpenAI embeddings, and the Pinecone index returns the three documents most similar to the prompt.
+ 4. An LLM composes an answer from those three documents and returns it to the user.
+ 5. Conversation memory is enabled using LangChain's ConversationBufferMemory, so the chat history is carried into follow-up questions.
+
+ ![Flow Diagram](./public/image1.png)
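Note: step 5 refers to LangChain's conversation memory, which stores the chat history (not the retrieved documents) so follow-up questions can be resolved in context. A minimal sketch of what the buffer does between turns, using the same configuration as `app.py`; the exchange is invented:

```python
# Hedged sketch of ConversationBufferMemory; the Q&A below is invented.
from langchain.memory import ChatMessageHistory, ConversationBufferMemory

memory = ConversationBufferMemory(
    memory_key="chat_history",
    output_key="answer",
    chat_memory=ChatMessageHistory(),
    return_messages=True,
)

# Each chain call saves the user/AI turn into the buffer...
memory.save_context(
    {"question": "Who is Cordelia?"},
    {"answer": "Lear's youngest daughter."},
)

# ...and the accumulated messages are injected into the next prompt, so a
# follow-up like "Why does Lear banish her?" can resolve "her" in context.
print(memory.load_memory_variables({})["chat_history"])
```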
public/image1.png ADDED
requirements.txt ADDED
@@ -0,0 +1,5 @@
+ pinecone-client==2.2.1
+ tiktoken==0.3.3
+ langchain
+ chainlit
+ openai