--- title: Buster emoji: 🤖 colorFrom: red colorTo: blue sdk: gradio app_file: buster/apps/gradio_app.py python_version: 3.10.8 pinned: false --- # Buster, the QA documentation chatbot! Buster is a question-answering chatbot that can be tuned to any source of documentations. # Demo You can try out our [live demo here](https://huggingface.co/spaces/jerpint/buster), where it will answer questions about a bunch of libraries we've already scraped, including [🤗 Transformers](https://huggingface.co/docs/transformers/index). # Quickstart Here is a quick guide to help you deploy buster on your own dataset! First step, install buster locally. Note that buster requires python>=3.10. ``` git clone https://github.com/jerpint/buster.git pip install -e . ``` Then, go to the examples folder. We've attached a sample `stackoverflow.csv` file to help you get started. You will convert the .csv to a `documents.db` file. ``` buster_csv_parser stackoverflow.csv --output-filepath documents.db ``` This will generate the embeddings and save them locally. Finally, run ``` gradio gradio_app.py ``` This will launch the gradio app locally, which you should be able to view on [localhost]( http://127.0.0.1:7860) ## How does Buster work? First, we parsed the documentation into snippets. For each snippet, we obtain an embedding by using the [OpenAI API](https://beta.openai.com/docs/guides/embeddings/what-are-embeddings). Then, when a user asks a question, we compute its embedding, and find the snippets from the doc with the highest cosine similarity to the question. Finally, we craft the prompt: - The most relevant snippets from the doc. - The engineering prompt. - The user's question. We send the prompt to the [OpenAI API](https://beta.openai.com/docs/api-reference/completions), and display the answer to the user! ### Currently available models - For embeddings: "text-embedding-ada-002" - For completion: We support both "text-davinci-003" and "gpt-3.5-turbo" ### Livestream For more information, you can watch the livestream where explain how buster works in detail! - [Livestream recording](https://youtu.be/LB5g-AhfPG8) - [Livestream notebook](https://colab.research.google.com/drive/1CosxSNod48KrkyBn5_vkeleb7u0CrBa6)