metadata
title: Datahub Qa Bot
emoji: π
colorFrom: gray
colorTo: purple
sdk: streamlit
sdk_version: 1.17.0
app_file: app.py
pinned: false
license: mit
DataHub documentation bot
Uses OpenAI, LangChain, and Streamlit to train on the DataHub documentation and serve a DataHub QA bot on a Hugging Face Space.
How to run locally
- Clone the repo
- Run:
source .venv/bin/activate
pip install -r requirements.txt
streamlit run app.py
How to train your own model
- Delete the db folder
- Copy the DataHub docs folder to ./docs
- Update os.environ["OPENAI_API_KEY"] in train.py
- Run:
python3 train.py
Training takes about 15 seconds and costs around $0.20. When it finishes, the output should include lines like:
chromadb.db.duckdb: loaded in 236 embeddings
chromadb.db.duckdb: loaded in 1 collections
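The preparation steps above (delete the db folder, copy in ./docs, set the API key) can be sketched as a small pre-flight check. This is a minimal sketch and not part of the original repo: the helper name prepare_training is hypothetical, and it assumes the docs are Markdown (.md) files and that you would rather export OPENAI_API_KEY in your shell than hard-code it in train.py.

```python
import os
import shutil
from pathlib import Path
from typing import List, Optional


def prepare_training(root: Path, api_key: Optional[str] = None) -> List[Path]:
    """Hypothetical pre-flight helper; not part of the original repo."""
    # Prefer a key passed in or already exported in the shell
    # over one hard-coded in train.py.
    key = api_key or os.environ.get("OPENAI_API_KEY")
    if not key:
        raise RuntimeError("Set OPENAI_API_KEY before running train.py")
    os.environ["OPENAI_API_KEY"] = key

    # Delete the existing db folder so the embeddings are rebuilt from scratch.
    shutil.rmtree(root / "db", ignore_errors=True)

    # Confirm the docs folder was copied in, then list its Markdown files
    # (the .md extension is an assumption about the docs format).
    docs = root / "docs"
    if not docs.is_dir():
        raise FileNotFoundError(f"Expected a docs folder at {docs}")
    return sorted(docs.rglob("*.md"))
```

Calling prepare_training(Path(".")) from the repo root before python3 train.py would fail early if the key or the docs folder is missing, rather than partway through the paid embedding step.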