Merge pull request #1 from lewtun/add-langchain
- .env.example +3 -0
- .gitignore +163 -0
- README.md +12 -1
- app.py +71 -19
- config.py.example +1 -1
- prompt_templates/openai_chatgpt.json +9 -0
- requirements.txt +1 -2
.env.example
ADDED
@@ -0,0 +1,3 @@
+DATASET_REPO_URL="https://huggingface.co/datasets/{DATASET_ID}"
+FORCE_PUSH="no"
+HF_TOKEN="hf_xxx"
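These three variables are consumed at startup by `app.py` (see its diff below) via `python-dotenv`. A minimal sketch of that pattern, using only the names defined in this file:

```python
import os
from pathlib import Path

from dotenv import load_dotenv

# Mirror app.py: load .env into the environment if it exists, then read keys.
if Path(".env").is_file():
    load_dotenv(".env")

DATASET_REPO_URL = os.getenv("DATASET_REPO_URL")
FORCE_PUSH = os.getenv("FORCE_PUSH")
HF_TOKEN = os.getenv("HF_TOKEN")
```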
.gitignore
ADDED
@@ -0,0 +1,163 @@
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+# Usually these files are written by a python script from a template
+# before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+cover/
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+.pybuilder/
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+# For a library or package, you might want to ignore these files since the code is
+# intended to run in multiple environments; otherwise, check them in:
+# .python-version
+
+# pipenv
+# According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+# However, in case of collaboration, if having platform-specific dependencies or dependencies
+# having no cross-platform support, pipenv may install dependencies that don't work, or not
+# install all needed dependencies.
+#Pipfile.lock
+
+# poetry
+# Similar to Pipfile.lock, it is generally recommended to include poetry.lock in version control.
+# This is especially recommended for binary packages to ensure reproducibility, and is more
+# commonly ignored for libraries.
+# https://python-poetry.org/docs/basic-usage/#commit-your-poetrylock-file-to-version-control
+#poetry.lock
+
+# pdm
+# Similar to Pipfile.lock, it is generally recommended to include pdm.lock in version control.
+#pdm.lock
+# pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
+# in version control.
+# https://pdm.fming.dev/#use-with-ide
+.pdm.toml
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
+__pypackages__/
+
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+
+# Pyre type checker
+.pyre/
+
+# pytype static type analyzer
+.pytype/
+
+# Cython debug symbols
+cython_debug/
+
+# PyCharm
+# JetBrains specific template is maintained in a separate JetBrains.gitignore that can
+# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
+# and can be added to the global gitignore or merged into this file. For a more nuclear
+# option (not recommended) you can uncomment the following to ignore the entire idea folder.
+#.idea/
+
+# Local development
+data/
README.md
CHANGED
@@ -14,6 +14,7 @@ A basic example of an RLHF interface with a Gradio app.
 **Instructions for someone to use for their own project:**
 
 *Setting up the Space*
+
 1. Clone this repo and deploy it on your own Hugging Face space.
 2. Add the following secrets to your space:
    - `HF_TOKEN`: One of your Hugging Face tokens.
@@ -24,11 +25,21 @@ A basic example of an RLHF interface with a Gradio app.
 huggingface.co, the app will use your token to automatically store new HITs
 in your dataset. Setting `FORCE_PUSH` to "yes" ensures that your repo will
 force push changes to the dataset during data collection. Otherwise,
-accidental manual changes to your dataset could result in your space
+accidental manual changes to your dataset could result in your space getting
 merge conflicts as it automatically tries to push the dataset to the hub. For
 local development, add these three keys to a `.env` file, and consider setting
 `FORCE_PUSH` to "no".
+
+To launch the Space locally, run:
+
+```bash
+python app.py
+```
+
+The app will then be available at a local address, such as http://127.0.0.1:7860
+
 *Running Data Collection*
+
 1. On your local repo that you pulled, create a copy of `config.py.example`,
 just called `config.py`. Now, put keys from your AWS account in `config.py`.
 These keys should be for an AWS account that has the
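The `FORCE_PUSH` behavior described above is implemented by the `force_git_push` helper in `utils.py`, which `app.py` imports but whose body is not part of this diff. A hedged sketch of the overall push pattern, with the `Repository` arguments and the helper's signature assumed:

```python
import os

from dotenv import load_dotenv
from huggingface_hub import Repository

load_dotenv(".env")

# Clone the dataset repo locally; collected HITs are committed into it.
repo = Repository(
    local_dir="data",
    clone_from=os.getenv("DATASET_REPO_URL"),
    use_auth_token=os.getenv("HF_TOKEN"),
)

if os.getenv("FORCE_PUSH") == "yes":
    # Overwrite the remote so manual edits to the dataset on the Hub
    # cannot leave the Space stuck on a merge conflict (assumed signature).
    from utils import force_git_push
    force_git_push(repo)
else:
    repo.push_to_hub()
```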
app.py
CHANGED
@@ -1,18 +1,21 @@
 # Basic example for doing model-in-the-loop dynamic adversarial data collection
 # using Gradio Blocks.
+import json
 import os
-import json
+import threading
 import uuid
+from pathlib import Path
 from urllib.parse import parse_qs
+
 import gradio as gr
-import requests
-from transformers import pipeline, Conversation
-from huggingface_hub import Repository
 from dotenv import load_dotenv
-from pathlib import Path
-import
+from huggingface_hub import Repository
+from langchain import ConversationChain
+from langchain.chains.conversation.memory import ConversationBufferMemory
+from langchain.llms import HuggingFaceHub
+from langchain.prompts import load_prompt
+
 from utils import force_git_push
-import threading
 
 # These variables are for storing the mturk HITs in a Hugging Face dataset.
 if Path(".env").is_file():
@@ -20,6 +23,10 @@ if Path(".env").is_file():
 DATASET_REPO_URL = os.getenv("DATASET_REPO_URL")
 FORCE_PUSH = os.getenv("FORCE_PUSH")
 HF_TOKEN = os.getenv("HF_TOKEN")
+PROMPT_TEMPLATES = Path("prompt_templates")
+# Set env variable for langchain to communicate with Hugging Face Hub
+os.environ["HUGGINGFACEHUB_API_TOKEN"] = HF_TOKEN
+
 DATA_FILENAME = "data.jsonl"
 DATA_FILE = os.path.join("data", DATA_FILENAME)
 repo = Repository(
@@ -49,7 +56,47 @@ f_stop = threading.Event()
 asynchronous_push(f_stop)
 
 # Now let's run the app!
-
+prompt = load_prompt(PROMPT_TEMPLATES / "openai_chatgpt.json")
+
+chatbot_1 = ConversationChain(
+    llm=HuggingFaceHub(
+        repo_id="google/flan-t5-xl",
+        model_kwargs={"temperature": 1}
+    ),
+    prompt=prompt,
+    verbose=False,
+    memory=ConversationBufferMemory(ai_prefix="Assistant"),
+)
+
+chatbot_2 = ConversationChain(
+    llm=HuggingFaceHub(
+        repo_id="bigscience/bloom",
+        model_kwargs={"temperature": 0.7}
+    ),
+    prompt=prompt,
+    verbose=False,
+    memory=ConversationBufferMemory(ai_prefix="Assistant"),
+)
+
+chatbot_3 = ConversationChain(
+    llm=HuggingFaceHub(
+        repo_id="bigscience/T0_3B",
+        model_kwargs={"temperature": 1}
+    ),
+    prompt=prompt,
+    verbose=False,
+    memory=ConversationBufferMemory(ai_prefix="Assistant"),
+)
+
+chatbot_4 = ConversationChain(
+    llm=HuggingFaceHub(
+        repo_id="EleutherAI/gpt-j-6B",
+        model_kwargs={"temperature": 1}
+    ),
+    prompt=prompt,
+    verbose=False,
+    memory=ConversationBufferMemory(ai_prefix="Assistant"),
+)
 
 demo = gr.Blocks()
 
@@ -65,6 +112,8 @@ with demo:
         "generated_responses": [],
         "response_1": "",
         "response_2": "",
+        "response_3": "",
+        "response_4": "",
     }
     state = gr.JSON(state_dict, visible=False)
 
@@ -74,31 +123,34 @@ with demo:
     state_display = gr.Markdown(f"Your messages: 0/{TOTAL_CNT}")
 
     # Generate model prediction
-    # Default model: distilbert-base-uncased-finetuned-sst-2-english
     def _predict(txt, state):
-
-
-
-
-
-
-
-
+        # TODO: parallelize this!
+        response_1 = chatbot_1.predict(input=txt)
+        response_2 = chatbot_2.predict(input=txt)
+        response_3 = chatbot_3.predict(input=txt)
+        response_4 = chatbot_4.predict(input=txt)
+
+        response2model = {}
+        response2model[response_1] = chatbot_1.llm.repo_id
+        response2model[response_2] = chatbot_2.llm.repo_id
+        response2model[response_3] = chatbot_3.llm.repo_id
+        response2model[response_4] = chatbot_4.llm.repo_id
 
         state["cnt"] += 1
 
         new_state_md = f"Inputs remaining in HIT: {state['cnt']}/{TOTAL_CNT}"
 
-        state["data"].append({"cnt": state["cnt"], "text": txt, "response_1": response_1, "response_2": response_2})
+        state["data"].append({"cnt": state["cnt"], "text": txt, "response_1": response_1, "response_2": response_2, "response_3": response_3, "response_4": response_4, "response2model": response2model})
         state["past_user_inputs"].append(txt)
 
         past_conversation_string = "<br />".join(["<br />".join(["😃: " + user_input, "🤖: " + model_response]) for user_input, model_response in zip(state["past_user_inputs"], state["generated_responses"] + [""])])
-        return gr.update(visible=False), gr.update(visible=True), gr.update(visible=True, choices=[response_1, response_2], interactive=True, value=response_1), gr.update(value=past_conversation_string), state, gr.update(visible=False), gr.update(visible=False), gr.update(visible=False), new_state_md, dummy
+        return gr.update(visible=False), gr.update(visible=True), gr.update(visible=True, choices=[response_1, response_2, response_3, response_4], interactive=True, value=response_1), gr.update(value=past_conversation_string), state, gr.update(visible=False), gr.update(visible=False), gr.update(visible=False), new_state_md, dummy
 
     def _select_response(selected_response, state, dummy):
         done = state["cnt"] == TOTAL_CNT
        state["generated_responses"].append(selected_response)
         state["data"][-1]["selected_response"] = selected_response
+        state["data"][-1]["selected_model"] = state["data"][-1]["response2model"][selected_response]
         if state["cnt"] == TOTAL_CNT:
             # Write the HIT data to our local dataset because the worker has
             # submitted everything now.
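Each of the four `ConversationChain`s above can be exercised on its own, which is handy for smoke-testing a model before wiring it into the Gradio app. A minimal sketch, assuming `HUGGINGFACEHUB_API_TOKEN` is set and `langchain==0.0.74` from `requirements.txt` is installed:

```python
from langchain import ConversationChain
from langchain.chains.conversation.memory import ConversationBufferMemory
from langchain.llms import HuggingFaceHub
from langchain.prompts import load_prompt

# Build one chain exactly as app.py does and query it twice;
# ConversationBufferMemory carries the first exchange into the second prompt.
prompt = load_prompt("prompt_templates/openai_chatgpt.json")
chatbot = ConversationChain(
    llm=HuggingFaceHub(repo_id="google/flan-t5-xl", model_kwargs={"temperature": 1}),
    prompt=prompt,
    verbose=False,
    memory=ConversationBufferMemory(ai_prefix="Assistant"),
)
print(chatbot.predict(input="What is the capital of France?"))
print(chatbot.predict(input="And of Germany?"))
```

Note that `response2model` is keyed by the response text itself, so two chains returning an identical string would collide in the mapping; the `TODO` in `_predict` likewise flags the four sequential `predict` calls as a parallelization opportunity.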
config.py.example
CHANGED
@@ -3,4 +3,4 @@
 # and Access Management (IAM) panel.
 
 MTURK_KEY = ''
-MTURK_SECRET = '
+MTURK_SECRET = ''
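These MTurk credentials are consumed by `boto3` (pinned in `requirements.txt`) during data collection. A hypothetical sketch of building a requester client from them; the region and sandbox endpoint are illustrative assumptions:

```python
import boto3

from config import MTURK_KEY, MTURK_SECRET

# Build an MTurk client from the keys in config.py (sandbox endpoint shown).
mturk = boto3.client(
    "mturk",
    aws_access_key_id=MTURK_KEY,
    aws_secret_access_key=MTURK_SECRET,
    region_name="us-east-1",
    endpoint_url="https://mturk-requester-sandbox.us-east-1.amazonaws.com",
)
print(mturk.get_account_balance()["AvailableBalance"])
```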
prompt_templates/openai_chatgpt.json
ADDED
@@ -0,0 +1,9 @@
+{
+    "input_variables": [
+        "history",
+        "input"
+    ],
+    "output_parser": null,
+    "template": "Assistant is a large language model trained by OpenAI.\n\nAssistant is designed to be able to assist with a wide range of tasks, from answering simple questions to providing in-depth explanations and discussions on a wide range of topics. As a language model, Assistant is able to generate human-like text based on the input it receives, allowing it to engage in natural-sounding conversations and provide responses that are coherent and relevant to the topic at hand.\n\nAssistant is constantly learning and improving, and its capabilities are constantly evolving. It is able to process and understand large amounts of text, and can use this knowledge to provide accurate and informative responses to a wide range of questions. Additionally, Assistant is able to generate its own text based on the input it receives, allowing it to engage in discussions and provide explanations and descriptions on a wide range of topics.\n\nOverall, Assistant is a powerful tool that can help with a wide range of tasks and provide valuable insights and information on a wide range of topics. Whether you need help with a specific question or just want to have a conversation about a particular topic, Assistant is here to assist.\n\n{history}\nHuman: {input}\nAssistant:",
+    "template_format": "f-string"
+}
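This file is a serialized LangChain `PromptTemplate`; `app.py` deserializes it with `load_prompt`. Once loaded, the f-string template can be filled directly, which is an easy way to inspect exactly what the models see (the history/input values below are illustrative):

```python
from langchain.prompts import load_prompt

prompt = load_prompt("prompt_templates/openai_chatgpt.json")

# Interpolate the two declared input variables into the f-string template.
text = prompt.format(
    history="Human: Hi there!\nAssistant: Hello! How can I help?",
    input="What can you do?",
)
print(text)
```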
requirements.txt
CHANGED
@@ -1,5 +1,4 @@
-torch==1.12.0
-transformers==4.20.1
 boto3==1.24.32
 huggingface_hub==0.8.1
 python-dotenv==0.20.0
+langchain==0.0.74