Spaces:

LukeOLuck
/

Flan-T5_LLM_With_OpenOrca

LukeOLuck committed on Feb 13

Commit 23e2f58 • 1 Parent(s): 2f430b3

init commit

Files changed (6)

app.py +88 -0
model.py +21 -0
questions_texts.txt +13 -0
requirements.txt +4 -0
response_texts.txt +14 -0
system_prompts.txt +5 -0

app.py ADDED

	@@ -0,0 +1,88 @@

+import gradio as gr
+import torch
+from model import create_flan_T5_model
+from timeit import default_timer as timer
+from typing import Tuple, Dict
+device = "cuda" if torch.cuda.is_available() else "cpu"
+### Load example texts ###
+questions_texts = []
+with open("questions_texts.txt", "r") as file:
+  questions_texts = [line.strip() for line in file.readlines()]
+system_prompts = []
+with open("system_prompts.txt", "r") as file:
+  system_prompts = [line.strip() for line in file.readlines()]
+response_texts = []
+with open("response_texts.txt", "r") as file:
+  response_texts = [line.strip() for line in file.readlines()]
+### Model and transforms preparation ###
+# Create model and tokenizer
+model, tokenizer = create_flan_T5_model()
+# Load saved weights
+model.load_state_dict(
+    torch.load(f="flan-t5-small.pth",
+               map_location=torch.device("cpu")) # load to CPU
+)
+### Predict function ###
+def predict(selection: str) -> Tuple[Dict, str, float]:
+  start_time = timer()
+  model.eval()
+  # Extract the question part from the selection
+  # Assuming the format "Prompt: {prompt} | Question: {question}"
+  question = selection.split("| Question: ")[1]
+  # Find the index of the question
+  idx = questions_texts.index(question)
+  # Now, use the index to get the system prompt and actual response
+  system_prompt = system_prompts[idx]
+  response = response_texts[idx]
+  # Build the model input from the system prompt and the question
+  input_text = f"context: {system_prompt} question: {question}"
+  model_inputs = tokenizer(input_text, return_tensors="pt", max_length=512, padding='max_length', truncation=True).to(device)
+  with torch.inference_mode():
+    predicted_token_ids = model.generate(input_ids=model_inputs['input_ids'], attention_mask=model_inputs['attention_mask'], max_length=128)
+  result = tokenizer.decode(predicted_token_ids[0], skip_special_tokens=True)
+  end_time = timer()
+  pred_time = round(end_time - start_time, 4)
+  # Return the reference answer as a plain string to match the Textbox output and the type hint
+  return {"Predicted Answer": result}, response, pred_time
+### Gradio app ###
+# Create title, description and article
+title = "Prompt Answering with Google's flan-t5-small"
+description = "A [google/flan-t5-small](https://huggingface.co/google/flan-t5-small)-based model trained on the [Hugging Face 🤗 Open-Orca/OpenOrca](https://huggingface.co/datasets/Open-Orca/OpenOrca) dataset to answer prompts and tasks. [Source Code Found Here](https://colab.research.google.com/drive/1sIScjt_hyNegHC15Y76JVXEOUvdD_2dh?usp=sharing)"
+article = "Built with [Gradio](https://github.com/gradio-app/gradio) and [PyTorch](https://pytorch.org/). [Source Code Found Here](https://colab.research.google.com/drive/1sIScjt_hyNegHC15Y76JVXEOUvdD_2dh?usp=sharing)"
+dropdown_choices = [f"Prompt: {prompt} | Question: {question}" for prompt, question in zip(system_prompts, questions_texts)]
+# Create the Gradio demo
+demo = gr.Interface(fn=predict,
+    inputs=gr.Dropdown(choices=dropdown_choices, label="Select a Question and Prompt"),
+    outputs=[
+        gr.JSON(label="Predicted Answer"),
+        gr.Textbox(label="Actual Answer"),
+        gr.Number(label="Prediction time (s)")
+    ],
+    title=title,
+    description=description,
+    article=article)
+# Launch the demo
+demo.launch()
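
A note on the lookup in `predict`: splitting the selection on `"| Question: "` and then calling `questions_texts.index(...)` depends on the question text never containing that separator and on every question being unique. A minimal sketch of an alternative, assuming the same label format as `dropdown_choices`, maps each dropdown label directly to the row it was built from. The names `build_choice_index` and `lookup`, and the toy data, are hypothetical and not part of this commit.

```python
from typing import Dict, List, Tuple


def build_choice_index(system_prompts: List[str],
                       questions: List[str]) -> Dict[str, int]:
    """Map each dropdown label to the row index it was built from."""
    index: Dict[str, int] = {}
    for i, (prompt, question) in enumerate(zip(system_prompts, questions)):
        label = f"Prompt: {prompt} | Question: {question}"
        # Keep the first occurrence if two rows produce the same label.
        index.setdefault(label, i)
    return index


def lookup(selection: str, index: Dict[str, int],
           system_prompts: List[str], questions: List[str],
           responses: List[str]) -> Tuple[str, str, str]:
    """Return (system_prompt, question, response) for a selected label."""
    i = index[selection]  # raises KeyError for an unknown label
    return system_prompts[i], questions[i], responses[i]


# Toy data, made up for illustration only.
prompts = ["Be brief.", "Explain simply."]
questions = ["What is 2 + 2?", "Why is the sky blue? | Question: really?"]
responses = ["4", "Sunlight scatters in the air."]

choice_index = build_choice_index(prompts, questions)
selected = ("Prompt: Explain simply. | Question: "
            "Why is the sky blue? | Question: really?")
print(lookup(selected, choice_index, prompts, questions, responses))
```

Because the label itself is the dictionary key, a question that happens to contain the separator text still resolves to the right row, and no string splitting is needed inside the prediction function.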

model.py ADDED

	@@ -0,0 +1,21 @@

+import torch
+from torch import nn
+from transformers import AutoTokenizer, T5ForConditionalGeneration
+device = "cuda" if torch.cuda.is_available() else "cpu"
+def create_flan_T5_model(device=device):
+  """Creates a HuggingFace google/flan-t5-small model and its tokenizer.
+  Args:
+    device: Target device for the model ("cuda", "cpu", or a torch.device)
+  Returns:
+    A tuple of the model and tokenizer
+  """
+  tokenizer = AutoTokenizer.from_pretrained('google/flan-t5-small')
+  model = T5ForConditionalGeneration.from_pretrained('google/flan-t5-small').to(device)
+  return model, tokenizer
+# Example usage (runs only when this file is executed directly, not on import)
+if __name__ == "__main__":
+  model, tokenizer = create_flan_T5_model()

questions_texts.txt ADDED

	@@ -0,0 +1,13 @@

+You will be given a definition of a task first, then some input of the task.
+This task is about using the specified sentence and converting the sentence to Resource Description Framework (RDF) triplets of the form (subject, predicate object). The RDF triplets generated must be such that the triplets accurately capture the structure and semantics of the input sentence. The input is a sentence and the output is a list of triplets of the form [subject, predicate, object] that capture the relationships present in the sentence. When a sentence has more than 1 RDF triplet possible, the output must contain all of them.
+AFC Ajax (amateurs)'s ground is Sportpark De Toekomst where Ajax Youth Academy also play.
+Output:
+Generate an approximately fifteen-word sentence that describes all this data: Midsummer House eatType restaurant; Midsummer House food Chinese; Midsummer House priceRange moderate; Midsummer House customer rating 3 out of 5; Midsummer House near All Bar One
+What happens next in this paragraph?
+She then rubs a needle on a cotton ball then pushing it onto a pencil and wrapping thread around it. She then holds up a box of a product and then pouring several liquids into a bowl. she
+Choose your answer from: A. adds saucepan and shakes up the product in a grinder. B. pinches the thread to style a cigarette, and then walks away. C. then dips the needle in ink and using the pencil to draw a design on her leg, rubbing it off with a rag in the end. D. begins to style her hair and cuts it several times before parting the ends of it to show the hairstyle she has created.
+Please answer the following question: I want to test the ability of students to read a passage and answer questions about it. Could you please come up with a good question for the passage "In 1901, the Federation of Australia was the process by which the six separate British self-governing colonies of New South Wales, Queensland, South Australia, Tasmania, Victoria and Western Australia formed one nation. They kept the systems of government that they had developed as separate colonies but also would have a federal government that was responsible for matters concerning the whole nation. When the Constitution of Australia came into force, the colonies collectively became states of the Commonwealth of Australia."?
+Answer:
+James runs a TV show and there are 5 main characters and 4 minor characters. He pays the minor characters $15,000 each episode. He paid the major characters three times as much. How much does he pay per episode? Let's be accurate as possible.

requirements.txt ADDED

	@@ -0,0 +1,4 @@

+torch==2.1.0
+torchvision==0.16.0
+gradio==3.50.2
+transformers==4.35.0

response_texts.txt ADDED

	@@ -0,0 +1,14 @@

+[
+  ["AFC Ajax (amateurs)", "has ground", "Sportpark De Toekomst"],
+  ["Ajax Youth Academy", "plays at", "Sportpark De Toekomst"]
+]
+Midsummer House is a moderately priced Chinese restaurant with a 3/5 customer rating, located near All Bar One.
+C. She then dips the needle in ink and using the pencil to draw a design on her leg, rubbing it off with a rag in the end. In this option, she is continuing the process of using the needle, pencil, and thread, which is most related to what she was doing in the previous sentence.
+Based on the passage, discuss the primary motivations and outcomes of the 1901 Federation of Australia, including the roles and responsibilities of the federal government, as well as the continued governmental structures of the individual states involved.
+James pays the minor characters $15,000 each episode. Since there are 4 minor characters, he pays them a total of 4 * $15,000 = $60,000 per episode.
+The major characters are paid three times as much. So, each major character gets paid 3 * $15,000 = $45,000 per episode.
+There are 5 main characters, so he pays them a total of 5 * $45,000 = $225,000 per episode.
+In total, James pays $225,000 (major characters) + $60,000 (minor characters) = $285,000 per episode.

system_prompts.txt ADDED

	@@ -0,0 +1,5 @@

+You are an AI assistant. You will be given a task. You must generate a detailed and long answer.
+You are a helpful assistant, who always provide explanation. Think like you are answering to a five year old.
+You are an AI assistant. You will be given a task. You must generate a detailed and long answer.
+You are an AI assistant that helps people find information.
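
A note on the three text files: `app.py` reads them one line per example, yet the hunk headers report 13, 14 and 5 added lines, and several examples (the RDF task, the salary calculation) span more than one line, so a per-line index may not line up across files. Below is a minimal sketch of one way to keep multi-line examples together, assuming a separator line of `---` between records. The separator and the `parse_records` helper are hypothetical; the committed files contain no such marker.

```python
from typing import List


def parse_records(text: str, separator: str = "---") -> List[str]:
    """Split text into records delimited by lines equal to `separator`."""
    records: List[str] = []
    current: List[str] = []
    for line in text.splitlines():
        if line.strip() == separator:
            records.append("\n".join(current).strip())
            current = []
        else:
            current.append(line)
    # Flush the last record if the text does not end with a separator.
    if any(part.strip() for part in current):
        records.append("\n".join(current).strip())
    return records


# Made-up sample: one two-line record followed by a one-line record.
sample = "Line one of Q1\nLine two of Q1\n---\nQ2 on a single line\n"
records = parse_records(sample)
print(len(records))
print(records[0])
```

With each file parsed this way, the app could check that all three lists have the same length before building the dropdown, so a mismatch surfaces at startup rather than as a wrong "Actual Answer".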