nmarafo committed on
Commit 99fe6f7
1 Parent(s): 278113f

Update README.md

Files changed (1)
  1. README.md +58 -3
README.md CHANGED
@@ -19,7 +19,7 @@ Determine if the student's answer is correct or not. It only returns True if the
  Add a brief comment explaining why the answer is correct or incorrect.\n\n
  Question: {question}\n
  Expected Answer: {best_answer}\n
- Student Answer: {student_answer}[/INST]</s>"
+ Student Answer: {student_answer}[/INST]"
  ```

@@ -84,8 +84,63 @@ Student Answer: {student_answer}[/INST]</s>"
  Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

  ## How to Get Started with the Model
- 
- Use the code below to get started with the model.
+ In Google Colab:
+ ```
+ !pip install -q -U transformers peft accelerate optimum
+ !pip install datasets==2.15.0
+ !pip install auto-gptq --extra-index-url https://huggingface.github.io/autogptq-index/whl/cu117/
+ 
+ import torch
+ 
+ from peft import AutoPeftModelForCausalLM
+ from rich import print
+ from transformers import AutoTokenizer, GenerationConfig
+ 
+ model_id = "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ"
+ adapter = "nmarafo/Mistral-7B-Instruct-v0.2-TrueFalse-Feedback-GPTQ"
+ 
+ def generate_prompt(data_point):
+     # Build the instruction prompt the adapter was fine-tuned on.
+     system_message = "Analyze the question, the expected answer, and the student's response. Determine if the student's answer is conceptually correct in relation to the expected answer, regardless of the exact wording. An answer will be considered correct if it accurately identifies the key information requested in the question, even if expressed differently. Return True if the student's answer is correct or False otherwise. Add a brief comment explaining the rationale behind the answer being correct or incorrect."
+     question = data_point["question"]
+     best_answer = data_point["best_answer"]
+     student_answer = data_point["student_answer"]
+     return f"{system_message}\n\nQuestion: {question}\nExpected Answer: {best_answer}\nStudent Answer: {student_answer}"
+ 
+ tokenizer = AutoTokenizer.from_pretrained(
+     model_id,
+     trust_remote_code=True,
+     return_token_type_ids=False)
+ tokenizer.pad_token = tokenizer.eos_token
+ 
+ # Load the adapter together with its GPTQ base model from the checkpoint.
+ persisted_model = AutoPeftModelForCausalLM.from_pretrained(
+     adapter,
+     low_cpu_mem_usage=True,
+     return_dict=True,
+     torch_dtype=torch.float16,
+     device_map="cuda")
+ 
+ question = "Name of Canary Island"
+ best_answer = "Tenerife, Fuerteventura, Gran Canaria, Lanzarote, La Palma, La Gomera, El Hierro, La Graciosa"
+ student_answer = "Tenerife"
+ 
+ prompt = generate_prompt({"question": question, "best_answer": best_answer, "student_answer": student_answer})
+ prompt_template = f"<s>[INST] {prompt} [/INST]"
+ 
+ # Generation knobs, collected in a config and passed to generate().
+ generation_config = GenerationConfig(
+     penalty_alpha=0.6,
+     do_sample=True,
+     top_k=5,
+     temperature=0.5,
+     repetition_penalty=1.2,
+     max_new_tokens=512)
+ 
+ input_ids = tokenizer(prompt_template, return_tensors="pt").input_ids.cuda()
+ output = persisted_model.generate(inputs=input_ids, generation_config=generation_config)
+ print(tokenizer.decode(output[0]))
+ ```

  [More Information Needed]
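
For reference, the decoded output repeats the prompt template and then the model's verdict. Below is a minimal sketch of extracting the True/False flag from a completion, assuming the `tokenizer` and `output` objects from the snippet above; the `[/INST]` splitting and the `is_correct` name are illustrative assumptions, not part of the committed README:

```
# Decode the generation, dropping <s>/</s> markers; the [INST] tags survive
# because they are plain text to the Mistral tokenizer.
response = tokenizer.decode(output[0], skip_special_tokens=True)

# The model's answer is whatever follows the closing [/INST] tag.
completion = response.split("[/INST]")[-1].strip()

# The prompt asks the model to lead with "True" or "False" before its comment.
is_correct = completion.lower().startswith("true")
print(is_correct, completion)
```

Splitting on the tag rather than slicing by token count keeps the check independent of the prompt length.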