Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ license: apache-2.0
 
 # Jackalope 7B
 
-We have used the [SlimOrca dataset](https://huggingface.co/datasets/Open-Orca/
+We have used the [SlimOrca dataset](https://huggingface.co/datasets/Open-Orca/SlimOrca), PIPPA, and various other open datasets
 to fine-tune on top of [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1).
 
 This dataset is our attempt to reproduce the dataset generated for Microsoft Research's [Orca Paper](https://arxiv.org/abs/2306.02707).
@@ -67,10 +67,10 @@ which means that lists of messages can be formatted for you with the `apply_chat
 
 ```python
 chat = [
-  {"role": "system", "content": "You are
+  {"role": "system", "content": "You are JackalopeAI, a large language model trained by OpenAccess AI Collective. Write out your reasoning step-by-step to be sure you get the right answers!"},
   {"role": "user", "content": "How are you?"},
   {"role": "assistant", "content": "I am doing well!"},
-  {"role": "user", "content": "Please tell me about
+  {"role": "user", "content": "Please tell me about the mythical creatures called jackalopes."},
 ]
 tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
 ```
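The README snippet in this diff relies on a loaded `tokenizer`, which the diff does not show. As a rough illustration of what a chat template does with the `chat` list, here is a minimal sketch that runs without the model. It assumes a ChatML-style layout with `<|im_start|>` and `<|im_end|>` delimiters; that layout and the helper name `render_chatml` are assumptions for illustration, not taken from this diff, and the template shipped with the tokenizer remains the authoritative format.

```python
# Hypothetical helper (not part of transformers): renders a message list in a
# ChatML-style layout. The <|im_start|>/<|im_end|> delimiters are an assumption;
# the real output of tokenizer.apply_chat_template may differ.
def render_chatml(messages, add_generation_prompt=False):
    parts = []
    for message in messages:
        # One block per message: role header, content, end marker.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave an open assistant turn for the model to complete.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


chat = [
    {"role": "system", "content": "You are JackalopeAI, a large language model trained by OpenAccess AI Collective. Write out your reasoning step-by-step to be sure you get the right answers!"},
    {"role": "user", "content": "How are you?"},
    {"role": "assistant", "content": "I am doing well!"},
    {"role": "user", "content": "Please tell me about the mythical creatures called jackalopes."},
]

prompt = render_chatml(chat, add_generation_prompt=True)
print(prompt)
```

With `add_generation_prompt=True`, the rendered string ends with an open assistant turn, which is what lets the model continue with its own reply rather than another user message.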