New v2 of Phi-2-Orange
Browse files
README.md
CHANGED
@@ -16,24 +16,12 @@ datasets:
|
|
16 |
|
17 |
A two-step finetune of Phi-2, with a bit more zest.
|
18 |
|
19 |
-
|
20 |
-
|
21 |
-
- [Open-Orca/SlimOrca-Dedup](https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup)
|
22 |
-
- [migtissera/Synthia-v1.3](https://huggingface.co/datasets/migtissera/Synthia-v1.3)
|
23 |
-
- [LDJnr/Verified-Camel](https://huggingface.co/datasets/LDJnr/Verified-Camel)
|
24 |
-
- [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove)
|
25 |
-
- [LDJnr/Capybara](https://huggingface.co/datasets/LDJnr/Capybara)
|
26 |
-
- [meta-math/MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA)
|
27 |
-
|
28 |
-
And then a DPO finetune using:
|
29 |
-
|
30 |
-
- [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
|
31 |
-
- [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
|
32 |
-
|
33 |
|
34 |
# Prompt Format
|
35 |
|
36 |
-
Phi-2 Orange uses ChatML as the prompt format, with or without the system instruction.
|
37 |
|
38 |
To prompt with a system instruction (use whatever system prompt you like):
|
39 |
|
|
|
16 |
|
17 |
A two-step finetune of Phi-2, with a bit more zest.
|
18 |
|
19 |
+
This is an improved version of the original [Phi-2-Orange](https://huggingface.co/rhysjones/phi-2-orange) that
|
20 |
+
uses an updated training process on the same datasets.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
21 |
|
22 |
# Prompt Format
|
23 |
|
24 |
+
Phi-2 Orange v2 uses ChatML as the prompt format, with or without the system instruction.
|
25 |
|
26 |
To prompt with a system instruction (use whatever system prompt you like):
|
27 |
|