rhysjones commited on
Commit
27a0225
1 Parent(s): 0b22f76

New v2 of Phi-2-Orange

Browse files
Files changed (1) hide show
  1. README.md +3 -15
README.md CHANGED
@@ -16,24 +16,12 @@ datasets:
16
 
17
  A two-step finetune of Phi-2, with a bit more zest.
18
 
19
- First using a collection of broad training data:
20
-
21
- - [Open-Orca/SlimOrca-Dedup](https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup)
22
- - [migtissera/Synthia-v1.3](https://huggingface.co/datasets/migtissera/Synthia-v1.3)
23
- - [LDJnr/Verified-Camel](https://huggingface.co/datasets/LDJnr/Verified-Camel)
24
- - [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove)
25
- - [LDJnr/Capybara](https://huggingface.co/datasets/LDJnr/Capybara)
26
- - [meta-math/MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA)
27
-
28
- And then a DPO finetune using:
29
-
30
- - [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
31
- - [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
32
-
33
 
34
  # Prompt Format
35
 
36
- Phi-2 Orange uses ChatML as the prompt format, with or without the system instruction.
37
 
38
  To prompt with a system instruction (use whatever system prompt you like):
39
 
 
16
 
17
  A two-step finetune of Phi-2, with a bit more zest.
18
 
19
+ This is an improved version of the original [Phi-2-Orange](https://huggingface.co/rhysjones/phi-2-orange) that
20
+ uses an updated training process on the same datasets.
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
  # Prompt Format
23
 
24
+ Phi-2 Orange v2 uses ChatML as the prompt format, with or without the system instruction.
25
 
26
  To prompt with a system instruction (use whatever system prompt you like):
27