rhysjones
/

phi-2-orange-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

rhysjones commited on Mar 4

Commit

27a0225

•

1 Parent(s): 0b22f76

New v2 of Phi-2-Orange

Files changed (1) hide show

README.md +3 -15

README.md CHANGED Viewed

@@ -16,24 +16,12 @@ datasets:
 A two-step finetune of Phi-2, with a bit more zest.
-First using a collection of broad training data:
-- [Open-Orca/SlimOrca-Dedup](https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup)
-- [migtissera/Synthia-v1.3](https://huggingface.co/datasets/migtissera/Synthia-v1.3)
-- [LDJnr/Verified-Camel](https://huggingface.co/datasets/LDJnr/Verified-Camel)
-- [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove)
-- [LDJnr/Capybara](https://huggingface.co/datasets/LDJnr/Capybara)
-- [meta-math/MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA)
-And then a DPO finetune using:
-- [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
-- [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
 # Prompt Format
-Phi-2 Orange uses ChatML as the prompt format, with or without the system instruction.
 To prompt with a system instruction (use whatever system prompt you like):

 A two-step finetune of Phi-2, with a bit more zest.
+This is an improved version of the original [Phi-2-Orange](https://huggingface.co/rhysjones/phi-2-orange) that
+uses an updated training process on the same datasets.
 # Prompt Format
+Phi-2 Orange v2 uses ChatML as the prompt format, with or without the system instruction.
 To prompt with a system instruction (use whatever system prompt you like):