---
license: mit
datasets:
- Open-Orca/SlimOrca-Dedup
- migtissera/Synthia-v1.3
- LDJnr/Verified-Camel
- LDJnr/Pure-Dove
- LDJnr/Capybara
- meta-math/MetaMathQA
- Intel/orca_dpo_pairs
- argilla/ultrafeedback-binarized-preferences-cleaned
---

![Phi-2 Orange](https://huggingface.co/rhysjones/phi-2-orange-v2/resolve/main/phi-2-orange.jpg)

# Phi-2 Orange Version 2

A two-step finetune of Phi-2, with a bit more zest.

This is an improved version of the original [Phi-2-Orange](https://huggingface.co/rhysjones/phi-2-orange) that
uses an updated training process on the same datasets.

It is also based on the latest updated model from Microsoft's [Phi-2](https://huggingface.co/microsoft/phi-2), making it directly usable
within Hugging Face's Transformers library (without the need for `trust_remote_code`).
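
As a rough illustration of loading the model with the stock Transformers classes, the sketch below may be a useful starting point. The repository ID is an assumption inferred from the image URL on this card, `load_model` is a hypothetical helper name, and `torch_dtype="auto"` is an illustrative choice rather than a recommended setting. A recent Transformers release with built-in Phi support is assumed.

```python
# Minimal loading sketch, not a definitive recipe.
# MODEL_ID is an assumption inferred from the image URL on this card.
MODEL_ID = "rhysjones/phi-2-orange-v2"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and model using stock Transformers classes.

    trust_remote_code is deliberately not passed, so only the modelling
    code shipped with the Transformers library itself is used.
    """
    # Imported lazily so this module can be imported without the heavy dependency.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="auto",  # assumption: use the dtype stored in the checkpoint
    )
    return tokenizer, model
```

Calling `tokenizer, model = load_model()` downloads the weights on first use, so it needs network access and enough memory for the model.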

# Prompt Format

Phi-2 Orange v2 uses ChatML as the prompt format, with or without the system instruction.

To prompt with a system instruction (use whatever system prompt you like):

```
<|im_start|>system
You are a helpful assistant for Python which outputs in Markdown format.<|im_end|>
<|im_start|>user
Write a function to calculate the Fibonacci sequence<|im_end|>
<|im_start|>assistant

```

You can also omit the system prompt if you wish:

```
<|im_start|>user
Why is the sky blue?<|im_end|>
<|im_start|>assistant

```
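
The two layouts above can be assembled programmatically. The following is a minimal sketch for a single-turn prompt; `build_chatml_prompt` is a hypothetical helper name, not part of any library, and it assumes the prompt should end with an open assistant turn as in the examples.

```python
from typing import Optional


def build_chatml_prompt(user_message: str, system_prompt: Optional[str] = None) -> str:
    """Format a single-turn ChatML prompt, leaving the assistant turn open."""
    parts = []
    if system_prompt is not None:
        # The system turn is optional and comes first when present.
        parts.append(f"<|im_start|>system\n{system_prompt}<|im_end|>\n")
    parts.append(f"<|im_start|>user\n{user_message}<|im_end|>\n")
    # End with the assistant header so generation continues as the reply.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


# Without a system prompt
print(build_chatml_prompt("Why is the sky blue?"))

# With a system prompt
print(
    build_chatml_prompt(
        "Write a function to calculate the Fibonacci sequence",
        system_prompt="You are a helpful assistant for Python which outputs in Markdown format.",
    )
)
```

The returned string can be tokenized and passed to the model as-is; generated text would normally be cut at the first `<|im_end|>`.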