Commit
•
cb74054
1
Parent(s):
2390265
Update README.md
Browse files
README.md
CHANGED
@@ -16,11 +16,11 @@ language:
|
|
16 |
---
|
17 |
# mistral-orpo-mix-7k
|
18 |
|
19 |
-
This model is a ORPO fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the argilla/dpo-mix-7k dataset with the [huggingface/alignment-handbook](https://github.com/huggingface/alignment-handbook).
|
20 |
|
21 |
## Training procedure
|
22 |
|
23 |
-
Trained for 4
|
24 |
|
25 |
### Aligment Handbook recipe
|
26 |
|
|
|
16 |
---
|
17 |
# mistral-orpo-mix-7k
|
18 |
|
19 |
+
This model is a ORPO full fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the argilla/dpo-mix-7k dataset with the [huggingface/alignment-handbook](https://github.com/huggingface/alignment-handbook).
|
20 |
|
21 |
## Training procedure
|
22 |
|
23 |
+
Trained for 4.5 hours on 1xA100
|
24 |
|
25 |
### Aligment Handbook recipe
|
26 |
|