Update README.md
README.md CHANGED

```diff
@@ -1,3 +1,7 @@
+---
+license: "apache-2.0"
+---
+
 *This model was trained as part of a series of experiments testing the performance of pure DPO vs SFT vs ORPO, all supported by Unsloth/Huggingface TRL.*
 
 **Benchmarks**
@@ -32,8 +36,4 @@ Prompt Format: ```You are a helpful assistant.<s>[INST] PROMPT [/INST]RESPONSE</
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a5c0e82823ba72ed2cee7d/8DQ0WiypkVIJeK_Y18Wv0.png)
 
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
-
----
-license: "apache 2.0"
----
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
```
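Net effect of the commit: Hugging Face reads model-card metadata only from a YAML frontmatter block at the very start of README.md, so a block at the bottom of the file is ignored, and `apache 2.0` (with a space) is not a recognized license identifier. After this change the file opens with:

```yaml
---
license: "apache-2.0"
---
```

The quotes are optional in YAML; `license: apache-2.0` is equivalent.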