Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ datasets:
|
|
11 |
- hromi/winograd_dpo_basic
|
12 |
---
|
13 |
|
14 |
-
![](https://wizzion.com/garrulus.jpg)
|
15 |
|
16 |
# UDKai_Garrulus
|
17 |
|
@@ -49,10 +49,16 @@ But before writing a paper with title "DPO-Contamination with Winogrande increas
|
|
49 |
* max_length=1536
|
50 |
|
51 |
## UDK.ai
|
52 |
-
This is the result of the first LLM-optimization experiment running on a hardware of Berlin University of the Arts.
|
|
|
|
|
|
|
|
|
53 |
|
54 |
# Garrulus
|
55 |
Originally I planned to call the model "ContaminatedWine" but then I had a nice winter encounter with a very convivial eurasian jay (Garrulus Glandarius in latin), hence the name.
|
56 |
|
57 |
# Thanks
|
58 |
-
Thanks to mlabonne and Cultrix for demonstrating that DPO is not 'rocket science' but within reach of anyone with an idea, a dataset and a GPU
|
|
|
|
|
|
11 |
- hromi/winograd_dpo_basic
|
12 |
---
|
13 |
|
14 |
+
![](https://wizzion.com/garrulus.jpg =300x300)
|
15 |
|
16 |
# UDKai_Garrulus
|
17 |
|
|
|
49 |
* max_length=1536
|
50 |
|
51 |
## UDK.ai
|
52 |
+
This is the result of the first LLM-optimization experiment running on a hardware of Berlin University of the Arts (UDK-berlin).
|
53 |
+
|
54 |
+
DPO took few minutes on a A40.
|
55 |
+
|
56 |
+
Check [udk.ai](https://udk.ai) from time to time, we plan to make some noise.
|
57 |
|
58 |
# Garrulus
|
59 |
Originally I planned to call the model "ContaminatedWine" but then I had a nice winter encounter with a very convivial eurasian jay (Garrulus Glandarius in latin), hence the name.
|
60 |
|
61 |
# Thanks
|
62 |
+
Thanks to mlabonne and Cultrix for demonstrating that DPO is not 'rocket science' but within reach of anyone with an idea, a dataset and a GPU.
|
63 |
+
|
64 |
+
And thanks to [unslothai](https://github.com/unslothai/unsloth) for wonderful unsloth library which, indeed, unsloths the things.
|