Add citation
Browse files
README.md
CHANGED
|
@@ -27,6 +27,39 @@ A replication attempt of Tulu 3 on the Qwen 2.5 base models.
|
|
| 27 |
Big thanks to Retis Labs for being providing my 8xH100 polycule used to train and test this model!
|
| 28 |
Another big thanks to AllenAI for publishing the Tülu 3 data and model series (as well as the paper and details on training), as well as Alibaba for training the original Qwen 2.5 base model series!
|
| 29 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 30 |
## Training procedure
|
| 31 |
|
| 32 |
[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
|
|
|
|
| 27 |
Big thanks to Retis Labs for being providing my 8xH100 polycule used to train and test this model!
|
| 28 |
Another big thanks to AllenAI for publishing the Tülu 3 data and model series (as well as the paper and details on training), as well as Alibaba for training the original Qwen 2.5 base model series!
|
| 29 |
|
| 30 |
+
```
|
| 31 |
+
@article{lambert2024tulu3,
|
| 32 |
+
title = {Tülu 3: Pushing Frontiers in Open Language Model Post-Training},
|
| 33 |
+
author = {
|
| 34 |
+
Nathan Lambert and
|
| 35 |
+
Jacob Morrison and
|
| 36 |
+
Valentina Pyatkin and
|
| 37 |
+
Shengyi Huang and
|
| 38 |
+
Hamish Ivison and
|
| 39 |
+
Faeze Brahman and
|
| 40 |
+
Lester James V. Miranda and
|
| 41 |
+
Alisa Liu and
|
| 42 |
+
Nouha Dziri and
|
| 43 |
+
Shane Lyu and
|
| 44 |
+
Yuling Gu and
|
| 45 |
+
Saumya Malik and
|
| 46 |
+
Victoria Graf and
|
| 47 |
+
Jena D. Hwang and
|
| 48 |
+
Jiangjiang Yang and
|
| 49 |
+
Ronan Le Bras and
|
| 50 |
+
Oyvind Tafjord and
|
| 51 |
+
Chris Wilhelm and
|
| 52 |
+
Luca Soldaini and
|
| 53 |
+
Noah A. Smith and
|
| 54 |
+
Yizhong Wang and
|
| 55 |
+
Pradeep Dasigi and
|
| 56 |
+
Hannaneh Hajishirzi
|
| 57 |
+
},
|
| 58 |
+
year = {2024},
|
| 59 |
+
email = {tulu@allenai.org}
|
| 60 |
+
}
|
| 61 |
+
```
|
| 62 |
+
|
| 63 |
## Training procedure
|
| 64 |
|
| 65 |
[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
|