Update README.md
README.md
CHANGED
@@ -241,3 +241,30 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.2+cu121
 - Datasets 2.14.6
 - Tokenizers 0.15.1
+
+## Citation Information
+
+If you find this model useful in your work, please consider citing the Zephyr technical report:
+
+```
+@misc{tunstall2023zephyr,
+  title={Zephyr: Direct Distillation of LM Alignment},
+  author={Lewis Tunstall and Edward Beeching and Nathan Lambert and Nazneen Rajani and Kashif Rasul and Younes Belkada and Shengyi Huang and Leandro von Werra and Clémentine Fourrier and Nathan Habib and Nathan Sarrazin and Omar Sanseviero and Alexander M. Rush and Thomas Wolf},
+  year={2023},
+  eprint={2310.16944},
+  archivePrefix={arXiv},
+  primaryClass={cs.LG}
+}
+```
+You may also wish to cite the creators of this model:
+
+```
+@misc{zephyr_7b_gemma,
+  author = {Lewis Tunstall and Philipp Schmid},
+  title = {Zephyr 7B Gemma},
+  year = {2024},
+  publisher = {Hugging Face},
+  journal = {Hugging Face repository},
+  howpublished = {\url{https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma-v0.1}}
+}
+```