File size: 236 Bytes
42855f8
 
 
bbeb07f
1667ad5
bbeb07f
1
2
3
4
5
6
---
license: apache-2.0
---
We train OPT-1.3B using three datasets: Dahoas/rm-static, Dahoas/full-hh-rlhf, and yitingxie/rlhf-reward-datasets. 

Dahoas/synthetic-instruct-gptj-pairwise is not used because of the adsence of test dataset.