Edit model card

train/rewards train/logits train/logps train

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Dataset used to train PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA

Collection including PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA