Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
Model save
2d9d8e6 verified
217 Bytes
{
    "epoch": 1.0,
    "total_flos": 0.0,
    "train_loss": 4999.9289533089905,
    "train_runtime": 2038.5346,
    "train_samples": 17626,
    "train_samples_per_second": 8.646,
    "train_steps_per_second": 0.135
}
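
The throughput fields above can be cross-checked against the runtime and sample count. The following is a minimal sketch, not part of the repository: the `metrics` dict simply copies the values shown, and `derive_throughput` is a hypothetical helper name. The step count and effective batch size it prints are estimates inferred from rounded figures, assuming the run covered the dataset once (`epoch` is 1.0).

```python
# Values copied from the JSON above; nothing here is read from disk.
metrics = {
    "epoch": 1.0,
    "total_flos": 0.0,
    "train_loss": 4999.9289533089905,
    "train_runtime": 2038.5346,
    "train_samples": 17626,
    "train_samples_per_second": 8.646,
    "train_steps_per_second": 0.135,
}


def derive_throughput(m):
    """Recompute throughput figures from runtime and sample count.

    Hypothetical helper for illustration. The step count is only an
    estimate because train_steps_per_second is rounded to 3 decimals.
    """
    samples_per_second = m["train_samples"] * m["epoch"] / m["train_runtime"]
    estimated_steps = round(m["train_steps_per_second"] * m["train_runtime"])
    # Samples consumed per optimizer step (all devices, with accumulation).
    estimated_batch = m["train_samples"] * m["epoch"] / estimated_steps
    return samples_per_second, estimated_steps, estimated_batch


samples_per_second, estimated_steps, estimated_batch = derive_throughput(metrics)
print(f"samples/s (recomputed): {samples_per_second:.3f}")
print(f"optimizer steps (estimated): {estimated_steps}")
print(f"effective batch size (estimated): {estimated_batch:.1f}")
```

If the recomputed samples-per-second figure agrees with the logged one, the runtime and sample count are at least internally consistent. The batch-size estimate is only as precise as the rounded steps-per-second value allows.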