zephyr-7b-dpo-qlora-pairrm / train_results.json
shenxq's picture
Model save
41409cb verified
raw
history blame
No virus
195 Bytes
{
"epoch": 1.0,
"train_loss": 0.6475976420174225,
"train_runtime": 42677.4758,
"train_samples": 19996,
"train_samples_per_second": 0.469,
"train_steps_per_second": 0.029
}