zephyr-7b-dpo-qlora / train_results.json
{
    "epoch": 1.0,
    "train_loss": 0.6857879155593393,
    "train_runtime": 5551.1855,
    "train_samples": 12227,
    "train_samples_per_second": 2.203,
    "train_steps_per_second": 0.069
}
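
The throughput fields above can be cross-checked against each other. The following is a minimal sketch, assuming `train_runtime` is in seconds and that the two `*_per_second` fields are plain ratios over that runtime for the single epoch reported; the helper names are hypothetical and not part of the repository. Because `train_steps_per_second` is rounded to three decimals, the step count and samples-per-step figures it yields are only approximate.

```python
import json

# Metrics copied verbatim from train_results.json.
results = json.loads("""
{
    "epoch": 1.0,
    "train_loss": 0.6857879155593393,
    "train_runtime": 5551.1855,
    "train_samples": 12227,
    "train_samples_per_second": 2.203,
    "train_steps_per_second": 0.069
}
""")


def samples_per_second(r):
    # Assumption: throughput = samples seen in the epoch / runtime in seconds.
    return r["train_samples"] / r["train_runtime"]


def approx_steps(r):
    # Approximate only: steps_per_second is rounded to three decimals.
    return r["train_steps_per_second"] * r["train_runtime"]


def approx_samples_per_step(r):
    # Rough effective batch size implied by the two reported rates.
    return r["train_samples_per_second"] / r["train_steps_per_second"]


if __name__ == "__main__":
    print(f"recomputed samples/s: {samples_per_second(results):.3f}")
    print(f"approx. optimizer steps: {approx_steps(results):.0f}")
    print(f"approx. samples per step: {approx_samples_per_step(results):.1f}")
```

Under these assumptions the recomputed samples-per-second figure agrees with the reported 2.203, and the two rates together suggest roughly 32 samples per optimizer step, though the file itself does not state the batch size.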