phi-2-gpo-ultrafeedback-lora / train_results.json
lole25's picture
Model save
3a19881 verified
{
"epoch": 2.0,
"train_loss": 2.2877212975025803e-05,
"train_runtime": 855.8096,
"train_samples": 30567,
"train_samples_per_second": 71.434,
"train_steps_per_second": 1.115
}