ALM-AHME's picture
End of training
63db5e6
{
"epoch": 12.0,
"total_flos": 1.2795133907158622e+19,
"train_loss": 0.7126436810024449,
"train_runtime": 8746.6735,
"train_samples_per_second": 5.338,
"train_steps_per_second": 0.167
}