mansee's picture
End of training
b110cba
raw
history blame
210 Bytes
{
"epoch": 9.98,
"total_flos": 1.0051627680166625e+19,
"train_loss": 0.5403492726857149,
"train_runtime": 6645.2491,
"train_samples_per_second": 60.946,
"train_steps_per_second": 0.476
}