BertjeWDialDataALLQonly / train_results.json
Jeska's picture
End of training
7473103
{
"epoch": 15.0,
"train_loss": 1.9284902631757455,
"train_runtime": 49525.3053,
"train_samples": 55736,
"train_samples_per_second": 16.881,
"train_steps_per_second": 0.264
}