flan-t5-definition-en-large / train_results.json
ltgoslo's picture
Large model
e0e38cf
{
"epoch": 15.0,
"train_loss": 1.5328590292826185,
"train_runtime": 18417.6233,
"train_samples": 175332,
"train_samples_per_second": 142.797,
"train_steps_per_second": 2.232
}