t5_large_baseline / train_results.json
TheLongSentance
add model
bddd597
{
    "epoch": 3.0,
    "train_loss": 0.13440267986721463,
    "train_runtime": 578.2653,
    "train_samples": 600,
    "train_samples_per_second": 3.113,
    "train_steps_per_second": 0.778
}