gpt_16_5_3e-5_lp5_nb5 / train_results.txt
54data's picture
End of training
92f6ee3
epoch = 5.0
train_loss = 2.77278194830857
train_runtime = 10839.8418
train_samples = 42367
train_samples_per_second = 19.542
train_steps_per_second = 1.221