vit-lr-cosine-warmup / train_results.json
{
    "epoch": 13.0,
    "total_flos": 5.166157498470679e+18,
    "train_loss": 0.1749291451406399,
    "train_runtime": 1863.5469,
    "train_samples_per_second": 275.174,
    "train_steps_per_second": 17.225
}
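
The throughput fields can be combined to recover approximate totals for the run. The sketch below is illustrative only: it assumes `train_runtime` is in seconds and that the two per-second rates are the total samples and total optimizer steps divided by that runtime, which is the usual convention for files of this shape but is not stated in the file itself. The helper name `derive_totals` is hypothetical.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
    "epoch": 13.0,
    "total_flos": 5.166157498470679e+18,
    "train_loss": 0.1749291451406399,
    "train_runtime": 1863.5469,
    "train_samples_per_second": 275.174,
    "train_steps_per_second": 17.225
}
"""


def derive_totals(metrics: dict) -> dict:
    """Estimate run totals from the reported rates.

    Assumption: each rate equals (total count) / train_runtime, with
    train_runtime measured in seconds. The results are estimates because
    the reported rates are rounded.
    """
    runtime = metrics["train_runtime"]
    total_samples = metrics["train_samples_per_second"] * runtime
    total_steps = metrics["train_steps_per_second"] * runtime
    return {
        "approx_total_samples": total_samples,
        "approx_total_steps": total_steps,
        # Average samples consumed per optimizer step.
        "approx_samples_per_step": total_samples / total_steps,
        "approx_samples_per_epoch": total_samples / metrics["epoch"],
        "approx_flos_per_sample": metrics["total_flos"] / total_samples,
    }


metrics = json.loads(RAW)
derived = derive_totals(metrics)

for name, value in derived.items():
    print(f"{name}: {value:,.3f}")
```

Under those assumptions, the samples-per-step figure approximates the effective batch size of the run, and the per-epoch figure approximates the size of the training set.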