JackWong0911 committed on
Commit
df00ca1
1 Parent(s): edfcd5d

End of training

all_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+   "epoch": 5.17,
+   "eval_accuracy": 0.9411764705882353,
+   "eval_loss": 0.3731352388858795,
+   "eval_runtime": 54.8788,
+   "eval_samples_per_second": 1.239,
+   "eval_steps_per_second": 1.239
+ }
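The metrics in all_results.json can be cross-checked against each other. The sketch below is only an illustration, not part of the commit: it estimates how many samples were evaluated from the runtime and throughput, and recovers the accuracy as a fraction. Reading equal samples-per-second and steps-per-second values as an evaluation batch size of 1 is an assumption, not something the commit states.

```python
import json
from fractions import Fraction

# Metrics copied from all_results.json in this commit.
results = json.loads("""
{
  "epoch": 5.17,
  "eval_accuracy": 0.9411764705882353,
  "eval_loss": 0.3731352388858795,
  "eval_runtime": 54.8788,
  "eval_samples_per_second": 1.239,
  "eval_steps_per_second": 1.239
}
""")

# Approximate number of evaluated samples (the logged rate is rounded).
n_samples = round(results["eval_runtime"] * results["eval_samples_per_second"])

# Samples per evaluation step; 1.0 would be consistent with batch size 1.
samples_per_step = (
    results["eval_samples_per_second"] / results["eval_steps_per_second"]
)

# Recover the accuracy as a small fraction and as a count of correct samples.
acc = Fraction(results["eval_accuracy"]).limit_denominator(n_samples)
n_correct = round(results["eval_accuracy"] * n_samples)

print(n_samples, samples_per_step, acc, n_correct)
```

With these values the estimate comes out at about 68 evaluated samples, 64 of them classified correctly (16/17).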
runs/Apr05_15-23-59_1f2ef3b3bb65/events.out.tfevents.1712331524.1f2ef3b3bb65.1323.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7fdd32f84c93d93c22efc24cd3b2bf8529a6fac9df08ff9f662d0de90bf17435
- size 7415
+ oid sha256:86e067999e5faaaa55ef2e2d30ca6b2fe3718ca2e09e9fbaf62b5a05648af090
+ size 14742
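The diff above changes only a Git LFS pointer, not the TensorBoard event file itself. As a minimal sketch, the pointer's key/value lines can be parsed to compare the old and new object. The helper name `parse_lfs_pointer` is hypothetical and not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }

# Pointer contents before and after this commit, copied from the diff.
old = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:7fdd32f84c93d93c22efc24cd3b2bf8529a6fac9df08ff9f662d0de90bf17435
size 7415
""")
new = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:86e067999e5faaaa55ef2e2d30ca6b2fe3718ca2e09e9fbaf62b5a05648af090
size 14742
""")

# How much the event log grew between the two versions.
growth = new["size"] - old["size"]
print(growth)
```

The event file grew from 7415 to 14742 bytes, which fits a log that had further events appended to it.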
test_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+   "epoch": 5.17,
+   "eval_accuracy": 0.9411764705882353,
+   "eval_loss": 0.3731352388858795,
+   "eval_runtime": 54.8788,
+   "eval_samples_per_second": 1.239,
+   "eval_steps_per_second": 1.239
+ }
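The `learning_rate` values logged in trainer_state.json in this commit appear to follow a linear warmup and then a linear decay. The sketch below checks that reading against a few logged values. The 30-step warmup is an assumption inferred from where the peak value occurs, and the total of 300 steps is taken from the `global_step` field. The commit does not state the scheduler configuration.

```python
import math

PEAK_LR = 5e-5       # highest learning_rate in the log (step 30)
WARMUP_STEPS = 30    # assumption: inferred from the step at which the peak occurs
TOTAL_STEPS = 300    # "global_step" reported in trainer_state.json

def linear_schedule(step: int) -> float:
    """Linear warmup to PEAK_LR, then linear decay to zero at TOTAL_STEPS."""
    if step < WARMUP_STEPS:
        return PEAK_LR * step / WARMUP_STEPS
    return PEAK_LR * (TOTAL_STEPS - step) / (TOTAL_STEPS - WARMUP_STEPS)

# (step, learning_rate) pairs copied from log_history.
logged = {
    10: 1.6666666666666667e-05,
    20: 3.3333333333333335e-05,
    30: 5e-05,
    40: 4.814814814814815e-05,
    50: 4.62962962962963e-05,
    110: 3.518518518518519e-05,
}

# True if every logged value agrees with the assumed schedule.
matches = all(
    math.isclose(linear_schedule(step), lr, rel_tol=1e-6)
    for step, lr in logged.items()
)
print(matches)
```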
trainer_state.json ADDED
@@ -0,0 +1,4652 @@
1
+ {
2
+ "best_metric": 0.9305555555555556,
3
+ "best_model_checkpoint": "timesformer-base-finetuned-k400-finetuned-kinetic400-subset-epoch6-num_frame_10_myViT/checkpoint-200",
4
+ "epoch": 5.166666666666667,
5
+ "eval_steps": 500,
6
+ "global_step": 300,
7
+ "is_hyper_param_search": false,
8
+ "is_local_process_zero": true,
9
+ "is_world_process_zero": true,
10
+ "log_history": [
11
+ {
12
+ "accuracy": 0.0,
13
+ "epoch": 0,
14
+ "step": 0
15
+ },
16
+ {
17
+ "accuracy": 1.0,
18
+ "epoch": 0.0,
19
+ "step": 1
20
+ },
21
+ {
22
+ "accuracy": 0.0,
23
+ "epoch": 0.01,
24
+ "step": 2
25
+ },
26
+ {
27
+ "accuracy": 0.0,
28
+ "epoch": 0.01,
29
+ "step": 3
30
+ },
31
+ {
32
+ "accuracy": 0.0,
33
+ "epoch": 0.01,
34
+ "step": 4
35
+ },
36
+ {
37
+ "accuracy": 0.0,
38
+ "epoch": 0.02,
39
+ "step": 5
40
+ },
41
+ {
42
+ "accuracy": 0.0,
43
+ "epoch": 0.02,
44
+ "step": 6
45
+ },
46
+ {
47
+ "accuracy": 0.0,
48
+ "epoch": 0.02,
49
+ "step": 7
50
+ },
51
+ {
52
+ "accuracy": 0.0,
53
+ "epoch": 0.03,
54
+ "step": 8
55
+ },
56
+ {
57
+ "accuracy": 1.0,
58
+ "epoch": 0.03,
59
+ "step": 9
60
+ },
61
+ {
62
+ "epoch": 0.03,
63
+ "grad_norm": 29.237531661987305,
64
+ "learning_rate": 1.6666666666666667e-05,
65
+ "loss": 1.5579,
66
+ "step": 10
67
+ },
68
+ {
69
+ "accuracy": 0.0,
70
+ "epoch": 0.03,
71
+ "step": 10
72
+ },
73
+ {
74
+ "accuracy": 1.0,
75
+ "epoch": 0.04,
76
+ "step": 11
77
+ },
78
+ {
79
+ "accuracy": 1.0,
80
+ "epoch": 0.04,
81
+ "step": 12
82
+ },
83
+ {
84
+ "accuracy": 0.0,
85
+ "epoch": 0.04,
86
+ "step": 13
87
+ },
88
+ {
89
+ "accuracy": 1.0,
90
+ "epoch": 0.05,
91
+ "step": 14
92
+ },
93
+ {
94
+ "accuracy": 0.0,
95
+ "epoch": 0.05,
96
+ "step": 15
97
+ },
98
+ {
99
+ "accuracy": 0.0,
100
+ "epoch": 0.05,
101
+ "step": 16
102
+ },
103
+ {
104
+ "accuracy": 1.0,
105
+ "epoch": 0.06,
106
+ "step": 17
107
+ },
108
+ {
109
+ "accuracy": 1.0,
110
+ "epoch": 0.06,
111
+ "step": 18
112
+ },
113
+ {
114
+ "accuracy": 1.0,
115
+ "epoch": 0.06,
116
+ "step": 19
117
+ },
118
+ {
119
+ "epoch": 0.07,
120
+ "grad_norm": 24.122037887573242,
121
+ "learning_rate": 3.3333333333333335e-05,
122
+ "loss": 1.4141,
123
+ "step": 20
124
+ },
125
+ {
126
+ "accuracy": 1.0,
127
+ "epoch": 0.07,
128
+ "step": 20
129
+ },
130
+ {
131
+ "accuracy": 1.0,
132
+ "epoch": 0.07,
133
+ "step": 21
134
+ },
135
+ {
136
+ "accuracy": 0.0,
137
+ "epoch": 0.07,
138
+ "step": 22
139
+ },
140
+ {
141
+ "accuracy": 0.0,
142
+ "epoch": 0.08,
143
+ "step": 23
144
+ },
145
+ {
146
+ "accuracy": 1.0,
147
+ "epoch": 0.08,
148
+ "step": 24
149
+ },
150
+ {
151
+ "accuracy": 1.0,
152
+ "epoch": 0.08,
153
+ "step": 25
154
+ },
155
+ {
156
+ "accuracy": 0.0,
157
+ "epoch": 0.09,
158
+ "step": 26
159
+ },
160
+ {
161
+ "accuracy": 1.0,
162
+ "epoch": 0.09,
163
+ "step": 27
164
+ },
165
+ {
166
+ "accuracy": 1.0,
167
+ "epoch": 0.09,
168
+ "step": 28
169
+ },
170
+ {
171
+ "accuracy": 1.0,
172
+ "epoch": 0.1,
173
+ "step": 29
174
+ },
175
+ {
176
+ "epoch": 0.1,
177
+ "grad_norm": 14.94096565246582,
178
+ "learning_rate": 5e-05,
179
+ "loss": 0.9652,
180
+ "step": 30
181
+ },
182
+ {
183
+ "accuracy": 0.0,
184
+ "epoch": 0.1,
185
+ "step": 30
186
+ },
187
+ {
188
+ "accuracy": 0.0,
189
+ "epoch": 0.1,
190
+ "step": 31
191
+ },
192
+ {
193
+ "accuracy": 1.0,
194
+ "epoch": 0.11,
195
+ "step": 32
196
+ },
197
+ {
198
+ "accuracy": 1.0,
199
+ "epoch": 0.11,
200
+ "step": 33
201
+ },
202
+ {
203
+ "accuracy": 1.0,
204
+ "epoch": 0.11,
205
+ "step": 34
206
+ },
207
+ {
208
+ "accuracy": 1.0,
209
+ "epoch": 0.12,
210
+ "step": 35
211
+ },
212
+ {
213
+ "accuracy": 1.0,
214
+ "epoch": 0.12,
215
+ "step": 36
216
+ },
217
+ {
218
+ "accuracy": 1.0,
219
+ "epoch": 0.12,
220
+ "step": 37
221
+ },
222
+ {
223
+ "accuracy": 1.0,
224
+ "epoch": 0.13,
225
+ "step": 38
226
+ },
227
+ {
228
+ "accuracy": 1.0,
229
+ "epoch": 0.13,
230
+ "step": 39
231
+ },
232
+ {
233
+ "epoch": 0.13,
234
+ "grad_norm": 12.056232452392578,
235
+ "learning_rate": 4.814814814814815e-05,
236
+ "loss": 0.6132,
237
+ "step": 40
238
+ },
239
+ {
240
+ "accuracy": 1.0,
241
+ "epoch": 0.13,
242
+ "step": 40
243
+ },
244
+ {
245
+ "accuracy": 1.0,
246
+ "epoch": 0.14,
247
+ "step": 41
248
+ },
249
+ {
250
+ "accuracy": 1.0,
251
+ "epoch": 0.14,
252
+ "step": 42
253
+ },
254
+ {
255
+ "accuracy": 1.0,
256
+ "epoch": 0.14,
257
+ "step": 43
258
+ },
259
+ {
260
+ "accuracy": 1.0,
261
+ "epoch": 0.15,
262
+ "step": 44
263
+ },
264
+ {
265
+ "accuracy": 1.0,
266
+ "epoch": 0.15,
267
+ "step": 45
268
+ },
269
+ {
270
+ "accuracy": 1.0,
271
+ "epoch": 0.15,
272
+ "step": 46
273
+ },
274
+ {
275
+ "accuracy": 1.0,
276
+ "epoch": 0.16,
277
+ "step": 47
278
+ },
279
+ {
280
+ "accuracy": 1.0,
281
+ "epoch": 0.16,
282
+ "step": 48
283
+ },
284
+ {
285
+ "accuracy": 1.0,
286
+ "epoch": 0.16,
287
+ "step": 49
288
+ },
289
+ {
290
+ "epoch": 0.17,
291
+ "grad_norm": 12.105801582336426,
292
+ "learning_rate": 4.62962962962963e-05,
293
+ "loss": 0.1507,
294
+ "step": 50
295
+ },
296
+ {
297
+ "accuracy": 0.0,
298
+ "epoch": 0.17,
299
+ "step": 50
300
+ },
301
+ {
302
+ "accuracy": 0.0,
303
+ "epoch": 0.17,
304
+ "step": 50
305
+ },
306
+ {
307
+ "accuracy": 0.0,
308
+ "epoch": 0.17,
309
+ "step": 50
310
+ },
311
+ {
312
+ "accuracy": 1.0,
313
+ "epoch": 0.17,
314
+ "step": 50
315
+ },
316
+ {
317
+ "accuracy": 1.0,
318
+ "epoch": 0.17,
319
+ "step": 50
320
+ },
321
+ {
322
+ "accuracy": 1.0,
323
+ "epoch": 0.17,
324
+ "step": 50
325
+ },
326
+ {
327
+ "accuracy": 0.0,
328
+ "epoch": 0.17,
329
+ "step": 50
330
+ },
331
+ {
332
+ "accuracy": 0.0,
333
+ "epoch": 0.17,
334
+ "step": 50
335
+ },
336
+ {
337
+ "accuracy": 0.0,
338
+ "epoch": 0.17,
339
+ "step": 50
340
+ },
341
+ {
342
+ "accuracy": 1.0,
343
+ "epoch": 0.17,
344
+ "step": 50
345
+ },
346
+ {
347
+ "accuracy": 1.0,
348
+ "epoch": 0.17,
349
+ "step": 50
350
+ },
351
+ {
352
+ "accuracy": 1.0,
353
+ "epoch": 0.17,
354
+ "step": 50
355
+ },
356
+ {
357
+ "accuracy": 1.0,
358
+ "epoch": 0.17,
359
+ "step": 50
360
+ },
361
+ {
362
+ "accuracy": 1.0,
363
+ "epoch": 0.17,
364
+ "step": 50
365
+ },
366
+ {
367
+ "accuracy": 1.0,
368
+ "epoch": 0.17,
369
+ "step": 50
370
+ },
371
+ {
372
+ "accuracy": 1.0,
373
+ "epoch": 0.17,
374
+ "step": 50
375
+ },
376
+ {
377
+ "accuracy": 1.0,
378
+ "epoch": 0.17,
379
+ "step": 50
380
+ },
381
+ {
382
+ "accuracy": 1.0,
383
+ "epoch": 0.17,
384
+ "step": 50
385
+ },
386
+ {
387
+ "accuracy": 1.0,
388
+ "epoch": 0.17,
389
+ "step": 50
390
+ },
391
+ {
392
+ "accuracy": 1.0,
393
+ "epoch": 0.17,
394
+ "step": 50
395
+ },
396
+ {
397
+ "accuracy": 1.0,
398
+ "epoch": 0.17,
399
+ "step": 50
400
+ },
401
+ {
402
+ "accuracy": 1.0,
403
+ "epoch": 0.17,
404
+ "step": 50
405
+ },
406
+ {
407
+ "accuracy": 1.0,
408
+ "epoch": 0.17,
409
+ "step": 50
410
+ },
411
+ {
412
+ "accuracy": 1.0,
413
+ "epoch": 0.17,
414
+ "step": 50
415
+ },
416
+ {
417
+ "accuracy": 1.0,
418
+ "epoch": 0.17,
419
+ "step": 50
420
+ },
421
+ {
422
+ "accuracy": 1.0,
423
+ "epoch": 0.17,
424
+ "step": 50
425
+ },
426
+ {
427
+ "accuracy": 1.0,
428
+ "epoch": 0.17,
429
+ "step": 50
430
+ },
431
+ {
432
+ "accuracy": 1.0,
433
+ "epoch": 0.17,
434
+ "step": 50
435
+ },
436
+ {
437
+ "accuracy": 1.0,
438
+ "epoch": 0.17,
439
+ "step": 50
440
+ },
441
+ {
442
+ "accuracy": 1.0,
443
+ "epoch": 0.17,
444
+ "step": 50
445
+ },
446
+ {
447
+ "accuracy": 1.0,
448
+ "epoch": 0.17,
449
+ "step": 50
450
+ },
451
+ {
452
+ "accuracy": 1.0,
453
+ "epoch": 0.17,
454
+ "step": 50
455
+ },
456
+ {
457
+ "accuracy": 0.0,
458
+ "epoch": 0.17,
459
+ "step": 50
460
+ },
461
+ {
462
+ "accuracy": 1.0,
463
+ "epoch": 0.17,
464
+ "step": 50
465
+ },
466
+ {
467
+ "accuracy": 1.0,
468
+ "epoch": 0.17,
469
+ "step": 50
470
+ },
471
+ {
472
+ "accuracy": 1.0,
473
+ "epoch": 0.17,
474
+ "step": 50
475
+ },
476
+ {
477
+ "accuracy": 1.0,
478
+ "epoch": 0.17,
479
+ "step": 50
480
+ },
481
+ {
482
+ "accuracy": 1.0,
483
+ "epoch": 0.17,
484
+ "step": 50
485
+ },
486
+ {
487
+ "accuracy": 1.0,
488
+ "epoch": 0.17,
489
+ "step": 50
490
+ },
491
+ {
492
+ "accuracy": 1.0,
493
+ "epoch": 0.17,
494
+ "step": 50
495
+ },
496
+ {
497
+ "accuracy": 1.0,
498
+ "epoch": 0.17,
499
+ "step": 50
500
+ },
501
+ {
502
+ "accuracy": 1.0,
503
+ "epoch": 0.17,
504
+ "step": 50
505
+ },
506
+ {
507
+ "accuracy": 1.0,
508
+ "epoch": 0.17,
509
+ "step": 50
510
+ },
511
+ {
512
+ "accuracy": 1.0,
513
+ "epoch": 0.17,
514
+ "step": 50
515
+ },
516
+ {
517
+ "accuracy": 1.0,
518
+ "epoch": 0.17,
519
+ "step": 50
520
+ },
521
+ {
522
+ "accuracy": 1.0,
523
+ "epoch": 0.17,
524
+ "step": 50
525
+ },
526
+ {
527
+ "accuracy": 1.0,
528
+ "epoch": 0.17,
529
+ "step": 50
530
+ },
531
+ {
532
+ "accuracy": 1.0,
533
+ "epoch": 0.17,
534
+ "step": 50
535
+ },
536
+ {
537
+ "accuracy": 1.0,
538
+ "epoch": 0.17,
539
+ "step": 50
540
+ },
541
+ {
542
+ "accuracy": 1.0,
543
+ "epoch": 0.17,
544
+ "step": 50
545
+ },
546
+ {
547
+ "accuracy": 1.0,
548
+ "epoch": 0.17,
549
+ "step": 50
550
+ },
551
+ {
552
+ "accuracy": 1.0,
553
+ "epoch": 0.17,
554
+ "step": 50
555
+ },
556
+ {
557
+ "accuracy": 1.0,
558
+ "epoch": 0.17,
559
+ "step": 50
560
+ },
561
+ {
562
+ "accuracy": 1.0,
563
+ "epoch": 0.17,
564
+ "step": 50
565
+ },
566
+ {
567
+ "accuracy": 1.0,
568
+ "epoch": 0.17,
569
+ "step": 50
570
+ },
571
+ {
572
+ "accuracy": 1.0,
573
+ "epoch": 0.17,
574
+ "step": 50
575
+ },
576
+ {
577
+ "accuracy": 1.0,
578
+ "epoch": 0.17,
579
+ "step": 50
580
+ },
581
+ {
582
+ "accuracy": 1.0,
583
+ "epoch": 0.17,
584
+ "step": 50
585
+ },
586
+ {
587
+ "accuracy": 1.0,
588
+ "epoch": 0.17,
589
+ "step": 50
590
+ },
591
+ {
592
+ "accuracy": 1.0,
593
+ "epoch": 0.17,
594
+ "step": 50
595
+ },
596
+ {
597
+ "accuracy": 0.0,
598
+ "epoch": 0.17,
599
+ "step": 50
600
+ },
601
+ {
602
+ "accuracy": 1.0,
603
+ "epoch": 0.17,
604
+ "step": 50
605
+ },
606
+ {
607
+ "accuracy": 1.0,
608
+ "epoch": 0.17,
609
+ "step": 50
610
+ },
611
+ {
612
+ "accuracy": 1.0,
613
+ "epoch": 0.17,
614
+ "step": 50
615
+ },
616
+ {
617
+ "accuracy": 1.0,
618
+ "epoch": 0.17,
619
+ "step": 50
620
+ },
621
+ {
622
+ "accuracy": 1.0,
623
+ "epoch": 0.17,
624
+ "step": 50
625
+ },
626
+ {
627
+ "accuracy": 1.0,
628
+ "epoch": 0.17,
629
+ "step": 50
630
+ },
631
+ {
632
+ "accuracy": 1.0,
633
+ "epoch": 0.17,
634
+ "step": 50
635
+ },
636
+ {
637
+ "accuracy": 1.0,
638
+ "epoch": 0.17,
639
+ "step": 50
640
+ },
641
+ {
642
+ "accuracy": 1.0,
643
+ "epoch": 0.17,
644
+ "step": 50
645
+ },
646
+ {
647
+ "accuracy": 1.0,
648
+ "epoch": 0.17,
649
+ "step": 50
650
+ },
651
+ {
652
+ "accuracy": 1.0,
653
+ "epoch": 0.17,
654
+ "step": 50
655
+ },
656
+ {
657
+ "epoch": 0.17,
658
+ "eval_accuracy": 0.8888888888888888,
659
+ "eval_loss": 0.4026397466659546,
660
+ "eval_runtime": 87.7974,
661
+ "eval_samples_per_second": 0.82,
662
+ "eval_steps_per_second": 0.82,
663
+ "step": 50
664
+ },
665
+ {
666
+ "accuracy": 1.0,
667
+ "epoch": 0.17,
668
+ "step": 50
669
+ },
670
+ {
671
+ "accuracy": 1.0,
672
+ "epoch": 1.0,
673
+ "step": 51
674
+ },
675
+ {
676
+ "accuracy": 1.0,
677
+ "epoch": 1.01,
678
+ "step": 52
679
+ },
680
+ {
681
+ "accuracy": 1.0,
682
+ "epoch": 1.01,
683
+ "step": 53
684
+ },
685
+ {
686
+ "accuracy": 1.0,
687
+ "epoch": 1.01,
688
+ "step": 54
689
+ },
690
+ {
691
+ "accuracy": 1.0,
692
+ "epoch": 1.02,
693
+ "step": 55
694
+ },
695
+ {
696
+ "accuracy": 1.0,
697
+ "epoch": 1.02,
698
+ "step": 56
699
+ },
700
+ {
701
+ "accuracy": 1.0,
702
+ "epoch": 1.02,
703
+ "step": 57
704
+ },
705
+ {
706
+ "accuracy": 1.0,
707
+ "epoch": 1.03,
708
+ "step": 58
709
+ },
710
+ {
711
+ "accuracy": 1.0,
712
+ "epoch": 1.03,
713
+ "step": 59
714
+ },
715
+ {
716
+ "epoch": 1.03,
717
+ "grad_norm": 0.14552272856235504,
718
+ "learning_rate": 4.4444444444444447e-05,
719
+ "loss": 0.1926,
720
+ "step": 60
721
+ },
722
+ {
723
+ "accuracy": 1.0,
724
+ "epoch": 1.03,
725
+ "step": 60
726
+ },
727
+ {
728
+ "accuracy": 1.0,
729
+ "epoch": 1.04,
730
+ "step": 61
731
+ },
732
+ {
733
+ "accuracy": 1.0,
734
+ "epoch": 1.04,
735
+ "step": 62
736
+ },
737
+ {
738
+ "accuracy": 1.0,
739
+ "epoch": 1.04,
740
+ "step": 63
741
+ },
742
+ {
743
+ "accuracy": 1.0,
744
+ "epoch": 1.05,
745
+ "step": 64
746
+ },
747
+ {
748
+ "accuracy": 1.0,
749
+ "epoch": 1.05,
750
+ "step": 65
751
+ },
752
+ {
753
+ "accuracy": 1.0,
754
+ "epoch": 1.05,
755
+ "step": 66
756
+ },
757
+ {
758
+ "accuracy": 1.0,
759
+ "epoch": 1.06,
760
+ "step": 67
761
+ },
762
+ {
763
+ "accuracy": 1.0,
764
+ "epoch": 1.06,
765
+ "step": 68
766
+ },
767
+ {
768
+ "accuracy": 1.0,
769
+ "epoch": 1.06,
770
+ "step": 69
771
+ },
772
+ {
773
+ "epoch": 1.07,
774
+ "grad_norm": 0.07848536968231201,
775
+ "learning_rate": 4.259259259259259e-05,
776
+ "loss": 0.0464,
777
+ "step": 70
778
+ },
779
+ {
780
+ "accuracy": 1.0,
781
+ "epoch": 1.07,
782
+ "step": 70
783
+ },
784
+ {
785
+ "accuracy": 1.0,
786
+ "epoch": 1.07,
787
+ "step": 71
788
+ },
789
+ {
790
+ "accuracy": 1.0,
791
+ "epoch": 1.07,
792
+ "step": 72
793
+ },
794
+ {
795
+ "accuracy": 1.0,
796
+ "epoch": 1.08,
797
+ "step": 73
798
+ },
799
+ {
800
+ "accuracy": 1.0,
801
+ "epoch": 1.08,
802
+ "step": 74
803
+ },
804
+ {
805
+ "accuracy": 1.0,
806
+ "epoch": 1.08,
807
+ "step": 75
808
+ },
809
+ {
810
+ "accuracy": 1.0,
811
+ "epoch": 1.09,
812
+ "step": 76
813
+ },
814
+ {
815
+ "accuracy": 1.0,
816
+ "epoch": 1.09,
817
+ "step": 77
818
+ },
819
+ {
820
+ "accuracy": 1.0,
821
+ "epoch": 1.09,
822
+ "step": 78
823
+ },
824
+ {
825
+ "accuracy": 1.0,
826
+ "epoch": 1.1,
827
+ "step": 79
828
+ },
829
+ {
830
+ "epoch": 1.1,
831
+ "grad_norm": 0.050905052572488785,
832
+ "learning_rate": 4.074074074074074e-05,
833
+ "loss": 0.0058,
834
+ "step": 80
835
+ },
836
+ {
837
+ "accuracy": 1.0,
838
+ "epoch": 1.1,
839
+ "step": 80
840
+ },
841
+ {
842
+ "accuracy": 1.0,
843
+ "epoch": 1.1,
844
+ "step": 81
845
+ },
846
+ {
847
+ "accuracy": 1.0,
848
+ "epoch": 1.11,
849
+ "step": 82
850
+ },
851
+ {
852
+ "accuracy": 1.0,
853
+ "epoch": 1.11,
854
+ "step": 83
855
+ },
856
+ {
857
+ "accuracy": 1.0,
858
+ "epoch": 1.11,
859
+ "step": 84
860
+ },
861
+ {
862
+ "accuracy": 1.0,
863
+ "epoch": 1.12,
864
+ "step": 85
865
+ },
866
+ {
867
+ "accuracy": 1.0,
868
+ "epoch": 1.12,
869
+ "step": 86
870
+ },
871
+ {
872
+ "accuracy": 1.0,
873
+ "epoch": 1.12,
874
+ "step": 87
875
+ },
876
+ {
877
+ "accuracy": 1.0,
878
+ "epoch": 1.13,
879
+ "step": 88
880
+ },
881
+ {
882
+ "accuracy": 1.0,
883
+ "epoch": 1.13,
884
+ "step": 89
885
+ },
886
+ {
887
+ "epoch": 1.13,
888
+ "grad_norm": 6.470664024353027,
889
+ "learning_rate": 3.888888888888889e-05,
890
+ "loss": 0.0394,
891
+ "step": 90
892
+ },
893
+ {
894
+ "accuracy": 1.0,
895
+ "epoch": 1.13,
896
+ "step": 90
897
+ },
898
+ {
899
+ "accuracy": 1.0,
900
+ "epoch": 1.14,
901
+ "step": 91
902
+ },
903
+ {
904
+ "accuracy": 1.0,
905
+ "epoch": 1.14,
906
+ "step": 92
907
+ },
908
+ {
909
+ "accuracy": 1.0,
910
+ "epoch": 1.14,
911
+ "step": 93
912
+ },
913
+ {
914
+ "accuracy": 1.0,
915
+ "epoch": 1.15,
916
+ "step": 94
917
+ },
918
+ {
919
+ "accuracy": 1.0,
920
+ "epoch": 1.15,
921
+ "step": 95
922
+ },
923
+ {
924
+ "accuracy": 1.0,
925
+ "epoch": 1.15,
926
+ "step": 96
927
+ },
928
+ {
929
+ "accuracy": 1.0,
930
+ "epoch": 1.16,
931
+ "step": 97
932
+ },
933
+ {
934
+ "accuracy": 1.0,
935
+ "epoch": 1.16,
936
+ "step": 98
937
+ },
938
+ {
939
+ "accuracy": 1.0,
940
+ "epoch": 1.16,
941
+ "step": 99
942
+ },
943
+ {
944
+ "epoch": 1.17,
945
+ "grad_norm": 0.0948905348777771,
946
+ "learning_rate": 3.7037037037037037e-05,
947
+ "loss": 0.0261,
948
+ "step": 100
949
+ },
950
+ {
951
+ "accuracy": 1.0,
952
+ "epoch": 1.17,
953
+ "step": 100
954
+ },
955
+ {
956
+ "accuracy": 1.0,
957
+ "epoch": 1.17,
958
+ "step": 100
959
+ },
960
+ {
961
+ "accuracy": 1.0,
962
+ "epoch": 1.17,
963
+ "step": 100
964
+ },
965
+ {
966
+ "accuracy": 0.0,
967
+ "epoch": 1.17,
968
+ "step": 100
969
+ },
970
+ {
971
+ "accuracy": 1.0,
972
+ "epoch": 1.17,
973
+ "step": 100
974
+ },
975
+ {
976
+ "accuracy": 1.0,
977
+ "epoch": 1.17,
978
+ "step": 100
979
+ },
980
+ {
981
+ "accuracy": 0.0,
982
+ "epoch": 1.17,
983
+ "step": 100
984
+ },
985
+ {
986
+ "accuracy": 0.0,
987
+ "epoch": 1.17,
988
+ "step": 100
989
+ },
990
+ {
991
+ "accuracy": 0.0,
992
+ "epoch": 1.17,
993
+ "step": 100
994
+ },
995
+ {
996
+ "accuracy": 1.0,
997
+ "epoch": 1.17,
998
+ "step": 100
999
+ },
1000
+ {
1001
+ "accuracy": 1.0,
1002
+ "epoch": 1.17,
1003
+ "step": 100
1004
+ },
1005
+ {
1006
+ "accuracy": 1.0,
1007
+ "epoch": 1.17,
1008
+ "step": 100
1009
+ },
1010
+ {
1011
+ "accuracy": 1.0,
1012
+ "epoch": 1.17,
1013
+ "step": 100
1014
+ },
1015
+ {
1016
+ "accuracy": 1.0,
1017
+ "epoch": 1.17,
1018
+ "step": 100
1019
+ },
1020
+ {
1021
+ "accuracy": 1.0,
1022
+ "epoch": 1.17,
1023
+ "step": 100
1024
+ },
1025
+ {
1026
+ "accuracy": 1.0,
1027
+ "epoch": 1.17,
1028
+ "step": 100
1029
+ },
1030
+ {
1031
+ "accuracy": 1.0,
1032
+ "epoch": 1.17,
1033
+ "step": 100
1034
+ },
1035
+ {
1036
+ "accuracy": 1.0,
1037
+ "epoch": 1.17,
1038
+ "step": 100
1039
+ },
1040
+ {
1041
+ "accuracy": 1.0,
1042
+ "epoch": 1.17,
1043
+ "step": 100
1044
+ },
1045
+ {
1046
+ "accuracy": 1.0,
1047
+ "epoch": 1.17,
1048
+ "step": 100
1049
+ },
1050
+ {
1051
+ "accuracy": 1.0,
1052
+ "epoch": 1.17,
1053
+ "step": 100
1054
+ },
1055
+ {
1056
+ "accuracy": 1.0,
1057
+ "epoch": 1.17,
1058
+ "step": 100
1059
+ },
1060
+ {
1061
+ "accuracy": 1.0,
1062
+ "epoch": 1.17,
1063
+ "step": 100
1064
+ },
1065
+ {
1066
+ "accuracy": 1.0,
1067
+ "epoch": 1.17,
1068
+ "step": 100
1069
+ },
1070
+ {
1071
+ "accuracy": 1.0,
1072
+ "epoch": 1.17,
1073
+ "step": 100
1074
+ },
1075
+ {
1076
+ "accuracy": 1.0,
1077
+ "epoch": 1.17,
1078
+ "step": 100
1079
+ },
1080
+ {
1081
+ "accuracy": 1.0,
1082
+ "epoch": 1.17,
1083
+ "step": 100
1084
+ },
1085
+ {
1086
+ "accuracy": 1.0,
1087
+ "epoch": 1.17,
1088
+ "step": 100
1089
+ },
1090
+ {
1091
+ "accuracy": 1.0,
1092
+ "epoch": 1.17,
1093
+ "step": 100
1094
+ },
1095
+ {
1096
+ "accuracy": 1.0,
1097
+ "epoch": 1.17,
1098
+ "step": 100
1099
+ },
1100
+ {
1101
+ "accuracy": 1.0,
1102
+ "epoch": 1.17,
1103
+ "step": 100
1104
+ },
1105
+ {
1106
+ "accuracy": 1.0,
1107
+ "epoch": 1.17,
1108
+ "step": 100
1109
+ },
1110
+ {
1111
+ "accuracy": 1.0,
1112
+ "epoch": 1.17,
1113
+ "step": 100
1114
+ },
1115
+ {
1116
+ "accuracy": 1.0,
1117
+ "epoch": 1.17,
1118
+ "step": 100
1119
+ },
1120
+ {
1121
+ "accuracy": 1.0,
1122
+ "epoch": 1.17,
1123
+ "step": 100
1124
+ },
1125
+ {
1126
+ "accuracy": 1.0,
1127
+ "epoch": 1.17,
1128
+ "step": 100
1129
+ },
1130
+ {
1131
+ "accuracy": 0.0,
1132
+ "epoch": 1.17,
1133
+ "step": 100
1134
+ },
1135
+ {
1136
+ "accuracy": 1.0,
1137
+ "epoch": 1.17,
1138
+ "step": 100
1139
+ },
1140
+ {
1141
+ "accuracy": 1.0,
1142
+ "epoch": 1.17,
1143
+ "step": 100
1144
+ },
1145
+ {
1146
+ "accuracy": 1.0,
1147
+ "epoch": 1.17,
1148
+ "step": 100
1149
+ },
1150
+ {
1151
+ "accuracy": 1.0,
1152
+ "epoch": 1.17,
1153
+ "step": 100
1154
+ },
1155
+ {
1156
+ "accuracy": 1.0,
1157
+ "epoch": 1.17,
1158
+ "step": 100
1159
+ },
1160
+ {
1161
+ "accuracy": 1.0,
1162
+ "epoch": 1.17,
1163
+ "step": 100
1164
+ },
1165
+ {
1166
+ "accuracy": 1.0,
1167
+ "epoch": 1.17,
1168
+ "step": 100
1169
+ },
1170
+ {
1171
+ "accuracy": 1.0,
1172
+ "epoch": 1.17,
1173
+ "step": 100
1174
+ },
1175
+ {
1176
+ "accuracy": 0.0,
1177
+ "epoch": 1.17,
1178
+ "step": 100
1179
+ },
1180
+ {
1181
+ "accuracy": 0.0,
1182
+ "epoch": 1.17,
1183
+ "step": 100
1184
+ },
1185
+ {
1186
+ "accuracy": 0.0,
1187
+ "epoch": 1.17,
1188
+ "step": 100
1189
+ },
1190
+ {
1191
+ "accuracy": 1.0,
1192
+ "epoch": 1.17,
1193
+ "step": 100
1194
+ },
1195
+ {
1196
+ "accuracy": 1.0,
1197
+ "epoch": 1.17,
1198
+ "step": 100
1199
+ },
1200
+ {
1201
+ "accuracy": 1.0,
1202
+ "epoch": 1.17,
1203
+ "step": 100
1204
+ },
1205
+ {
1206
+ "accuracy": 1.0,
1207
+ "epoch": 1.17,
1208
+ "step": 100
1209
+ },
1210
+ {
1211
+ "accuracy": 1.0,
1212
+ "epoch": 1.17,
1213
+ "step": 100
1214
+ },
1215
+ {
1216
+ "accuracy": 1.0,
1217
+ "epoch": 1.17,
1218
+ "step": 100
1219
+ },
1220
+ {
1221
+ "accuracy": 1.0,
1222
+ "epoch": 1.17,
1223
+ "step": 100
1224
+ },
1225
+ {
1226
+ "accuracy": 1.0,
1227
+ "epoch": 1.17,
1228
+ "step": 100
1229
+ },
1230
+ {
1231
+ "accuracy": 1.0,
1232
+ "epoch": 1.17,
1233
+ "step": 100
1234
+ },
1235
+ {
1236
+ "accuracy": 1.0,
1237
+ "epoch": 1.17,
1238
+ "step": 100
1239
+ },
1240
+ {
1241
+ "accuracy": 1.0,
1242
+ "epoch": 1.17,
1243
+ "step": 100
1244
+ },
1245
+ {
1246
+ "accuracy": 1.0,
1247
+ "epoch": 1.17,
1248
+ "step": 100
1249
+ },
1250
+ {
1251
+ "accuracy": 0.0,
1252
+ "epoch": 1.17,
1253
+ "step": 100
1254
+ },
1255
+ {
1256
+ "accuracy": 1.0,
1257
+ "epoch": 1.17,
1258
+ "step": 100
1259
+ },
1260
+ {
1261
+ "accuracy": 1.0,
1262
+ "epoch": 1.17,
1263
+ "step": 100
1264
+ },
1265
+ {
1266
+ "accuracy": 1.0,
1267
+ "epoch": 1.17,
1268
+ "step": 100
1269
+ },
1270
+ {
1271
+ "accuracy": 1.0,
1272
+ "epoch": 1.17,
1273
+ "step": 100
1274
+ },
1275
+ {
1276
+ "accuracy": 1.0,
1277
+ "epoch": 1.17,
1278
+ "step": 100
1279
+ },
1280
+ {
1281
+ "accuracy": 1.0,
1282
+ "epoch": 1.17,
1283
+ "step": 100
1284
+ },
1285
+ {
1286
+ "accuracy": 1.0,
1287
+ "epoch": 1.17,
1288
+ "step": 100
1289
+ },
1290
+ {
1291
+ "accuracy": 1.0,
1292
+ "epoch": 1.17,
1293
+ "step": 100
1294
+ },
1295
+ {
1296
+ "accuracy": 0.0,
1297
+ "epoch": 1.17,
1298
+ "step": 100
1299
+ },
1300
+ {
1301
+ "accuracy": 0.0,
1302
+ "epoch": 1.17,
1303
+ "step": 100
1304
+ },
1305
+ {
1306
+ "accuracy": 0.0,
1307
+ "epoch": 1.17,
1308
+ "step": 100
1309
+ },
1310
+ {
1311
+ "epoch": 1.17,
1312
+ "eval_accuracy": 0.8333333333333334,
1313
+ "eval_loss": 0.5325552821159363,
1314
+ "eval_runtime": 87.2796,
1315
+ "eval_samples_per_second": 0.825,
1316
+ "eval_steps_per_second": 0.825,
1317
+ "step": 100
1318
+ },
1319
+ {
1320
+ "accuracy": 1.0,
1321
+ "epoch": 1.17,
1322
+ "step": 100
1323
+ },
1324
+ {
1325
+ "accuracy": 1.0,
1326
+ "epoch": 2.0,
1327
+ "step": 101
1328
+ },
1329
+ {
1330
+ "accuracy": 1.0,
1331
+ "epoch": 2.01,
1332
+ "step": 102
1333
+ },
1334
+ {
1335
+ "accuracy": 1.0,
1336
+ "epoch": 2.01,
1337
+ "step": 103
1338
+ },
1339
+ {
1340
+ "accuracy": 1.0,
1341
+ "epoch": 2.01,
1342
+ "step": 104
1343
+ },
1344
+ {
1345
+ "accuracy": 1.0,
1346
+ "epoch": 2.02,
1347
+ "step": 105
1348
+ },
1349
+ {
1350
+ "accuracy": 1.0,
1351
+ "epoch": 2.02,
1352
+ "step": 106
1353
+ },
1354
+ {
1355
+ "accuracy": 1.0,
1356
+ "epoch": 2.02,
1357
+ "step": 107
1358
+ },
1359
+ {
1360
+ "accuracy": 1.0,
1361
+ "epoch": 2.03,
1362
+ "step": 108
1363
+ },
1364
+ {
1365
+ "accuracy": 1.0,
1366
+ "epoch": 2.03,
1367
+ "step": 109
1368
+ },
1369
+ {
1370
+ "epoch": 2.03,
1371
+ "grad_norm": 0.04049674794077873,
1372
+ "learning_rate": 3.518518518518519e-05,
1373
+ "loss": 0.0025,
1374
+ "step": 110
1375
+ },
1376
+ {
1377
+ "accuracy": 1.0,
1378
+ "epoch": 2.03,
1379
+ "step": 110
1380
+ },
1381
+ {
1382
+ "accuracy": 1.0,
1383
+ "epoch": 2.04,
1384
+ "step": 111
1385
+ },
1386
+ {
1387
+ "accuracy": 1.0,
1388
+ "epoch": 2.04,
1389
+ "step": 112
1390
+ },
1391
+ {
1392
+ "accuracy": 1.0,
1393
+ "epoch": 2.04,
1394
+ "step": 113
1395
+ },
1396
+ {
1397
+ "accuracy": 1.0,
1398
+ "epoch": 2.05,
1399
+ "step": 114
1400
+ },
1401
+ {
1402
+ "accuracy": 1.0,
1403
+ "epoch": 2.05,
1404
+ "step": 115
1405
+ },
1406
+ {
1407
+ "accuracy": 1.0,
1408
+ "epoch": 2.05,
1409
+ "step": 116
1410
+ },
1411
+ {
1412
+ "accuracy": 1.0,
1413
+ "epoch": 2.06,
1414
+ "step": 117
1415
+ },
1416
+ {
1417
+ "accuracy": 1.0,
1418
+ "epoch": 2.06,
1419
+ "step": 118
1420
+ },
1421
+ {
1422
+ "accuracy": 1.0,
1423
+ "epoch": 2.06,
1424
+ "step": 119
1425
+ },
1426
+ {
1427
+ "epoch": 2.07,
1428
+ "grad_norm": 0.015014845877885818,
1429
+ "learning_rate": 3.3333333333333335e-05,
1430
+ "loss": 0.0043,
1431
+ "step": 120
1432
+ },
1433
+ {
1434
+ "accuracy": 1.0,
1435
+ "epoch": 2.07,
1436
+ "step": 120
1437
+ },
1438
+ {
1439
+ "accuracy": 1.0,
1440
+ "epoch": 2.07,
1441
+ "step": 121
1442
+ },
1443
+ {
1444
+ "accuracy": 1.0,
1445
+ "epoch": 2.07,
1446
+ "step": 122
1447
+ },
1448
+ {
1449
+ "accuracy": 1.0,
1450
+ "epoch": 2.08,
1451
+ "step": 123
1452
+ },
1453
+ {
1454
+ "accuracy": 1.0,
1455
+ "epoch": 2.08,
1456
+ "step": 124
1457
+ },
1458
+ {
1459
+ "accuracy": 1.0,
1460
+ "epoch": 2.08,
1461
+ "step": 125
1462
+ },
1463
+ {
1464
+ "accuracy": 1.0,
1465
+ "epoch": 2.09,
1466
+ "step": 126
1467
+ },
1468
+ {
1469
+ "accuracy": 1.0,
1470
+ "epoch": 2.09,
1471
+ "step": 127
1472
+ },
1473
+ {
1474
+ "accuracy": 1.0,
1475
+ "epoch": 2.09,
1476
+ "step": 128
1477
+ },
1478
+ {
1479
+ "accuracy": 1.0,
1480
+ "epoch": 2.1,
1481
+ "step": 129
1482
+ },
1483
+ {
1484
+ "epoch": 2.1,
1485
+ "grad_norm": 0.015865737572312355,
1486
+ "learning_rate": 3.148148148148148e-05,
1487
+ "loss": 0.0477,
1488
+ "step": 130
1489
+ },
1490
+ {
1491
+ "accuracy": 1.0,
1492
+ "epoch": 2.1,
1493
+ "step": 130
1494
+ },
1495
+ {
1496
+ "accuracy": 1.0,
1497
+ "epoch": 2.1,
1498
+ "step": 131
1499
+ },
1500
+ {
1501
+ "accuracy": 1.0,
1502
+ "epoch": 2.11,
1503
+ "step": 132
1504
+ },
1505
+ {
1506
+ "accuracy": 1.0,
1507
+ "epoch": 2.11,
1508
+ "step": 133
1509
+ },
1510
+ {
1511
+ "accuracy": 1.0,
1512
+ "epoch": 2.11,
1513
+ "step": 134
1514
+ },
1515
+ {
1516
+ "accuracy": 1.0,
1517
+ "epoch": 2.12,
1518
+ "step": 135
1519
+ },
1520
+ {
1521
+ "accuracy": 1.0,
1522
+ "epoch": 2.12,
1523
+ "step": 136
1524
+ },
1525
+ {
1526
+ "accuracy": 1.0,
1527
+ "epoch": 2.12,
1528
+ "step": 137
1529
+ },
1530
+ {
1531
+ "accuracy": 1.0,
1532
+ "epoch": 2.13,
1533
+ "step": 138
1534
+ },
1535
+ {
1536
+ "accuracy": 1.0,
1537
+ "epoch": 2.13,
1538
+ "step": 139
1539
+ },
1540
+ {
1541
+ "epoch": 2.13,
1542
+ "grad_norm": 0.20402489602565765,
1543
+ "learning_rate": 2.962962962962963e-05,
1544
+ "loss": 0.002,
1545
+ "step": 140
1546
+ },
1547
+ {
1548
+ "accuracy": 1.0,
1549
+ "epoch": 2.13,
1550
+ "step": 140
1551
+ },
1552
+ {
1553
+ "accuracy": 1.0,
1554
+ "epoch": 2.14,
1555
+ "step": 141
1556
+ },
1557
+ {
1558
+ "accuracy": 1.0,
1559
+ "epoch": 2.14,
1560
+ "step": 142
1561
+ },
1562
+ {
1563
+ "accuracy": 1.0,
1564
+ "epoch": 2.14,
1565
+ "step": 143
1566
+ },
1567
+ {
1568
+ "accuracy": 1.0,
1569
+ "epoch": 2.15,
1570
+ "step": 144
1571
+ },
1572
+ {
1573
+ "accuracy": 1.0,
1574
+ "epoch": 2.15,
1575
+ "step": 145
1576
+ },
1577
+ {
1578
+ "accuracy": 1.0,
1579
+ "epoch": 2.15,
1580
+ "step": 146
1581
+ },
1582
+ {
1583
+ "accuracy": 1.0,
1584
+ "epoch": 2.16,
1585
+ "step": 147
1586
+ },
1587
+ {
1588
+ "accuracy": 1.0,
1589
+ "epoch": 2.16,
1590
+ "step": 148
1591
+ },
1592
+ {
1593
+ "accuracy": 1.0,
1594
+ "epoch": 2.16,
1595
+ "step": 149
1596
+ },
1597
+ {
1598
+ "epoch": 2.17,
1599
+ "grad_norm": 0.13866350054740906,
1600
+ "learning_rate": 2.777777777777778e-05,
1601
+ "loss": 0.0047,
1602
+ "step": 150
1603
+ },
1604
+ {
1605
+ "accuracy": 1.0,
1606
+ "epoch": 2.17,
1607
+ "step": 150
1608
+ },
1609
+ {
1610
+ "accuracy": 1.0,
1611
+ "epoch": 2.17,
1612
+ "step": 150
1613
+ },
1614
+ {
1615
+ "accuracy": 1.0,
1616
+ "epoch": 2.17,
1617
+ "step": 150
1618
+ },
1619
+ {
1620
+ "accuracy": 1.0,
1621
+ "epoch": 2.17,
1622
+ "step": 150
1623
+ },
1624
+ {
1625
+ "accuracy": 1.0,
1626
+ "epoch": 2.17,
1627
+ "step": 150
1628
+ },
1629
+ {
1630
+ "accuracy": 1.0,
1631
+ "epoch": 2.17,
1632
+ "step": 150
1633
+ },
1634
+ {
1635
+ "accuracy": 1.0,
1636
+ "epoch": 2.17,
1637
+ "step": 150
1638
+ },
1639
+ {
1640
+ "accuracy": 1.0,
1641
+ "epoch": 2.17,
1642
+ "step": 150
1643
+ },
1644
+ {
1645
+ "accuracy": 1.0,
1646
+ "epoch": 2.17,
1647
+ "step": 150
1648
+ },
1649
+ {
1650
+ "accuracy": 1.0,
1651
+ "epoch": 2.17,
1652
+ "step": 150
1653
+ },
1654
+ {
1655
+ "accuracy": 1.0,
1656
+ "epoch": 2.17,
1657
+ "step": 150
1658
+ },
1659
+ {
1660
+ "accuracy": 1.0,
1661
+ "epoch": 2.17,
1662
+ "step": 150
1663
+ },
1664
+ {
1665
+ "accuracy": 1.0,
1666
+ "epoch": 2.17,
1667
+ "step": 150
1668
+ },
1669
+ {
1670
+ "accuracy": 1.0,
1671
+ "epoch": 2.17,
1672
+ "step": 150
1673
+ },
1674
+ {
1675
+ "accuracy": 1.0,
1676
+ "epoch": 2.17,
1677
+ "step": 150
1678
+ },
1679
+ {
1680
+ "accuracy": 1.0,
1681
+ "epoch": 2.17,
1682
+ "step": 150
1683
+ },
1684
+ {
1685
+ "accuracy": 1.0,
1686
+ "epoch": 2.17,
1687
+ "step": 150
1688
+ },
1689
+ {
1690
+ "accuracy": 1.0,
1691
+ "epoch": 2.17,
1692
+ "step": 150
1693
+ },
1694
+ {
1695
+ "accuracy": 1.0,
1696
+ "epoch": 2.17,
1697
+ "step": 150
1698
+ },
1699
+ {
1700
+ "accuracy": 1.0,
1701
+ "epoch": 2.17,
1702
+ "step": 150
1703
+ },
1704
+ {
1705
+ "accuracy": 1.0,
1706
+ "epoch": 2.17,
1707
+ "step": 150
1708
+ },
1709
+ {
1710
+ "accuracy": 1.0,
1711
+ "epoch": 2.17,
1712
+ "step": 150
1713
+ },
1714
+ {
1715
+ "accuracy": 1.0,
1716
+ "epoch": 2.17,
1717
+ "step": 150
1718
+ },
1719
+ {
1720
+ "accuracy": 1.0,
1721
+ "epoch": 2.17,
1722
+ "step": 150
1723
+ },
1724
+ {
1725
+ "accuracy": 1.0,
1726
+ "epoch": 2.17,
1727
+ "step": 150
1728
+ },
1729
+ {
1730
+ "accuracy": 1.0,
1731
+ "epoch": 2.17,
1732
+ "step": 150
1733
+ },
1734
+ {
1735
+ "accuracy": 1.0,
1736
+ "epoch": 2.17,
1737
+ "step": 150
1738
+ },
1739
+ {
1740
+ "accuracy": 1.0,
1741
+ "epoch": 2.17,
1742
+ "step": 150
1743
+ },
1744
+ {
1745
+ "accuracy": 1.0,
1746
+ "epoch": 2.17,
1747
+ "step": 150
1748
+ },
1749
+ {
1750
+ "accuracy": 1.0,
1751
+ "epoch": 2.17,
1752
+ "step": 150
1753
+ },
1754
+ {
1755
+ "accuracy": 1.0,
1756
+ "epoch": 2.17,
1757
+ "step": 150
1758
+ },
1759
+ {
1760
+ "accuracy": 1.0,
1761
+ "epoch": 2.17,
1762
+ "step": 150
1763
+ },
1764
+ {
1765
+ "accuracy": 1.0,
1766
+ "epoch": 2.17,
1767
+ "step": 150
1768
+ },
1769
+ {
1770
+ "accuracy": 1.0,
1771
+ "epoch": 2.17,
1772
+ "step": 150
1773
+ },
1774
+ {
1775
+ "accuracy": 1.0,
1776
+ "epoch": 2.17,
1777
+ "step": 150
1778
+ },
1779
+ {
1780
+ "accuracy": 1.0,
1781
+ "epoch": 2.17,
1782
+ "step": 150
1783
+ },
1784
+ {
1785
+ "accuracy": 1.0,
1786
+ "epoch": 2.17,
1787
+ "step": 150
1788
+ },
1789
+ {
1790
+ "accuracy": 1.0,
1791
+ "epoch": 2.17,
1792
+ "step": 150
1793
+ },
1794
+ {
1795
+ "accuracy": 1.0,
1796
+ "epoch": 2.17,
1797
+ "step": 150
1798
+ },
1799
+ {
1800
+ "accuracy": 1.0,
1801
+ "epoch": 2.17,
1802
+ "step": 150
1803
+ },
1804
+ {
1805
+ "accuracy": 1.0,
1806
+ "epoch": 2.17,
1807
+ "step": 150
1808
+ },
1809
+ {
1810
+ "accuracy": 1.0,
1811
+ "epoch": 2.17,
1812
+ "step": 150
1813
+ },
1814
+ {
1815
+ "accuracy": 1.0,
1816
+ "epoch": 2.17,
1817
+ "step": 150
1818
+ },
1819
+ {
1820
+ "accuracy": 1.0,
1821
+ "epoch": 2.17,
1822
+ "step": 150
1823
+ },
1824
+ {
1825
+ "accuracy": 1.0,
1826
+ "epoch": 2.17,
1827
+ "step": 150
1828
+ },
1829
+ {
1830
+ "accuracy": 1.0,
1831
+ "epoch": 2.17,
1832
+ "step": 150
1833
+ },
1834
+ {
1835
+ "accuracy": 1.0,
1836
+ "epoch": 2.17,
1837
+ "step": 150
1838
+ },
1839
+ {
1840
+ "accuracy": 1.0,
1841
+ "epoch": 2.17,
1842
+ "step": 150
1843
+ },
1844
+ {
1845
+ "accuracy": 0.0,
1846
+ "epoch": 2.17,
1847
+ "step": 150
1848
+ },
1849
+ {
1850
+ "accuracy": 1.0,
1851
+ "epoch": 2.17,
1852
+ "step": 150
1853
+ },
1854
+ {
1855
+ "accuracy": 1.0,
1856
+ "epoch": 2.17,
1857
+ "step": 150
1858
+ },
1859
+ {
1860
+ "accuracy": 1.0,
1861
+ "epoch": 2.17,
1862
+ "step": 150
1863
+ },
1864
+ {
1865
+ "accuracy": 1.0,
1866
+ "epoch": 2.17,
1867
+ "step": 150
1868
+ },
1869
+ {
1870
+ "accuracy": 1.0,
1871
+ "epoch": 2.17,
1872
+ "step": 150
1873
+ },
1874
+ {
1875
+ "accuracy": 0.0,
1876
+ "epoch": 2.17,
1877
+ "step": 150
1878
+ },
1879
+ {
1880
+ "accuracy": 0.0,
1881
+ "epoch": 2.17,
1882
+ "step": 150
1883
+ },
1884
+ {
1885
+ "accuracy": 0.0,
1886
+ "epoch": 2.17,
1887
+ "step": 150
1888
+ },
1889
+ {
1890
+ "accuracy": 1.0,
1891
+ "epoch": 2.17,
1892
+ "step": 150
1893
+ },
1894
+ {
1895
+ "accuracy": 1.0,
1896
+ "epoch": 2.17,
1897
+ "step": 150
1898
+ },
1899
+ {
1900
+ "accuracy": 1.0,
1901
+ "epoch": 2.17,
1902
+ "step": 150
1903
+ },
1904
+ {
1905
+ "accuracy": 0.0,
1906
+ "epoch": 2.17,
1907
+ "step": 150
1908
+ },
1909
+ {
1910
+ "accuracy": 0.0,
1911
+ "epoch": 2.17,
1912
+ "step": 150
1913
+ },
1914
+ {
1915
+ "accuracy": 1.0,
1916
+ "epoch": 2.17,
1917
+ "step": 150
1918
+ },
1919
+ {
1920
+ "accuracy": 1.0,
1921
+ "epoch": 2.17,
1922
+ "step": 150
1923
+ },
1924
+ {
1925
+ "accuracy": 1.0,
1926
+ "epoch": 2.17,
1927
+ "step": 150
1928
+ },
1929
+ {
1930
+ "accuracy": 1.0,
1931
+ "epoch": 2.17,
1932
+ "step": 150
1933
+ },
1934
+ {
1935
+ "accuracy": 1.0,
1936
+ "epoch": 2.17,
1937
+ "step": 150
1938
+ },
1939
+ {
1940
+ "accuracy": 1.0,
1941
+ "epoch": 2.17,
1942
+ "step": 150
1943
+ },
1944
+ {
1945
+ "accuracy": 1.0,
1946
+ "epoch": 2.17,
1947
+ "step": 150
1948
+ },
1949
+ {
1950
+ "accuracy": 1.0,
1951
+ "epoch": 2.17,
1952
+ "step": 150
1953
+ },
1954
+ {
1955
+ "accuracy": 1.0,
1956
+ "epoch": 2.17,
1957
+ "step": 150
1958
+ },
1959
+ {
1960
+ "accuracy": 1.0,
1961
+ "epoch": 2.17,
1962
+ "step": 150
1963
+ },
1964
+ {
1965
+ "epoch": 2.17,
1966
+ "eval_accuracy": 0.9166666666666666,
1967
+ "eval_loss": 0.33773934841156006,
1968
+ "eval_runtime": 86.6338,
1969
+ "eval_samples_per_second": 0.831,
1970
+ "eval_steps_per_second": 0.831,
1971
+ "step": 150
1972
+ },
1973
+ {
1974
+ "accuracy": 1.0,
1975
+ "epoch": 2.17,
1976
+ "step": 150
1977
+ },
1978
+ {
1979
+ "accuracy": 1.0,
1980
+ "epoch": 3.0,
1981
+ "step": 151
1982
+ },
1983
+ {
1984
+ "accuracy": 1.0,
1985
+ "epoch": 3.01,
1986
+ "step": 152
1987
+ },
1988
+ {
1989
+ "accuracy": 1.0,
1990
+ "epoch": 3.01,
1991
+ "step": 153
1992
+ },
1993
+ {
1994
+ "accuracy": 1.0,
1995
+ "epoch": 3.01,
1996
+ "step": 154
1997
+ },
1998
+ {
1999
+ "accuracy": 1.0,
2000
+ "epoch": 3.02,
2001
+ "step": 155
2002
+ },
2003
+ {
2004
+ "accuracy": 1.0,
2005
+ "epoch": 3.02,
2006
+ "step": 156
2007
+ },
2008
+ {
2009
+ "accuracy": 1.0,
2010
+ "epoch": 3.02,
2011
+ "step": 157
2012
+ },
2013
+ {
2014
+ "accuracy": 1.0,
2015
+ "epoch": 3.03,
2016
+ "step": 158
2017
+ },
2018
+ {
2019
+ "accuracy": 1.0,
2020
+ "epoch": 3.03,
2021
+ "step": 159
2022
+ },
2023
+ {
2024
+ "epoch": 3.03,
2025
+ "grad_norm": 0.026771828532218933,
2026
+ "learning_rate": 2.5925925925925925e-05,
2027
+ "loss": 0.0034,
2028
+ "step": 160
2029
+ },
2030
+ {
2031
+ "accuracy": 1.0,
2032
+ "epoch": 3.03,
2033
+ "step": 160
2034
+ },
2035
+ {
2036
+ "accuracy": 1.0,
2037
+ "epoch": 3.04,
2038
+ "step": 161
2039
+ },
2040
+ {
2041
+ "accuracy": 1.0,
2042
+ "epoch": 3.04,
2043
+ "step": 162
2044
+ },
2045
+ {
2046
+ "accuracy": 1.0,
2047
+ "epoch": 3.04,
2048
+ "step": 163
2049
+ },
2050
+ {
2051
+ "accuracy": 1.0,
2052
+ "epoch": 3.05,
2053
+ "step": 164
2054
+ },
2055
+ {
2056
+ "accuracy": 1.0,
2057
+ "epoch": 3.05,
2058
+ "step": 165
2059
+ },
2060
+ {
2061
+ "accuracy": 1.0,
2062
+ "epoch": 3.05,
2063
+ "step": 166
2064
+ },
2065
+ {
2066
+ "accuracy": 1.0,
2067
+ "epoch": 3.06,
2068
+ "step": 167
2069
+ },
2070
+ {
2071
+ "accuracy": 0.0,
2072
+ "epoch": 3.06,
2073
+ "step": 168
2074
+ },
2075
+ {
2076
+ "accuracy": 1.0,
2077
+ "epoch": 3.06,
2078
+ "step": 169
2079
+ },
2080
+ {
2081
+ "epoch": 3.07,
2082
+ "grad_norm": 0.006014752667397261,
2083
+ "learning_rate": 2.4074074074074074e-05,
2084
+ "loss": 0.6783,
2085
+ "step": 170
2086
+ },
2087
+ {
2088
+ "accuracy": 1.0,
2089
+ "epoch": 3.07,
2090
+ "step": 170
2091
+ },
2092
+ {
2093
+ "accuracy": 1.0,
2094
+ "epoch": 3.07,
2095
+ "step": 171
2096
+ },
2097
+ {
2098
+ "accuracy": 1.0,
2099
+ "epoch": 3.07,
2100
+ "step": 172
2101
+ },
2102
+ {
2103
+ "accuracy": 1.0,
2104
+ "epoch": 3.08,
2105
+ "step": 173
2106
+ },
2107
+ {
2108
+ "accuracy": 1.0,
2109
+ "epoch": 3.08,
2110
+ "step": 174
2111
+ },
2112
+ {
2113
+ "accuracy": 1.0,
2114
+ "epoch": 3.08,
2115
+ "step": 175
2116
+ },
2117
+ {
2118
+ "accuracy": 1.0,
2119
+ "epoch": 3.09,
2120
+ "step": 176
2121
+ },
2122
+ {
2123
+ "accuracy": 1.0,
2124
+ "epoch": 3.09,
2125
+ "step": 177
2126
+ },
2127
+ {
2128
+ "accuracy": 1.0,
2129
+ "epoch": 3.09,
2130
+ "step": 178
2131
+ },
2132
+ {
2133
+ "accuracy": 1.0,
2134
+ "epoch": 3.1,
2135
+ "step": 179
2136
+ },
2137
+ {
2138
+ "epoch": 3.1,
2139
+ "grad_norm": 0.0051905447617173195,
2140
+ "learning_rate": 2.2222222222222223e-05,
2141
+ "loss": 0.0007,
2142
+ "step": 180
2143
+ },
2144
+ {
2145
+ "accuracy": 1.0,
2146
+ "epoch": 3.1,
2147
+ "step": 180
2148
+ },
2149
+ {
2150
+ "accuracy": 1.0,
2151
+ "epoch": 3.1,
2152
+ "step": 181
2153
+ },
2154
+ {
2155
+ "accuracy": 1.0,
2156
+ "epoch": 3.11,
2157
+ "step": 182
2158
+ },
2159
+ {
2160
+ "accuracy": 1.0,
2161
+ "epoch": 3.11,
2162
+ "step": 183
2163
+ },
2164
+ {
2165
+ "accuracy": 1.0,
2166
+ "epoch": 3.11,
2167
+ "step": 184
2168
+ },
2169
+ {
2170
+ "accuracy": 1.0,
2171
+ "epoch": 3.12,
2172
+ "step": 185
2173
+ },
2174
+ {
2175
+ "accuracy": 1.0,
2176
+ "epoch": 3.12,
2177
+ "step": 186
2178
+ },
2179
+ {
2180
+ "accuracy": 1.0,
2181
+ "epoch": 3.12,
2182
+ "step": 187
2183
+ },
2184
+ {
2185
+ "accuracy": 1.0,
2186
+ "epoch": 3.13,
2187
+ "step": 188
2188
+ },
2189
+ {
2190
+ "accuracy": 1.0,
2191
+ "epoch": 3.13,
2192
+ "step": 189
2193
+ },
2194
+ {
2195
+ "epoch": 3.13,
2196
+ "grad_norm": 0.029750045388936996,
2197
+ "learning_rate": 2.037037037037037e-05,
2198
+ "loss": 0.0004,
2199
+ "step": 190
2200
+ },
2201
+ {
2202
+ "accuracy": 1.0,
2203
+ "epoch": 3.13,
2204
+ "step": 190
2205
+ },
2206
+ {
2207
+ "accuracy": 1.0,
2208
+ "epoch": 3.14,
2209
+ "step": 191
2210
+ },
2211
+ {
2212
+ "accuracy": 1.0,
2213
+ "epoch": 3.14,
2214
+ "step": 192
2215
+ },
2216
+ {
2217
+ "accuracy": 1.0,
2218
+ "epoch": 3.14,
2219
+ "step": 193
2220
+ },
2221
+ {
2222
+ "accuracy": 1.0,
2223
+ "epoch": 3.15,
2224
+ "step": 194
2225
+ },
2226
+ {
2227
+ "accuracy": 1.0,
2228
+ "epoch": 3.15,
2229
+ "step": 195
2230
+ },
2231
+ {
2232
+ "accuracy": 1.0,
2233
+ "epoch": 3.15,
2234
+ "step": 196
2235
+ },
2236
+ {
2237
+ "accuracy": 1.0,
2238
+ "epoch": 3.16,
2239
+ "step": 197
2240
+ },
2241
+ {
2242
+ "accuracy": 1.0,
2243
+ "epoch": 3.16,
2244
+ "step": 198
2245
+ },
2246
+ {
2247
+ "accuracy": 1.0,
2248
+ "epoch": 3.16,
2249
+ "step": 199
2250
+ },
2251
+ {
2252
+ "epoch": 3.17,
2253
+ "grad_norm": 0.016466189175844193,
2254
+ "learning_rate": 1.8518518518518518e-05,
2255
+ "loss": 0.0053,
2256
+ "step": 200
2257
+ },
2258
+ {
2259
+ "accuracy": 1.0,
2260
+ "epoch": 3.17,
2261
+ "step": 200
2262
+ },
2263
+ {
2264
+ "accuracy": 1.0,
2265
+ "epoch": 3.17,
2266
+ "step": 200
2267
+ },
2268
+ {
2269
+ "accuracy": 1.0,
2270
+ "epoch": 3.17,
2271
+ "step": 200
2272
+ },
2273
+ {
2274
+ "accuracy": 1.0,
2275
+ "epoch": 3.17,
2276
+ "step": 200
2277
+ },
2278
+ {
2279
+ "accuracy": 1.0,
2280
+ "epoch": 3.17,
2281
+ "step": 200
2282
+ },
2283
+ {
2284
+ "accuracy": 1.0,
2285
+ "epoch": 3.17,
2286
+ "step": 200
2287
+ },
2288
+ {
2289
+ "accuracy": 1.0,
2290
+ "epoch": 3.17,
2291
+ "step": 200
2292
+ },
2293
+ {
2294
+ "accuracy": 1.0,
2295
+ "epoch": 3.17,
2296
+ "step": 200
2297
+ },
2298
+ {
2299
+ "accuracy": 1.0,
2300
+ "epoch": 3.17,
2301
+ "step": 200
2302
+ },
2303
+ {
2304
+ "accuracy": 1.0,
2305
+ "epoch": 3.17,
2306
+ "step": 200
2307
+ },
2308
+ {
2309
+ "accuracy": 1.0,
2310
+ "epoch": 3.17,
2311
+ "step": 200
2312
+ },
2313
+ {
2314
+ "accuracy": 1.0,
2315
+ "epoch": 3.17,
2316
+ "step": 200
2317
+ },
2318
+ {
2319
+ "accuracy": 1.0,
2320
+ "epoch": 3.17,
2321
+ "step": 200
2322
+ },
2323
+ {
2324
+ "accuracy": 1.0,
2325
+ "epoch": 3.17,
2326
+ "step": 200
2327
+ },
2328
+ {
2329
+ "accuracy": 1.0,
2330
+ "epoch": 3.17,
2331
+ "step": 200
2332
+ },
2333
+ {
2334
+ "accuracy": 0.0,
2335
+ "epoch": 3.17,
2336
+ "step": 200
2337
+ },
2338
+ {
2339
+ "accuracy": 0.0,
2340
+ "epoch": 3.17,
2341
+ "step": 200
2342
+ },
2343
+ {
2344
+ "accuracy": 1.0,
2345
+ "epoch": 3.17,
2346
+ "step": 200
2347
+ },
2348
+ {
2349
+ "accuracy": 1.0,
2350
+ "epoch": 3.17,
2351
+ "step": 200
2352
+ },
2353
+ {
2354
+ "accuracy": 1.0,
2355
+ "epoch": 3.17,
2356
+ "step": 200
2357
+ },
2358
+ {
2359
+ "accuracy": 1.0,
2360
+ "epoch": 3.17,
2361
+ "step": 200
2362
+ },
2363
+ {
2364
+ "accuracy": 1.0,
2365
+ "epoch": 3.17,
2366
+ "step": 200
2367
+ },
2368
+ {
2369
+ "accuracy": 1.0,
2370
+ "epoch": 3.17,
2371
+ "step": 200
2372
+ },
2373
+ {
2374
+ "accuracy": 1.0,
2375
+ "epoch": 3.17,
2376
+ "step": 200
2377
+ },
2378
+ {
2379
+ "accuracy": 1.0,
2380
+ "epoch": 3.17,
2381
+ "step": 200
2382
+ },
2383
+ {
2384
+ "accuracy": 1.0,
2385
+ "epoch": 3.17,
2386
+ "step": 200
2387
+ },
2388
+ {
2389
+ "accuracy": 1.0,
2390
+ "epoch": 3.17,
2391
+ "step": 200
2392
+ },
2393
+ {
2394
+ "accuracy": 1.0,
2395
+ "epoch": 3.17,
2396
+ "step": 200
2397
+ },
2398
+ {
2399
+ "accuracy": 1.0,
2400
+ "epoch": 3.17,
2401
+ "step": 200
2402
+ },
2403
+ {
2404
+ "accuracy": 1.0,
2405
+ "epoch": 3.17,
2406
+ "step": 200
2407
+ },
2408
+ {
2409
+ "accuracy": 1.0,
2410
+ "epoch": 3.17,
2411
+ "step": 200
2412
+ },
2413
+ {
2414
+ "accuracy": 1.0,
2415
+ "epoch": 3.17,
2416
+ "step": 200
2417
+ },
2418
+ {
2419
+ "accuracy": 1.0,
2420
+ "epoch": 3.17,
2421
+ "step": 200
2422
+ },
2423
+ {
2424
+ "accuracy": 1.0,
2425
+ "epoch": 3.17,
2426
+ "step": 200
2427
+ },
2428
+ {
2429
+ "accuracy": 1.0,
2430
+ "epoch": 3.17,
2431
+ "step": 200
2432
+ },
2433
+ {
2434
+ "accuracy": 1.0,
2435
+ "epoch": 3.17,
2436
+ "step": 200
2437
+ },
2438
+ {
2439
+ "accuracy": 1.0,
2440
+ "epoch": 3.17,
2441
+ "step": 200
2442
+ },
2443
+ {
2444
+ "accuracy": 1.0,
2445
+ "epoch": 3.17,
2446
+ "step": 200
2447
+ },
2448
+ {
2449
+ "accuracy": 1.0,
2450
+ "epoch": 3.17,
2451
+ "step": 200
2452
+ },
2453
+ {
2454
+ "accuracy": 1.0,
2455
+ "epoch": 3.17,
2456
+ "step": 200
2457
+ },
2458
+ {
2459
+ "accuracy": 1.0,
2460
+ "epoch": 3.17,
2461
+ "step": 200
2462
+ },
2463
+ {
2464
+ "accuracy": 1.0,
2465
+ "epoch": 3.17,
2466
+ "step": 200
2467
+ },
2468
+ {
2469
+ "accuracy": 1.0,
2470
+ "epoch": 3.17,
2471
+ "step": 200
2472
+ },
2473
+ {
2474
+ "accuracy": 1.0,
2475
+ "epoch": 3.17,
2476
+ "step": 200
2477
+ },
2478
+ {
2479
+ "accuracy": 1.0,
2480
+ "epoch": 3.17,
2481
+ "step": 200
2482
+ },
2483
+ {
2484
+ "accuracy": 1.0,
2485
+ "epoch": 3.17,
2486
+ "step": 200
2487
+ },
2488
+ {
2489
+ "accuracy": 0.0,
2490
+ "epoch": 3.17,
2491
+ "step": 200
2492
+ },
2493
+ {
2494
+ "accuracy": 1.0,
2495
+ "epoch": 3.17,
2496
+ "step": 200
2497
+ },
2498
+ {
2499
+ "accuracy": 1.0,
2500
+ "epoch": 3.17,
2501
+ "step": 200
2502
+ },
2503
+ {
2504
+ "accuracy": 0.0,
2505
+ "epoch": 3.17,
2506
+ "step": 200
2507
+ },
2508
+ {
2509
+ "accuracy": 0.0,
2510
+ "epoch": 3.17,
2511
+ "step": 200
2512
+ },
2513
+ {
2514
+ "accuracy": 1.0,
2515
+ "epoch": 3.17,
2516
+ "step": 200
2517
+ },
2518
+ {
2519
+ "accuracy": 1.0,
2520
+ "epoch": 3.17,
2521
+ "step": 200
2522
+ },
2523
+ {
2524
+ "accuracy": 1.0,
2525
+ "epoch": 3.17,
2526
+ "step": 200
2527
+ },
2528
+ {
2529
+ "accuracy": 1.0,
2530
+ "epoch": 3.17,
2531
+ "step": 200
2532
+ },
2533
+ {
2534
+ "accuracy": 1.0,
2535
+ "epoch": 3.17,
2536
+ "step": 200
2537
+ },
2538
+ {
2539
+ "accuracy": 1.0,
2540
+ "epoch": 3.17,
2541
+ "step": 200
2542
+ },
2543
+ {
2544
+ "accuracy": 1.0,
2545
+ "epoch": 3.17,
2546
+ "step": 200
2547
+ },
2548
+ {
2549
+ "accuracy": 1.0,
2550
+ "epoch": 3.17,
2551
+ "step": 200
2552
+ },
2553
+ {
2554
+ "accuracy": 1.0,
2555
+ "epoch": 3.17,
2556
+ "step": 200
2557
+ },
2558
+ {
2559
+ "accuracy": 1.0,
2560
+ "epoch": 3.17,
2561
+ "step": 200
2562
+ },
2563
+ {
2564
+ "accuracy": 1.0,
2565
+ "epoch": 3.17,
2566
+ "step": 200
2567
+ },
2568
+ {
2569
+ "accuracy": 1.0,
2570
+ "epoch": 3.17,
2571
+ "step": 200
2572
+ },
2573
+ {
2574
+ "accuracy": 1.0,
2575
+ "epoch": 3.17,
2576
+ "step": 200
2577
+ },
2578
+ {
2579
+ "accuracy": 1.0,
2580
+ "epoch": 3.17,
2581
+ "step": 200
2582
+ },
2583
+ {
2584
+ "accuracy": 1.0,
2585
+ "epoch": 3.17,
2586
+ "step": 200
2587
+ },
2588
+ {
2589
+ "accuracy": 1.0,
2590
+ "epoch": 3.17,
2591
+ "step": 200
2592
+ },
2593
+ {
2594
+ "accuracy": 1.0,
2595
+ "epoch": 3.17,
2596
+ "step": 200
2597
+ },
2598
+ {
2599
+ "accuracy": 1.0,
2600
+ "epoch": 3.17,
2601
+ "step": 200
2602
+ },
2603
+ {
2604
+ "accuracy": 1.0,
2605
+ "epoch": 3.17,
2606
+ "step": 200
2607
+ },
2608
+ {
2609
+ "accuracy": 1.0,
2610
+ "epoch": 3.17,
2611
+ "step": 200
2612
+ },
2613
+ {
2614
+ "accuracy": 1.0,
2615
+ "epoch": 3.17,
2616
+ "step": 200
2617
+ },
2618
+ {
2619
+ "epoch": 3.17,
2620
+ "eval_accuracy": 0.9305555555555556,
2621
+ "eval_loss": 0.20178744196891785,
2622
+ "eval_runtime": 87.0152,
2623
+ "eval_samples_per_second": 0.827,
2624
+ "eval_steps_per_second": 0.827,
2625
+ "step": 200
2626
+ },
2627
+ {
2628
+ "accuracy": 1.0,
2629
+ "epoch": 3.17,
2630
+ "step": 200
2631
+ },
2632
+ {
2633
+ "accuracy": 1.0,
2634
+ "epoch": 4.0,
2635
+ "step": 201
2636
+ },
2637
+ {
2638
+ "accuracy": 1.0,
2639
+ "epoch": 4.01,
2640
+ "step": 202
2641
+ },
2642
+ {
2643
+ "accuracy": 1.0,
2644
+ "epoch": 4.01,
2645
+ "step": 203
2646
+ },
2647
+ {
2648
+ "accuracy": 1.0,
2649
+ "epoch": 4.01,
2650
+ "step": 204
2651
+ },
2652
+ {
2653
+ "accuracy": 1.0,
2654
+ "epoch": 4.02,
2655
+ "step": 205
2656
+ },
2657
+ {
2658
+ "accuracy": 1.0,
2659
+ "epoch": 4.02,
2660
+ "step": 206
2661
+ },
2662
+ {
2663
+ "accuracy": 1.0,
2664
+ "epoch": 4.02,
2665
+ "step": 207
2666
+ },
2667
+ {
2668
+ "accuracy": 1.0,
2669
+ "epoch": 4.03,
2670
+ "step": 208
2671
+ },
2672
+ {
2673
+ "accuracy": 1.0,
2674
+ "epoch": 4.03,
2675
+ "step": 209
2676
+ },
2677
+ {
2678
+ "epoch": 4.03,
2679
+ "grad_norm": 0.01589026115834713,
2680
+ "learning_rate": 1.6666666666666667e-05,
2681
+ "loss": 0.0004,
2682
+ "step": 210
2683
+ },
2684
+ {
2685
+ "accuracy": 1.0,
2686
+ "epoch": 4.03,
2687
+ "step": 210
2688
+ },
2689
+ {
2690
+ "accuracy": 1.0,
2691
+ "epoch": 4.04,
2692
+ "step": 211
2693
+ },
2694
+ {
2695
+ "accuracy": 1.0,
2696
+ "epoch": 4.04,
2697
+ "step": 212
2698
+ },
2699
+ {
2700
+ "accuracy": 1.0,
2701
+ "epoch": 4.04,
2702
+ "step": 213
2703
+ },
2704
+ {
2705
+ "accuracy": 1.0,
2706
+ "epoch": 4.05,
2707
+ "step": 214
2708
+ },
2709
+ {
2710
+ "accuracy": 1.0,
2711
+ "epoch": 4.05,
2712
+ "step": 215
2713
+ },
2714
+ {
2715
+ "accuracy": 1.0,
2716
+ "epoch": 4.05,
2717
+ "step": 216
2718
+ },
2719
+ {
2720
+ "accuracy": 1.0,
2721
+ "epoch": 4.06,
2722
+ "step": 217
2723
+ },
2724
+ {
2725
+ "accuracy": 1.0,
2726
+ "epoch": 4.06,
2727
+ "step": 218
2728
+ },
2729
+ {
2730
+ "accuracy": 1.0,
2731
+ "epoch": 4.06,
2732
+ "step": 219
2733
+ },
2734
+ {
2735
+ "epoch": 4.07,
2736
+ "grad_norm": 0.050206832587718964,
2737
+ "learning_rate": 1.4814814814814815e-05,
2738
+ "loss": 0.0005,
2739
+ "step": 220
2740
+ },
2741
+ {
2742
+ "accuracy": 1.0,
2743
+ "epoch": 4.07,
2744
+ "step": 220
2745
+ },
2746
+ {
2747
+ "accuracy": 1.0,
2748
+ "epoch": 4.07,
2749
+ "step": 221
2750
+ },
2751
+ {
2752
+ "accuracy": 1.0,
2753
+ "epoch": 4.07,
2754
+ "step": 222
2755
+ },
2756
+ {
2757
+ "accuracy": 1.0,
2758
+ "epoch": 4.08,
2759
+ "step": 223
2760
+ },
2761
+ {
2762
+ "accuracy": 1.0,
2763
+ "epoch": 4.08,
2764
+ "step": 224
2765
+ },
2766
+ {
2767
+ "accuracy": 1.0,
2768
+ "epoch": 4.08,
2769
+ "step": 225
2770
+ },
2771
+ {
2772
+ "accuracy": 1.0,
2773
+ "epoch": 4.09,
2774
+ "step": 226
2775
+ },
2776
+ {
2777
+ "accuracy": 1.0,
2778
+ "epoch": 4.09,
2779
+ "step": 227
2780
+ },
2781
+ {
2782
+ "accuracy": 1.0,
2783
+ "epoch": 4.09,
2784
+ "step": 228
2785
+ },
2786
+ {
2787
+ "accuracy": 1.0,
2788
+ "epoch": 4.1,
2789
+ "step": 229
2790
+ },
2791
+ {
2792
+ "epoch": 4.1,
2793
+ "grad_norm": 0.5100947618484497,
2794
+ "learning_rate": 1.2962962962962962e-05,
2795
+ "loss": 0.0024,
2796
+ "step": 230
2797
+ },
2798
+ {
2799
+ "accuracy": 1.0,
2800
+ "epoch": 4.1,
2801
+ "step": 230
2802
+ },
2803
+ {
2804
+ "accuracy": 1.0,
2805
+ "epoch": 4.1,
2806
+ "step": 231
2807
+ },
2808
+ {
2809
+ "accuracy": 1.0,
2810
+ "epoch": 4.11,
2811
+ "step": 232
2812
+ },
2813
+ {
2814
+ "accuracy": 1.0,
2815
+ "epoch": 4.11,
2816
+ "step": 233
2817
+ },
2818
+ {
2819
+ "accuracy": 1.0,
2820
+ "epoch": 4.11,
2821
+ "step": 234
2822
+ },
2823
+ {
2824
+ "accuracy": 1.0,
2825
+ "epoch": 4.12,
2826
+ "step": 235
2827
+ },
2828
+ {
2829
+ "accuracy": 1.0,
2830
+ "epoch": 4.12,
2831
+ "step": 236
2832
+ },
2833
+ {
2834
+ "accuracy": 1.0,
2835
+ "epoch": 4.12,
2836
+ "step": 237
2837
+ },
2838
+ {
2839
+ "accuracy": 1.0,
2840
+ "epoch": 4.13,
2841
+ "step": 238
2842
+ },
2843
+ {
2844
+ "accuracy": 1.0,
2845
+ "epoch": 4.13,
2846
+ "step": 239
2847
+ },
2848
+ {
2849
+ "epoch": 4.13,
2850
+ "grad_norm": 0.014125952497124672,
2851
+ "learning_rate": 1.1111111111111112e-05,
2852
+ "loss": 0.0061,
2853
+ "step": 240
2854
+ },
2855
+ {
2856
+ "accuracy": 1.0,
2857
+ "epoch": 4.13,
2858
+ "step": 240
2859
+ },
2860
+ {
2861
+ "accuracy": 1.0,
2862
+ "epoch": 4.14,
2863
+ "step": 241
2864
+ },
2865
+ {
2866
+ "accuracy": 1.0,
2867
+ "epoch": 4.14,
2868
+ "step": 242
2869
+ },
2870
+ {
2871
+ "accuracy": 1.0,
2872
+ "epoch": 4.14,
2873
+ "step": 243
2874
+ },
2875
+ {
2876
+ "accuracy": 1.0,
2877
+ "epoch": 4.15,
2878
+ "step": 244
2879
+ },
2880
+ {
2881
+ "accuracy": 1.0,
2882
+ "epoch": 4.15,
2883
+ "step": 245
2884
+ },
2885
+ {
2886
+ "accuracy": 1.0,
2887
+ "epoch": 4.15,
2888
+ "step": 246
2889
+ },
2890
+ {
2891
+ "accuracy": 1.0,
2892
+ "epoch": 4.16,
2893
+ "step": 247
2894
+ },
2895
+ {
2896
+ "accuracy": 1.0,
2897
+ "epoch": 4.16,
2898
+ "step": 248
2899
+ },
2900
+ {
2901
+ "accuracy": 1.0,
2902
+ "epoch": 4.16,
2903
+ "step": 249
2904
+ },
2905
+ {
2906
+ "epoch": 4.17,
2907
+ "grad_norm": 0.006884186994284391,
2908
+ "learning_rate": 9.259259259259259e-06,
2909
+ "loss": 0.0006,
2910
+ "step": 250
2911
+ },
2912
+ {
2913
+ "accuracy": 1.0,
2914
+ "epoch": 4.17,
2915
+ "step": 250
2916
+ },
2917
+ {
2918
+ "accuracy": 1.0,
2919
+ "epoch": 4.17,
2920
+ "step": 250
2921
+ },
2922
+ {
2923
+ "accuracy": 1.0,
2924
+ "epoch": 4.17,
2925
+ "step": 250
2926
+ },
2927
+ {
2928
+ "accuracy": 1.0,
2929
+ "epoch": 4.17,
2930
+ "step": 250
2931
+ },
2932
+ {
2933
+ "accuracy": 1.0,
2934
+ "epoch": 4.17,
2935
+ "step": 250
2936
+ },
2937
+ {
2938
+ "accuracy": 0.0,
2939
+ "epoch": 4.17,
2940
+ "step": 250
2941
+ },
2942
+ {
2943
+ "accuracy": 1.0,
2944
+ "epoch": 4.17,
2945
+ "step": 250
2946
+ },
2947
+ {
2948
+ "accuracy": 1.0,
2949
+ "epoch": 4.17,
2950
+ "step": 250
2951
+ },
2952
+ {
2953
+ "accuracy": 1.0,
2954
+ "epoch": 4.17,
2955
+ "step": 250
2956
+ },
2957
+ {
2958
+ "accuracy": 1.0,
2959
+ "epoch": 4.17,
2960
+ "step": 250
2961
+ },
2962
+ {
2963
+ "accuracy": 1.0,
2964
+ "epoch": 4.17,
2965
+ "step": 250
2966
+ },
2967
+ {
2968
+ "accuracy": 1.0,
2969
+ "epoch": 4.17,
2970
+ "step": 250
2971
+ },
2972
+ {
2973
+ "accuracy": 1.0,
2974
+ "epoch": 4.17,
2975
+ "step": 250
2976
+ },
2977
+ {
2978
+ "accuracy": 1.0,
2979
+ "epoch": 4.17,
2980
+ "step": 250
2981
+ },
2982
+ {
2983
+ "accuracy": 1.0,
2984
+ "epoch": 4.17,
2985
+ "step": 250
2986
+ },
2987
+ {
2988
+ "accuracy": 1.0,
2989
+ "epoch": 4.17,
2990
+ "step": 250
2991
+ },
2992
+ {
2993
+ "accuracy": 1.0,
2994
+ "epoch": 4.17,
2995
+ "step": 250
2996
+ },
2997
+ {
2998
+ "accuracy": 1.0,
2999
+ "epoch": 4.17,
3000
+ "step": 250
3001
+ },
3002
+ {
3003
+ "accuracy": 1.0,
3004
+ "epoch": 4.17,
3005
+ "step": 250
3006
+ },
3007
+ {
3008
+ "accuracy": 1.0,
3009
+ "epoch": 4.17,
3010
+ "step": 250
3011
+ },
3012
+ {
3013
+ "accuracy": 0.0,
3014
+ "epoch": 4.17,
3015
+ "step": 250
3016
+ },
3017
+ {
3018
+ "accuracy": 0.0,
3019
+ "epoch": 4.17,
3020
+ "step": 250
3021
+ },
3022
+ {
3023
+ "accuracy": 0.0,
3024
+ "epoch": 4.17,
3025
+ "step": 250
3026
+ },
3027
+ {
3028
+ "accuracy": 1.0,
3029
+ "epoch": 4.17,
3030
+ "step": 250
3031
+ },
3032
+ {
3033
+ "accuracy": 1.0,
3034
+ "epoch": 4.17,
3035
+ "step": 250
3036
+ },
3037
+ {
3038
+ "accuracy": 0.0,
3039
+ "epoch": 4.17,
3040
+ "step": 250
3041
+ },
3042
+ {
3043
+ "accuracy": 1.0,
3044
+ "epoch": 4.17,
3045
+ "step": 250
3046
+ },
3047
+ {
3048
+ "accuracy": 1.0,
3049
+ "epoch": 4.17,
3050
+ "step": 250
3051
+ },
3052
+ {
3053
+ "accuracy": 1.0,
3054
+ "epoch": 4.17,
3055
+ "step": 250
3056
+ },
3057
+ {
3058
+ "accuracy": 1.0,
3059
+ "epoch": 4.17,
3060
+ "step": 250
3061
+ },
3062
+ {
3063
+ "accuracy": 1.0,
3064
+ "epoch": 4.17,
3065
+ "step": 250
3066
+ },
3067
+ {
3068
+ "accuracy": 1.0,
3069
+ "epoch": 4.17,
3070
+ "step": 250
3071
+ },
3072
+ {
3073
+ "accuracy": 1.0,
3074
+ "epoch": 4.17,
3075
+ "step": 250
3076
+ },
3077
+ {
3078
+ "accuracy": 1.0,
3079
+ "epoch": 4.17,
3080
+ "step": 250
3081
+ },
3082
+ {
3083
+ "accuracy": 1.0,
3084
+ "epoch": 4.17,
3085
+ "step": 250
3086
+ },
3087
+ {
3088
+ "accuracy": 1.0,
3089
+ "epoch": 4.17,
3090
+ "step": 250
3091
+ },
3092
+ {
3093
+ "accuracy": 1.0,
3094
+ "epoch": 4.17,
3095
+ "step": 250
3096
+ },
3097
+ {
3098
+ "accuracy": 1.0,
3099
+ "epoch": 4.17,
3100
+ "step": 250
3101
+ },
3102
+ {
3103
+ "accuracy": 1.0,
3104
+ "epoch": 4.17,
3105
+ "step": 250
3106
+ },
3107
+ {
3108
+ "accuracy": 1.0,
3109
+ "epoch": 4.17,
3110
+ "step": 250
3111
+ },
3112
+ {
3113
+ "accuracy": 1.0,
3114
+ "epoch": 4.17,
3115
+ "step": 250
3116
+ },
3117
+ {
3118
+ "accuracy": 1.0,
3119
+ "epoch": 4.17,
3120
+ "step": 250
3121
+ },
3122
+ {
3123
+ "accuracy": 1.0,
3124
+ "epoch": 4.17,
3125
+ "step": 250
3126
+ },
3127
+ {
3128
+ "accuracy": 1.0,
3129
+ "epoch": 4.17,
3130
+ "step": 250
3131
+ },
3132
+ {
3133
+ "accuracy": 1.0,
3134
+ "epoch": 4.17,
3135
+ "step": 250
3136
+ },
3137
+ {
3138
+ "accuracy": 1.0,
3139
+ "epoch": 4.17,
3140
+ "step": 250
3141
+ },
3142
+ {
3143
+ "accuracy": 1.0,
3144
+ "epoch": 4.17,
3145
+ "step": 250
3146
+ },
3147
+ {
3148
+ "accuracy": 1.0,
3149
+ "epoch": 4.17,
3150
+ "step": 250
3151
+ },
3152
+ {
3153
+ "accuracy": 0.0,
3154
+ "epoch": 4.17,
3155
+ "step": 250
3156
+ },
3157
+ {
3158
+ "accuracy": 1.0,
3159
+ "epoch": 4.17,
3160
+ "step": 250
3161
+ },
3162
+ {
3163
+ "accuracy": 1.0,
3164
+ "epoch": 4.17,
3165
+ "step": 250
3166
+ },
3167
+ {
3168
+ "accuracy": 1.0,
3169
+ "epoch": 4.17,
3170
+ "step": 250
3171
+ },
3172
+ {
3173
+ "accuracy": 1.0,
3174
+ "epoch": 4.17,
3175
+ "step": 250
3176
+ },
3177
+ {
3178
+ "accuracy": 1.0,
3179
+ "epoch": 4.17,
3180
+ "step": 250
3181
+ },
3182
+ {
3183
+ "accuracy": 1.0,
3184
+ "epoch": 4.17,
3185
+ "step": 250
3186
+ },
3187
+ {
3188
+ "accuracy": 1.0,
3189
+ "epoch": 4.17,
3190
+ "step": 250
3191
+ },
3192
+ {
3193
+ "accuracy": 1.0,
3194
+ "epoch": 4.17,
3195
+ "step": 250
3196
+ },
3197
+ {
3198
+ "accuracy": 1.0,
3199
+ "epoch": 4.17,
3200
+ "step": 250
3201
+ },
3202
+ {
3203
+ "accuracy": 1.0,
3204
+ "epoch": 4.17,
3205
+ "step": 250
3206
+ },
3207
+ {
3208
+ "accuracy": 1.0,
3209
+ "epoch": 4.17,
3210
+ "step": 250
3211
+ },
3212
+ {
3213
+ "accuracy": 1.0,
3214
+ "epoch": 4.17,
3215
+ "step": 250
3216
+ },
3217
+ {
3218
+ "accuracy": 1.0,
3219
+ "epoch": 4.17,
3220
+ "step": 250
3221
+ },
3222
+ {
3223
+ "accuracy": 1.0,
3224
+ "epoch": 4.17,
3225
+ "step": 250
3226
+ },
3227
+ {
3228
+ "accuracy": 1.0,
3229
+ "epoch": 4.17,
3230
+ "step": 250
3231
+ },
3232
+ {
3233
+ "accuracy": 1.0,
3234
+ "epoch": 4.17,
3235
+ "step": 250
3236
+ },
3237
+ {
3238
+ "accuracy": 0.0,
3239
+ "epoch": 4.17,
3240
+ "step": 250
3241
+ },
3242
+ {
3243
+ "accuracy": 1.0,
3244
+ "epoch": 4.17,
3245
+ "step": 250
3246
+ },
3247
+ {
3248
+ "accuracy": 1.0,
3249
+ "epoch": 4.17,
3250
+ "step": 250
3251
+ },
3252
+ {
3253
+ "accuracy": 1.0,
3254
+ "epoch": 4.17,
3255
+ "step": 250
3256
+ },
3257
+ {
3258
+ "accuracy": 1.0,
3259
+ "epoch": 4.17,
3260
+ "step": 250
3261
+ },
3262
+ {
3263
+ "accuracy": 1.0,
3264
+ "epoch": 4.17,
3265
+ "step": 250
3266
+ },
3267
+ {
3268
+ "accuracy": 1.0,
3269
+ "epoch": 4.17,
3270
+ "step": 250
3271
+ },
3272
+ {
3273
+ "epoch": 4.17,
3274
+ "eval_accuracy": 0.9027777777777778,
3275
+ "eval_loss": 0.23007525503635406,
3276
+ "eval_runtime": 86.171,
3277
+ "eval_samples_per_second": 0.836,
3278
+ "eval_steps_per_second": 0.836,
3279
+ "step": 250
3280
+ },
3281
+ {
3282
+ "accuracy": 1.0,
3283
+ "epoch": 4.17,
3284
+ "step": 250
3285
+ },
3286
+ {
3287
+ "accuracy": 1.0,
3288
+ "epoch": 5.0,
3289
+ "step": 251
3290
+ },
3291
+ {
3292
+ "accuracy": 1.0,
3293
+ "epoch": 5.01,
3294
+ "step": 252
3295
+ },
3296
+ {
3297
+ "accuracy": 1.0,
3298
+ "epoch": 5.01,
3299
+ "step": 253
3300
+ },
3301
+ {
3302
+ "accuracy": 1.0,
3303
+ "epoch": 5.01,
3304
+ "step": 254
3305
+ },
3306
+ {
3307
+ "accuracy": 1.0,
3308
+ "epoch": 5.02,
3309
+ "step": 255
3310
+ },
3311
+ {
3312
+ "accuracy": 1.0,
3313
+ "epoch": 5.02,
3314
+ "step": 256
3315
+ },
3316
+ {
3317
+ "accuracy": 1.0,
3318
+ "epoch": 5.02,
3319
+ "step": 257
3320
+ },
3321
+ {
3322
+ "accuracy": 1.0,
3323
+ "epoch": 5.03,
3324
+ "step": 258
3325
+ },
3326
+ {
3327
+ "accuracy": 1.0,
3328
+ "epoch": 5.03,
3329
+ "step": 259
3330
+ },
3331
+ {
3332
+ "epoch": 5.03,
3333
+ "grad_norm": 16.793664932250977,
3334
+ "learning_rate": 7.4074074074074075e-06,
3335
+ "loss": 0.0225,
3336
+ "step": 260
3337
+ },
3338
+ {
3339
+ "accuracy": 1.0,
3340
+ "epoch": 5.03,
3341
+ "step": 260
3342
+ },
3343
+ {
3344
+ "accuracy": 1.0,
3345
+ "epoch": 5.04,
3346
+ "step": 261
3347
+ },
3348
+ {
3349
+ "accuracy": 1.0,
3350
+ "epoch": 5.04,
3351
+ "step": 262
3352
+ },
3353
+ {
3354
+ "accuracy": 1.0,
3355
+ "epoch": 5.04,
3356
+ "step": 263
3357
+ },
3358
+ {
3359
+ "accuracy": 1.0,
3360
+ "epoch": 5.05,
3361
+ "step": 264
3362
+ },
3363
+ {
3364
+ "accuracy": 1.0,
3365
+ "epoch": 5.05,
3366
+ "step": 265
3367
+ },
3368
+ {
3369
+ "accuracy": 1.0,
3370
+ "epoch": 5.05,
3371
+ "step": 266
3372
+ },
3373
+ {
3374
+ "accuracy": 1.0,
3375
+ "epoch": 5.06,
3376
+ "step": 267
3377
+ },
3378
+ {
3379
+ "accuracy": 1.0,
3380
+ "epoch": 5.06,
3381
+ "step": 268
3382
+ },
3383
+ {
3384
+ "accuracy": 1.0,
3385
+ "epoch": 5.06,
3386
+ "step": 269
3387
+ },
3388
+ {
3389
+ "epoch": 5.07,
3390
+ "grad_norm": 0.021140459924936295,
3391
+ "learning_rate": 5.555555555555556e-06,
3392
+ "loss": 0.0042,
3393
+ "step": 270
3394
+ },
3395
+ {
3396
+ "accuracy": 1.0,
3397
+ "epoch": 5.07,
3398
+ "step": 270
3399
+ },
3400
+ {
3401
+ "accuracy": 1.0,
3402
+ "epoch": 5.07,
3403
+ "step": 271
3404
+ },
3405
+ {
3406
+ "accuracy": 1.0,
3407
+ "epoch": 5.07,
3408
+ "step": 272
3409
+ },
3410
+ {
3411
+ "accuracy": 1.0,
3412
+ "epoch": 5.08,
3413
+ "step": 273
3414
+ },
3415
+ {
3416
+ "accuracy": 1.0,
3417
+ "epoch": 5.08,
3418
+ "step": 274
3419
+ },
3420
+ {
3421
+ "accuracy": 1.0,
3422
+ "epoch": 5.08,
3423
+ "step": 275
3424
+ },
3425
+ {
3426
+ "accuracy": 1.0,
3427
+ "epoch": 5.09,
3428
+ "step": 276
3429
+ },
3430
+ {
3431
+ "accuracy": 1.0,
3432
+ "epoch": 5.09,
3433
+ "step": 277
3434
+ },
3435
+ {
3436
+ "accuracy": 1.0,
3437
+ "epoch": 5.09,
3438
+ "step": 278
3439
+ },
3440
+ {
3441
+ "accuracy": 1.0,
3442
+ "epoch": 5.1,
3443
+ "step": 279
3444
+ },
3445
+ {
3446
+ "epoch": 5.1,
3447
+ "grad_norm": 0.022641900926828384,
3448
+ "learning_rate": 3.7037037037037037e-06,
3449
+ "loss": 0.0031,
3450
+ "step": 280
3451
+ },
3452
+ {
3453
+ "accuracy": 1.0,
3454
+ "epoch": 5.1,
3455
+ "step": 280
3456
+ },
3457
+ {
3458
+ "accuracy": 1.0,
3459
+ "epoch": 5.1,
3460
+ "step": 281
3461
+ },
3462
+ {
3463
+ "accuracy": 1.0,
3464
+ "epoch": 5.11,
3465
+ "step": 282
3466
+ },
3467
+ {
3468
+ "accuracy": 1.0,
3469
+ "epoch": 5.11,
3470
+ "step": 283
3471
+ },
3472
+ {
3473
+ "accuracy": 1.0,
3474
+ "epoch": 5.11,
3475
+ "step": 284
3476
+ },
3477
+ {
3478
+ "accuracy": 1.0,
3479
+ "epoch": 5.12,
3480
+ "step": 285
3481
+ },
3482
+ {
3483
+ "accuracy": 1.0,
3484
+ "epoch": 5.12,
3485
+ "step": 286
3486
+ },
3487
+ {
3488
+ "accuracy": 1.0,
3489
+ "epoch": 5.12,
3490
+ "step": 287
3491
+ },
3492
+ {
3493
+ "accuracy": 1.0,
3494
+ "epoch": 5.13,
3495
+ "step": 288
3496
+ },
3497
+ {
3498
+ "accuracy": 1.0,
3499
+ "epoch": 5.13,
3500
+ "step": 289
3501
+ },
3502
+ {
3503
+ "epoch": 5.13,
3504
+ "grad_norm": 0.04152318835258484,
3505
+ "learning_rate": 1.8518518518518519e-06,
3506
+ "loss": 0.0281,
3507
+ "step": 290
3508
+ },
3509
+ {
3510
+ "accuracy": 1.0,
3511
+ "epoch": 5.13,
3512
+ "step": 290
3513
+ },
3514
+ {
3515
+ "accuracy": 1.0,
3516
+ "epoch": 5.14,
3517
+ "step": 291
3518
+ },
3519
+ {
3520
+ "accuracy": 1.0,
3521
+ "epoch": 5.14,
3522
+ "step": 292
3523
+ },
3524
+ {
3525
+ "accuracy": 1.0,
3526
+ "epoch": 5.14,
3527
+ "step": 293
3528
+ },
3529
+ {
3530
+ "accuracy": 1.0,
3531
+ "epoch": 5.15,
3532
+ "step": 294
3533
+ },
3534
+ {
3535
+ "accuracy": 1.0,
3536
+ "epoch": 5.15,
3537
+ "step": 295
3538
+ },
3539
+ {
3540
+ "accuracy": 1.0,
3541
+ "epoch": 5.15,
3542
+ "step": 296
3543
+ },
3544
+ {
3545
+ "accuracy": 1.0,
3546
+ "epoch": 5.16,
3547
+ "step": 297
3548
+ },
3549
+ {
3550
+ "accuracy": 1.0,
3551
+ "epoch": 5.16,
3552
+ "step": 298
3553
+ },
3554
+ {
3555
+ "accuracy": 1.0,
3556
+ "epoch": 5.16,
3557
+ "step": 299
3558
+ },
3559
+ {
3560
+ "epoch": 5.17,
3561
+ "grad_norm": 0.007157259155064821,
3562
+ "learning_rate": 0.0,
3563
+ "loss": 0.0004,
3564
+ "step": 300
3565
+ },
3566
+ {
3567
+ "accuracy": 1.0,
3568
+ "epoch": 5.17,
3569
+ "step": 300
3570
+ },
3571
+ {
3572
+ "accuracy": 1.0,
3573
+ "epoch": 5.17,
3574
+ "step": 300
3575
+ },
3576
+ {
3577
+ "accuracy": 1.0,
3578
+ "epoch": 5.17,
3579
+ "step": 300
3580
+ },
3581
+ {
3582
+ "accuracy": 1.0,
3583
+ "epoch": 5.17,
3584
+ "step": 300
3585
+ },
3586
+ {
3587
+ "accuracy": 1.0,
3588
+ "epoch": 5.17,
3589
+ "step": 300
3590
+ },
3591
+ {
3592
+ "accuracy": 1.0,
3593
+ "epoch": 5.17,
3594
+ "step": 300
3595
+ },
3596
+ {
3597
+ "accuracy": 1.0,
3598
+ "epoch": 5.17,
3599
+ "step": 300
3600
+ },
3601
+ {
3602
+ "accuracy": 1.0,
3603
+ "epoch": 5.17,
3604
+ "step": 300
3605
+ },
3606
+ {
3607
+ "accuracy": 0.0,
3608
+ "epoch": 5.17,
3609
+ "step": 300
3610
+ },
3611
+ {
3612
+ "accuracy": 1.0,
3613
+ "epoch": 5.17,
3614
+ "step": 300
3615
+ },
3616
+ {
3617
+ "accuracy": 0.0,
3618
+ "epoch": 5.17,
3619
+ "step": 300
3620
+ },
3621
+ {
3622
+ "accuracy": 0.0,
3623
+ "epoch": 5.17,
3624
+ "step": 300
3625
+ },
3626
+ {
3627
+ "accuracy": 1.0,
3628
+ "epoch": 5.17,
3629
+ "step": 300
3630
+ },
3631
+ {
3632
+ "accuracy": 1.0,
3633
+ "epoch": 5.17,
3634
+ "step": 300
3635
+ },
3636
+ {
3637
+ "accuracy": 1.0,
3638
+ "epoch": 5.17,
3639
+ "step": 300
3640
+ },
3641
+ {
3642
+ "accuracy": 1.0,
3643
+ "epoch": 5.17,
3644
+ "step": 300
3645
+ },
3646
+ {
3647
+ "accuracy": 1.0,
3648
+ "epoch": 5.17,
3649
+ "step": 300
3650
+ },
3651
+ {
3652
+ "accuracy": 0.0,
3653
+ "epoch": 5.17,
3654
+ "step": 300
3655
+ },
3656
+ {
3657
+ "accuracy": 1.0,
3658
+ "epoch": 5.17,
3659
+ "step": 300
3660
+ },
3661
+ {
3662
+ "accuracy": 1.0,
3663
+ "epoch": 5.17,
3664
+ "step": 300
3665
+ },
3666
+ {
3667
+ "accuracy": 1.0,
3668
+ "epoch": 5.17,
3669
+ "step": 300
3670
+ },
3671
+ {
3672
+ "accuracy": 1.0,
3673
+ "epoch": 5.17,
3674
+ "step": 300
3675
+ },
3676
+ {
3677
+ "accuracy": 1.0,
3678
+ "epoch": 5.17,
3679
+ "step": 300
3680
+ },
3681
+ {
3682
+ "accuracy": 1.0,
3683
+ "epoch": 5.17,
3684
+ "step": 300
3685
+ },
3686
+ {
3687
+ "accuracy": 1.0,
3688
+ "epoch": 5.17,
3689
+ "step": 300
3690
+ },
3691
+ {
3692
+ "accuracy": 1.0,
3693
+ "epoch": 5.17,
3694
+ "step": 300
3695
+ },
3696
+ {
3697
+ "accuracy": 1.0,
3698
+ "epoch": 5.17,
3699
+ "step": 300
3700
+ },
3701
+ {
3702
+ "accuracy": 1.0,
3703
+ "epoch": 5.17,
3704
+ "step": 300
3705
+ },
3706
+ {
3707
+ "accuracy": 1.0,
3708
+ "epoch": 5.17,
3709
+ "step": 300
3710
+ },
3711
+ {
3712
+ "accuracy": 1.0,
3713
+ "epoch": 5.17,
3714
+ "step": 300
3715
+ },
3716
+ {
3717
+ "accuracy": 1.0,
3718
+ "epoch": 5.17,
3719
+ "step": 300
3720
+ },
3721
+ {
3722
+ "accuracy": 1.0,
3723
+ "epoch": 5.17,
3724
+ "step": 300
3725
+ },
3726
+ {
3727
+ "accuracy": 1.0,
3728
+ "epoch": 5.17,
3729
+ "step": 300
3730
+ },
3731
+ {
3732
+ "accuracy": 1.0,
3733
+ "epoch": 5.17,
3734
+ "step": 300
3735
+ },
3736
+ {
3737
+ "accuracy": 1.0,
3738
+ "epoch": 5.17,
3739
+ "step": 300
3740
+ },
3741
+ {
3742
+ "accuracy": 1.0,
3743
+ "epoch": 5.17,
3744
+ "step": 300
3745
+ },
3746
+ {
3747
+ "accuracy": 1.0,
3748
+ "epoch": 5.17,
3749
+ "step": 300
3750
+ },
3751
+ {
3752
+ "accuracy": 1.0,
3753
+ "epoch": 5.17,
3754
+ "step": 300
3755
+ },
3756
+ {
3757
+ "accuracy": 1.0,
3758
+ "epoch": 5.17,
3759
+ "step": 300
3760
+ },
3761
+ {
3762
+ "accuracy": 1.0,
3763
+ "epoch": 5.17,
3764
+ "step": 300
3765
+ },
3766
+ {
3767
+ "accuracy": 1.0,
3768
+ "epoch": 5.17,
3769
+ "step": 300
3770
+ },
3771
+ {
3772
+ "accuracy": 1.0,
3773
+ "epoch": 5.17,
3774
+ "step": 300
3775
+ },
3776
+ {
3777
+ "accuracy": 1.0,
3778
+ "epoch": 5.17,
3779
+ "step": 300
3780
+ },
3781
+ {
3782
+ "accuracy": 1.0,
3783
+ "epoch": 5.17,
3784
+ "step": 300
3785
+ },
3786
+ {
3787
+ "accuracy": 1.0,
3788
+ "epoch": 5.17,
3789
+ "step": 300
3790
+ },
3791
+ {
3792
+ "accuracy": 1.0,
3793
+ "epoch": 5.17,
3794
+ "step": 300
3795
+ },
3796
+ {
3797
+ "accuracy": 1.0,
3798
+ "epoch": 5.17,
3799
+ "step": 300
3800
+ },
3801
+ {
3802
+ "accuracy": 1.0,
3803
+ "epoch": 5.17,
3804
+ "step": 300
3805
+ },
3806
+ {
3807
+ "accuracy": 1.0,
3808
+ "epoch": 5.17,
3809
+ "step": 300
3810
+ },
3811
+ {
3812
+ "accuracy": 1.0,
3813
+ "epoch": 5.17,
3814
+ "step": 300
3815
+ },
3816
+ {
3817
+ "accuracy": 1.0,
3818
+ "epoch": 5.17,
3819
+ "step": 300
3820
+ },
3821
+ {
3822
+ "accuracy": 1.0,
3823
+ "epoch": 5.17,
3824
+ "step": 300
3825
+ },
3826
+ {
3827
+ "accuracy": 1.0,
3828
+ "epoch": 5.17,
3829
+ "step": 300
3830
+ },
3831
+ {
3832
+ "accuracy": 1.0,
3833
+ "epoch": 5.17,
3834
+ "step": 300
3835
+ },
3836
+ {
3837
+ "accuracy": 1.0,
3838
+ "epoch": 5.17,
3839
+ "step": 300
3840
+ },
3841
+ {
3842
+ "accuracy": 1.0,
3843
+ "epoch": 5.17,
3844
+ "step": 300
3845
+ },
3846
+ {
3847
+ "accuracy": 1.0,
3848
+ "epoch": 5.17,
3849
+ "step": 300
3850
+ },
3851
+ {
3852
+ "accuracy": 0.0,
3853
+ "epoch": 5.17,
3854
+ "step": 300
3855
+ },
3856
+ {
3857
+ "accuracy": 1.0,
3858
+ "epoch": 5.17,
3859
+ "step": 300
3860
+ },
3861
+ {
3862
+ "accuracy": 1.0,
3863
+ "epoch": 5.17,
3864
+ "step": 300
3865
+ },
3866
+ {
3867
+ "accuracy": 1.0,
3868
+ "epoch": 5.17,
3869
+ "step": 300
3870
+ },
3871
+ {
3872
+ "accuracy": 1.0,
3873
+ "epoch": 5.17,
3874
+ "step": 300
3875
+ },
3876
+ {
3877
+ "accuracy": 1.0,
3878
+ "epoch": 5.17,
3879
+ "step": 300
3880
+ },
3881
+ {
3882
+ "accuracy": 1.0,
3883
+ "epoch": 5.17,
3884
+ "step": 300
3885
+ },
3886
+ {
3887
+ "accuracy": 1.0,
3888
+ "epoch": 5.17,
3889
+ "step": 300
3890
+ },
3891
+ {
3892
+ "accuracy": 1.0,
3893
+ "epoch": 5.17,
3894
+ "step": 300
3895
+ },
3896
+ {
3897
+ "accuracy": 1.0,
3898
+ "epoch": 5.17,
3899
+ "step": 300
3900
+ },
3901
+ {
3902
+ "accuracy": 1.0,
3903
+ "epoch": 5.17,
3904
+ "step": 300
3905
+ },
3906
+ {
3907
+ "accuracy": 1.0,
3908
+ "epoch": 5.17,
3909
+ "step": 300
3910
+ },
3911
+ {
3912
+ "accuracy": 1.0,
3913
+ "epoch": 5.17,
3914
+ "step": 300
3915
+ },
3916
+ {
3917
+ "accuracy": 1.0,
3918
+ "epoch": 5.17,
3919
+ "step": 300
3920
+ },
3921
+ {
3922
+ "accuracy": 1.0,
3923
+ "epoch": 5.17,
3924
+ "step": 300
3925
+ },
3926
+ {
3927
+ "epoch": 5.17,
3928
+ "eval_accuracy": 0.9305555555555556,
3929
+ "eval_loss": 0.1562785804271698,
3930
+ "eval_runtime": 84.9053,
3931
+ "eval_samples_per_second": 0.848,
3932
+ "eval_steps_per_second": 0.848,
3933
+ "step": 300
3934
+ },
3935
+ {
3936
+ "epoch": 5.17,
3937
+ "step": 300,
3938
+ "total_flos": 3.28565653420032e+17,
3939
+ "train_loss": 0.19429689762337754,
3940
+ "train_runtime": 808.927,
3941
+ "train_samples_per_second": 0.371,
3942
+ "train_steps_per_second": 0.371
3943
+ },
3944
+ {
3945
+ "accuracy": 1.0,
3946
+ "epoch": 5.17,
3947
+ "step": 300
3948
+ },
3949
+ {
3950
+ "accuracy": 1.0,
3951
+ "epoch": 5.17,
3952
+ "step": 300
3953
+ },
3954
+ {
3955
+ "accuracy": 1.0,
3956
+ "epoch": 5.17,
3957
+ "step": 300
3958
+ },
3959
+ {
3960
+ "accuracy": 1.0,
3961
+ "epoch": 5.17,
3962
+ "step": 300
3963
+ },
3964
+ {
3965
+ "accuracy": 1.0,
3966
+ "epoch": 5.17,
3967
+ "step": 300
3968
+ },
3969
+ {
3970
+ "accuracy": 1.0,
3971
+ "epoch": 5.17,
3972
+ "step": 300
3973
+ },
3974
+ {
3975
+ "accuracy": 1.0,
3976
+ "epoch": 5.17,
3977
+ "step": 300
3978
+ },
3979
+ {
3980
+ "accuracy": 1.0,
3981
+ "epoch": 5.17,
3982
+ "step": 300
3983
+ },
3984
+ {
3985
+ "accuracy": 1.0,
3986
+ "epoch": 5.17,
3987
+ "step": 300
3988
+ },
3989
+ {
3990
+ "accuracy": 1.0,
3991
+ "epoch": 5.17,
3992
+ "step": 300
3993
+ },
3994
+ {
3995
+ "accuracy": 1.0,
3996
+ "epoch": 5.17,
3997
+ "step": 300
3998
+ },
3999
+ {
4000
+ "accuracy": 1.0,
4001
+ "epoch": 5.17,
4002
+ "step": 300
4003
+ },
4004
+ {
4005
+ "accuracy": 1.0,
4006
+ "epoch": 5.17,
4007
+ "step": 300
4008
+ },
4009
+ {
4010
+ "accuracy": 1.0,
4011
+ "epoch": 5.17,
4012
+ "step": 300
4013
+ },
4014
+ {
4015
+ "accuracy": 1.0,
4016
+ "epoch": 5.17,
4017
+ "step": 300
4018
+ },
4019
+ {
4020
+ "accuracy": 1.0,
4021
+ "epoch": 5.17,
4022
+ "step": 300
4023
+ },
4024
+ {
4025
+ "accuracy": 1.0,
4026
+ "epoch": 5.17,
4027
+ "step": 300
4028
+ },
4029
+ {
4030
+ "accuracy": 1.0,
4031
+ "epoch": 5.17,
4032
+ "step": 300
4033
+ },
4034
+ {
4035
+ "accuracy": 1.0,
4036
+ "epoch": 5.17,
4037
+ "step": 300
4038
+ },
4039
+ {
4040
+ "accuracy": 1.0,
4041
+ "epoch": 5.17,
4042
+ "step": 300
4043
+ },
4044
+ {
4045
+ "accuracy": 1.0,
4046
+ "epoch": 5.17,
4047
+ "step": 300
4048
+ },
4049
+ {
4050
+ "accuracy": 1.0,
4051
+ "epoch": 5.17,
4052
+ "step": 300
4053
+ },
4054
+ {
4055
+ "accuracy": 1.0,
4056
+ "epoch": 5.17,
4057
+ "step": 300
4058
+ },
4059
+ {
4060
+ "accuracy": 1.0,
4061
+ "epoch": 5.17,
4062
+ "step": 300
4063
+ },
4064
+ {
4065
+ "accuracy": 1.0,
4066
+ "epoch": 5.17,
4067
+ "step": 300
4068
+ },
4069
+ {
4070
+ "accuracy": 1.0,
4071
+ "epoch": 5.17,
4072
+ "step": 300
4073
+ },
4074
+ {
4075
+ "accuracy": 1.0,
4076
+ "epoch": 5.17,
4077
+ "step": 300
4078
+ },
4079
+ {
4080
+ "accuracy": 1.0,
4081
+ "epoch": 5.17,
4082
+ "step": 300
4083
+ },
4084
+ {
4085
+ "accuracy": 1.0,
4086
+ "epoch": 5.17,
4087
+ "step": 300
4088
+ },
4089
+ {
4090
+ "accuracy": 1.0,
4091
+ "epoch": 5.17,
4092
+ "step": 300
4093
+ },
4094
+ {
4095
+ "accuracy": 1.0,
4096
+ "epoch": 5.17,
4097
+ "step": 300
4098
+ },
4099
+ {
4100
+ "accuracy": 1.0,
4101
+ "epoch": 5.17,
4102
+ "step": 300
4103
+ },
4104
+ {
4105
+ "accuracy": 1.0,
4106
+ "epoch": 5.17,
4107
+ "step": 300
4108
+ },
4109
+ {
4110
+ "accuracy": 1.0,
4111
+ "epoch": 5.17,
4112
+ "step": 300
4113
+ },
4114
+ {
4115
+ "accuracy": 0.0,
4116
+ "epoch": 5.17,
4117
+ "step": 300
4118
+ },
4119
+ {
4120
+ "accuracy": 0.0,
4121
+ "epoch": 5.17,
4122
+ "step": 300
4123
+ },
4124
+ {
4125
+ "accuracy": 0.0,
4126
+ "epoch": 5.17,
4127
+ "step": 300
4128
+ },
4129
+ {
4130
+ "accuracy": 1.0,
4131
+ "epoch": 5.17,
4132
+ "step": 300
4133
+ },
4134
+ {
4135
+ "accuracy": 1.0,
4136
+ "epoch": 5.17,
4137
+ "step": 300
4138
+ },
4139
+ {
4140
+ "accuracy": 1.0,
4141
+ "epoch": 5.17,
4142
+ "step": 300
4143
+ },
4144
+ {
4145
+ "accuracy": 1.0,
4146
+ "epoch": 5.17,
4147
+ "step": 300
4148
+ },
4149
+ {
4150
+ "accuracy": 1.0,
4151
+ "epoch": 5.17,
4152
+ "step": 300
4153
+ },
4154
+ {
4155
+ "accuracy": 1.0,
4156
+ "epoch": 5.17,
4157
+ "step": 300
4158
+ },
4159
+ {
4160
+ "accuracy": 1.0,
4161
+ "epoch": 5.17,
4162
+ "step": 300
4163
+ },
4164
+ {
4165
+ "accuracy": 1.0,
4166
+ "epoch": 5.17,
4167
+ "step": 300
4168
+ },
4169
+ {
4170
+ "accuracy": 1.0,
4171
+ "epoch": 5.17,
4172
+ "step": 300
4173
+ },
4174
+ {
4175
+ "accuracy": 1.0,
4176
+ "epoch": 5.17,
4177
+ "step": 300
4178
+ },
4179
+ {
4180
+ "accuracy": 1.0,
4181
+ "epoch": 5.17,
4182
+ "step": 300
4183
+ },
4184
+ {
4185
+ "accuracy": 1.0,
4186
+ "epoch": 5.17,
4187
+ "step": 300
4188
+ },
4189
+ {
4190
+ "accuracy": 1.0,
4191
+ "epoch": 5.17,
4192
+ "step": 300
4193
+ },
4194
+ {
4195
+ "accuracy": 1.0,
4196
+ "epoch": 5.17,
4197
+ "step": 300
4198
+ },
4199
+ {
4200
+ "accuracy": 1.0,
4201
+ "epoch": 5.17,
4202
+ "step": 300
4203
+ },
4204
+ {
4205
+ "accuracy": 1.0,
4206
+ "epoch": 5.17,
4207
+ "step": 300
4208
+ },
4209
+ {
4210
+ "accuracy": 1.0,
4211
+ "epoch": 5.17,
4212
+ "step": 300
4213
+ },
4214
+ {
4215
+ "accuracy": 1.0,
4216
+ "epoch": 5.17,
4217
+ "step": 300
4218
+ },
4219
+ {
4220
+ "accuracy": 1.0,
4221
+ "epoch": 5.17,
4222
+ "step": 300
4223
+ },
4224
+ {
4225
+ "accuracy": 1.0,
4226
+ "epoch": 5.17,
4227
+ "step": 300
4228
+ },
4229
+ {
4230
+ "accuracy": 1.0,
4231
+ "epoch": 5.17,
4232
+ "step": 300
4233
+ },
4234
+ {
4235
+ "accuracy": 1.0,
4236
+ "epoch": 5.17,
4237
+ "step": 300
4238
+ },
4239
+ {
4240
+ "accuracy": 1.0,
4241
+ "epoch": 5.17,
4242
+ "step": 300
4243
+ },
4244
+ {
4245
+ "accuracy": 1.0,
4246
+ "epoch": 5.17,
4247
+ "step": 300
4248
+ },
4249
+ {
4250
+ "accuracy": 1.0,
4251
+ "epoch": 5.17,
4252
+ "step": 300
4253
+ },
4254
+ {
4255
+ "accuracy": 1.0,
4256
+ "epoch": 5.17,
4257
+ "step": 300
4258
+ },
4259
+ {
4260
+ "accuracy": 1.0,
4261
+ "epoch": 5.17,
4262
+ "step": 300
4263
+ },
4264
+ {
4265
+ "accuracy": 1.0,
4266
+ "epoch": 5.17,
4267
+ "step": 300
4268
+ },
4269
+ {
4270
+ "accuracy": 1.0,
4271
+ "epoch": 5.17,
4272
+ "step": 300
4273
+ },
4274
+ {
4275
+ "accuracy": 0.0,
4276
+ "epoch": 5.17,
4277
+ "step": 300
4278
+ },
4279
+ {
4280
+ "accuracy": 1.0,
4281
+ "epoch": 5.17,
4282
+ "step": 300
4283
+ },
4284
+ {
4285
+ "epoch": 5.17,
4286
+ "eval_accuracy": 0.9411764705882353,
4287
+ "eval_loss": 0.37313517928123474,
4288
+ "eval_runtime": 54.9231,
4289
+ "eval_samples_per_second": 1.238,
4290
+ "eval_steps_per_second": 1.238,
4291
+ "step": 300
4292
+ },
4293
+ {
4294
+ "accuracy": 1.0,
4295
+ "epoch": 5.17,
4296
+ "step": 300
4297
+ },
4298
+ {
4299
+ "accuracy": 1.0,
4300
+ "epoch": 5.17,
4301
+ "step": 300
4302
+ },
4303
+ {
4304
+ "accuracy": 1.0,
4305
+ "epoch": 5.17,
4306
+ "step": 300
4307
+ },
4308
+ {
4309
+ "accuracy": 1.0,
4310
+ "epoch": 5.17,
4311
+ "step": 300
4312
+ },
4313
+ {
4314
+ "accuracy": 1.0,
4315
+ "epoch": 5.17,
4316
+ "step": 300
4317
+ },
4318
+ {
4319
+ "accuracy": 1.0,
4320
+ "epoch": 5.17,
4321
+ "step": 300
4322
+ },
4323
+ {
4324
+ "accuracy": 1.0,
4325
+ "epoch": 5.17,
4326
+ "step": 300
4327
+ },
4328
+ {
4329
+ "accuracy": 1.0,
4330
+ "epoch": 5.17,
4331
+ "step": 300
4332
+ },
4333
+ {
4334
+ "accuracy": 1.0,
4335
+ "epoch": 5.17,
4336
+ "step": 300
4337
+ },
4338
+ {
4339
+ "accuracy": 1.0,
4340
+ "epoch": 5.17,
4341
+ "step": 300
4342
+ },
4343
+ {
4344
+ "accuracy": 1.0,
4345
+ "epoch": 5.17,
4346
+ "step": 300
4347
+ },
4348
+ {
4349
+ "accuracy": 1.0,
4350
+ "epoch": 5.17,
4351
+ "step": 300
4352
+ },
4353
+ {
4354
+ "accuracy": 0.0,
4355
+ "epoch": 5.17,
4356
+ "step": 300
4357
+ },
4358
+ {
4359
+ "accuracy": 1.0,
4360
+ "epoch": 5.17,
4361
+ "step": 300
4362
+ },
4363
+ {
4364
+ "accuracy": 1.0,
4365
+ "epoch": 5.17,
4366
+ "step": 300
4367
+ },
4368
+ {
4369
+ "accuracy": 1.0,
4370
+ "epoch": 5.17,
4371
+ "step": 300
4372
+ },
4373
+ {
4374
+ "accuracy": 1.0,
4375
+ "epoch": 5.17,
4376
+ "step": 300
4377
+ },
4378
+ {
4379
+ "accuracy": 1.0,
4380
+ "epoch": 5.17,
4381
+ "step": 300
4382
+ },
4383
+ {
4384
+ "accuracy": 1.0,
4385
+ "epoch": 5.17,
4386
+ "step": 300
4387
+ },
4388
+ {
4389
+ "accuracy": 1.0,
4390
+ "epoch": 5.17,
4391
+ "step": 300
4392
+ },
4393
+ {
4394
+ "accuracy": 1.0,
4395
+ "epoch": 5.17,
4396
+ "step": 300
4397
+ },
4398
+ {
4399
+ "accuracy": 1.0,
4400
+ "epoch": 5.17,
4401
+ "step": 300
4402
+ },
4403
+ {
4404
+ "accuracy": 1.0,
4405
+ "epoch": 5.17,
4406
+ "step": 300
4407
+ },
4408
+ {
4409
+ "accuracy": 1.0,
4410
+ "epoch": 5.17,
4411
+ "step": 300
4412
+ },
4413
+ {
4414
+ "accuracy": 1.0,
4415
+ "epoch": 5.17,
4416
+ "step": 300
4417
+ },
4418
+ {
4419
+ "accuracy": 1.0,
4420
+ "epoch": 5.17,
4421
+ "step": 300
4422
+ },
4423
+ {
4424
+ "accuracy": 1.0,
4425
+ "epoch": 5.17,
4426
+ "step": 300
4427
+ },
4428
+ {
4429
+ "accuracy": 1.0,
4430
+ "epoch": 5.17,
4431
+ "step": 300
4432
+ },
4433
+ {
4434
+ "accuracy": 1.0,
4435
+ "epoch": 5.17,
4436
+ "step": 300
4437
+ },
4438
+ {
4439
+ "accuracy": 1.0,
4440
+ "epoch": 5.17,
4441
+ "step": 300
4442
+ },
4443
+ {
4444
+ "accuracy": 1.0,
4445
+ "epoch": 5.17,
4446
+ "step": 300
4447
+ },
4448
+ {
4449
+ "accuracy": 1.0,
4450
+ "epoch": 5.17,
4451
+ "step": 300
4452
+ },
4453
+ {
4454
+ "accuracy": 1.0,
4455
+ "epoch": 5.17,
4456
+ "step": 300
4457
+ },
4458
+ {
4459
+ "accuracy": 1.0,
4460
+ "epoch": 5.17,
4461
+ "step": 300
4462
+ },
4463
+ {
4464
+ "accuracy": 1.0,
4465
+ "epoch": 5.17,
4466
+ "step": 300
4467
+ },
4468
+ {
4469
+ "accuracy": 1.0,
4470
+ "epoch": 5.17,
4471
+ "step": 300
4472
+ },
4473
+ {
4474
+ "accuracy": 1.0,
4475
+ "epoch": 5.17,
4476
+ "step": 300
4477
+ },
4478
+ {
4479
+ "accuracy": 1.0,
4480
+ "epoch": 5.17,
4481
+ "step": 300
4482
+ },
4483
+ {
4484
+ "accuracy": 1.0,
4485
+ "epoch": 5.17,
4486
+ "step": 300
4487
+ },
4488
+ {
4489
+ "accuracy": 1.0,
4490
+ "epoch": 5.17,
4491
+ "step": 300
4492
+ },
4493
+ {
4494
+ "accuracy": 1.0,
4495
+ "epoch": 5.17,
4496
+ "step": 300
4497
+ },
4498
+ {
4499
+ "accuracy": 1.0,
4500
+ "epoch": 5.17,
4501
+ "step": 300
4502
+ },
4503
+ {
4504
+ "accuracy": 1.0,
4505
+ "epoch": 5.17,
4506
+ "step": 300
4507
+ },
4508
+ {
4509
+ "accuracy": 1.0,
4510
+ "epoch": 5.17,
4511
+ "step": 300
4512
+ },
4513
+ {
4514
+ "accuracy": 1.0,
4515
+ "epoch": 5.17,
4516
+ "step": 300
4517
+ },
4518
+ {
4519
+ "accuracy": 1.0,
4520
+ "epoch": 5.17,
4521
+ "step": 300
4522
+ },
4523
+ {
4524
+ "accuracy": 1.0,
4525
+ "epoch": 5.17,
4526
+ "step": 300
4527
+ },
4528
+ {
4529
+ "accuracy": 1.0,
4530
+ "epoch": 5.17,
4531
+ "step": 300
4532
+ },
4533
+ {
4534
+ "accuracy": 1.0,
4535
+ "epoch": 5.17,
4536
+ "step": 300
4537
+ },
4538
+ {
4539
+ "accuracy": 1.0,
4540
+ "epoch": 5.17,
4541
+ "step": 300
4542
+ },
4543
+ {
4544
+ "accuracy": 1.0,
4545
+ "epoch": 5.17,
4546
+ "step": 300
4547
+ },
4548
+ {
4549
+ "accuracy": 1.0,
4550
+ "epoch": 5.17,
4551
+ "step": 300
4552
+ },
4553
+ {
4554
+ "accuracy": 1.0,
4555
+ "epoch": 5.17,
4556
+ "step": 300
4557
+ },
4558
+ {
4559
+ "accuracy": 1.0,
4560
+ "epoch": 5.17,
4561
+ "step": 300
4562
+ },
4563
+ {
4564
+ "accuracy": 1.0,
4565
+ "epoch": 5.17,
4566
+ "step": 300
4567
+ },
4568
+ {
4569
+ "accuracy": 1.0,
4570
+ "epoch": 5.17,
4571
+ "step": 300
4572
+ },
4573
+ {
4574
+ "accuracy": 1.0,
4575
+ "epoch": 5.17,
4576
+ "step": 300
4577
+ },
4578
+ {
4579
+ "accuracy": 1.0,
4580
+ "epoch": 5.17,
4581
+ "step": 300
4582
+ },
4583
+ {
4584
+ "accuracy": 1.0,
4585
+ "epoch": 5.17,
4586
+ "step": 300
4587
+ },
4588
+ {
4589
+ "accuracy": 1.0,
4590
+ "epoch": 5.17,
4591
+ "step": 300
4592
+ },
4593
+ {
4594
+ "accuracy": 0.0,
4595
+ "epoch": 5.17,
4596
+ "step": 300
4597
+ },
4598
+ {
4599
+ "accuracy": 0.0,
4600
+ "epoch": 5.17,
4601
+ "step": 300
4602
+ },
4603
+ {
4604
+ "accuracy": 0.0,
4605
+ "epoch": 5.17,
4606
+ "step": 300
4607
+ },
4608
+ {
4609
+ "accuracy": 1.0,
4610
+ "epoch": 5.17,
4611
+ "step": 300
4612
+ },
4613
+ {
4614
+ "accuracy": 1.0,
4615
+ "epoch": 5.17,
4616
+ "step": 300
4617
+ },
4618
+ {
4619
+ "accuracy": 1.0,
4620
+ "epoch": 5.17,
4621
+ "step": 300
4622
+ },
4623
+ {
4624
+ "accuracy": 1.0,
4625
+ "epoch": 5.17,
4626
+ "step": 300
4627
+ },
4628
+ {
4629
+ "accuracy": 1.0,
4630
+ "epoch": 5.17,
4631
+ "step": 300
4632
+ },
4633
+ {
4634
+ "epoch": 5.17,
4635
+ "eval_accuracy": 0.9411764705882353,
4636
+ "eval_loss": 0.3731352388858795,
4637
+ "eval_runtime": 54.8788,
4638
+ "eval_samples_per_second": 1.239,
4639
+ "eval_steps_per_second": 1.239,
4640
+ "step": 300
4641
+ }
4642
+ ],
4643
+ "logging_steps": 10,
4644
+ "max_steps": 300,
4645
+ "num_input_tokens_seen": 0,
4646
+ "num_train_epochs": 9223372036854775807,
4647
+ "save_steps": 500,
4648
+ "total_flos": 3.28565653420032e+17,
4649
+ "train_batch_size": 1,
4650
+ "trial_name": null,
4651
+ "trial_params": null
4652
+ }