JacobLinCool commited on
Commit
db33f2f
·
verified ·
1 Parent(s): 4b64016

Training in progress, epoch 2, checkpoint

Browse files
last-checkpoint/adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1fb289b1313d8d3ea139574060333d2bc4e322e19984a34b0b887bcf81acfcc0
3
  size 111475752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:63e000fe639aec47bff52fa19a961971b707751947c87f9a5ff6c0dc2fe99f01
3
  size 111475752
last-checkpoint/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c04f0312246c9bde038163f931a8c5c377b0fe4219a9ac2784576d6028f217d9
3
  size 223212738
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa7052db74e0f4dc5b3a0b582bd7440f2d753d1f86e80b9bf2465c05890c05af
3
  size 223212738
last-checkpoint/rng_state.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:10340d2fef3c6ef590a90ffffb0fbde8d896f997c1e120eda641f1fb1757f11e
3
  size 14244
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a62cf652e907688082668731438326fd59f61afa5ca5ca0811f87bbebf5f3221
3
  size 14244
last-checkpoint/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c89c7447eb51d945a7138520eef2ce59e572a99c872850f96937133d3e9cce22
3
  size 1064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b1d7731b358c3ec58c279de7e2cb4cb855355943816fa190a444f54b60a2f948
3
  size 1064
last-checkpoint/trainer_state.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
- "best_metric": 35.86396427344555,
3
- "best_model_checkpoint": "./exp/whisper-large-v3-turbo-augmented/checkpoint-190",
4
- "epoch": 1.0,
5
  "eval_steps": 500,
6
- "global_step": 190,
7
  "is_hyper_param_search": false,
8
  "is_local_process_zero": true,
9
  "is_world_process_zero": true,
@@ -1348,6 +1348,1347 @@
1348
  "eval_steps_per_second": 0.89,
1349
  "eval_wer": 35.86396427344555,
1350
  "step": 190
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1351
  }
1352
  ],
1353
  "logging_steps": 1,
@@ -1367,7 +2708,7 @@
1367
  "attributes": {}
1368
  }
1369
  },
1370
- "total_flos": 5.3604457316352e+18,
1371
  "train_batch_size": 16,
1372
  "trial_name": null,
1373
  "trial_params": null
 
1
  {
2
+ "best_metric": 32.53177602198557,
3
+ "best_model_checkpoint": "./exp/whisper-large-v3-turbo-augmented/checkpoint-380",
4
+ "epoch": 2.0,
5
  "eval_steps": 500,
6
+ "global_step": 380,
7
  "is_hyper_param_search": false,
8
  "is_local_process_zero": true,
9
  "is_world_process_zero": true,
 
1348
  "eval_steps_per_second": 0.89,
1349
  "eval_wer": 35.86396427344555,
1350
  "step": 190
1351
+ },
1352
+ {
1353
+ "epoch": 1.0052631578947369,
1354
+ "grad_norm": 0.3642374873161316,
1355
+ "learning_rate": 0.0003994736842105263,
1356
+ "loss": 0.0385,
1357
+ "step": 191
1358
+ },
1359
+ {
1360
+ "epoch": 1.0105263157894737,
1361
+ "grad_norm": 0.45194628834724426,
1362
+ "learning_rate": 0.0003989473684210526,
1363
+ "loss": 0.0433,
1364
+ "step": 192
1365
+ },
1366
+ {
1367
+ "epoch": 1.0157894736842106,
1368
+ "grad_norm": 0.40765321254730225,
1369
+ "learning_rate": 0.000398421052631579,
1370
+ "loss": 0.0326,
1371
+ "step": 193
1372
+ },
1373
+ {
1374
+ "epoch": 1.0210526315789474,
1375
+ "grad_norm": 0.5447459816932678,
1376
+ "learning_rate": 0.00039789473684210527,
1377
+ "loss": 0.0531,
1378
+ "step": 194
1379
+ },
1380
+ {
1381
+ "epoch": 1.0263157894736843,
1382
+ "grad_norm": 0.6106863617897034,
1383
+ "learning_rate": 0.0003973684210526316,
1384
+ "loss": 0.0477,
1385
+ "step": 195
1386
+ },
1387
+ {
1388
+ "epoch": 1.0315789473684212,
1389
+ "grad_norm": 0.38588041067123413,
1390
+ "learning_rate": 0.0003968421052631579,
1391
+ "loss": 0.0308,
1392
+ "step": 196
1393
+ },
1394
+ {
1395
+ "epoch": 1.0368421052631578,
1396
+ "grad_norm": 0.5768702030181885,
1397
+ "learning_rate": 0.00039631578947368424,
1398
+ "loss": 0.0933,
1399
+ "step": 197
1400
+ },
1401
+ {
1402
+ "epoch": 1.0421052631578946,
1403
+ "grad_norm": 0.37887042760849,
1404
+ "learning_rate": 0.0003957894736842105,
1405
+ "loss": 0.0388,
1406
+ "step": 198
1407
+ },
1408
+ {
1409
+ "epoch": 1.0473684210526315,
1410
+ "grad_norm": 0.5356220006942749,
1411
+ "learning_rate": 0.0003952631578947368,
1412
+ "loss": 0.0752,
1413
+ "step": 199
1414
+ },
1415
+ {
1416
+ "epoch": 1.0526315789473684,
1417
+ "grad_norm": 0.5753058195114136,
1418
+ "learning_rate": 0.00039473684210526315,
1419
+ "loss": 0.0335,
1420
+ "step": 200
1421
+ },
1422
+ {
1423
+ "epoch": 1.0578947368421052,
1424
+ "grad_norm": 1.119248390197754,
1425
+ "learning_rate": 0.0003942105263157895,
1426
+ "loss": 0.1239,
1427
+ "step": 201
1428
+ },
1429
+ {
1430
+ "epoch": 1.063157894736842,
1431
+ "grad_norm": 0.5077896118164062,
1432
+ "learning_rate": 0.00039368421052631583,
1433
+ "loss": 0.048,
1434
+ "step": 202
1435
+ },
1436
+ {
1437
+ "epoch": 1.068421052631579,
1438
+ "grad_norm": 0.44742247462272644,
1439
+ "learning_rate": 0.0003931578947368421,
1440
+ "loss": 0.0431,
1441
+ "step": 203
1442
+ },
1443
+ {
1444
+ "epoch": 1.0736842105263158,
1445
+ "grad_norm": 0.4166790246963501,
1446
+ "learning_rate": 0.00039263157894736846,
1447
+ "loss": 0.047,
1448
+ "step": 204
1449
+ },
1450
+ {
1451
+ "epoch": 1.0789473684210527,
1452
+ "grad_norm": 0.664332389831543,
1453
+ "learning_rate": 0.00039210526315789474,
1454
+ "loss": 0.0905,
1455
+ "step": 205
1456
+ },
1457
+ {
1458
+ "epoch": 1.0842105263157895,
1459
+ "grad_norm": 0.7036660313606262,
1460
+ "learning_rate": 0.000391578947368421,
1461
+ "loss": 0.0794,
1462
+ "step": 206
1463
+ },
1464
+ {
1465
+ "epoch": 1.0894736842105264,
1466
+ "grad_norm": 0.7079408764839172,
1467
+ "learning_rate": 0.00039105263157894737,
1468
+ "loss": 0.061,
1469
+ "step": 207
1470
+ },
1471
+ {
1472
+ "epoch": 1.0947368421052632,
1473
+ "grad_norm": 0.6071505546569824,
1474
+ "learning_rate": 0.00039052631578947365,
1475
+ "loss": 0.1008,
1476
+ "step": 208
1477
+ },
1478
+ {
1479
+ "epoch": 1.1,
1480
+ "grad_norm": 0.5258550643920898,
1481
+ "learning_rate": 0.00039000000000000005,
1482
+ "loss": 0.0563,
1483
+ "step": 209
1484
+ },
1485
+ {
1486
+ "epoch": 1.1052631578947367,
1487
+ "grad_norm": 0.4677925705909729,
1488
+ "learning_rate": 0.00038947368421052633,
1489
+ "loss": 0.0316,
1490
+ "step": 210
1491
+ },
1492
+ {
1493
+ "epoch": 1.1105263157894736,
1494
+ "grad_norm": 0.36704903841018677,
1495
+ "learning_rate": 0.0003889473684210527,
1496
+ "loss": 0.0272,
1497
+ "step": 211
1498
+ },
1499
+ {
1500
+ "epoch": 1.1157894736842104,
1501
+ "grad_norm": 0.25789088010787964,
1502
+ "learning_rate": 0.00038842105263157896,
1503
+ "loss": 0.0232,
1504
+ "step": 212
1505
+ },
1506
+ {
1507
+ "epoch": 1.1210526315789473,
1508
+ "grad_norm": 0.459507018327713,
1509
+ "learning_rate": 0.00038789473684210524,
1510
+ "loss": 0.0368,
1511
+ "step": 213
1512
+ },
1513
+ {
1514
+ "epoch": 1.1263157894736842,
1515
+ "grad_norm": 1.4723243713378906,
1516
+ "learning_rate": 0.0003873684210526316,
1517
+ "loss": 0.1488,
1518
+ "step": 214
1519
+ },
1520
+ {
1521
+ "epoch": 1.131578947368421,
1522
+ "grad_norm": 0.5805358290672302,
1523
+ "learning_rate": 0.00038684210526315787,
1524
+ "loss": 0.0539,
1525
+ "step": 215
1526
+ },
1527
+ {
1528
+ "epoch": 1.1368421052631579,
1529
+ "grad_norm": 0.42323464155197144,
1530
+ "learning_rate": 0.0003863157894736842,
1531
+ "loss": 0.0341,
1532
+ "step": 216
1533
+ },
1534
+ {
1535
+ "epoch": 1.1421052631578947,
1536
+ "grad_norm": 0.5997718572616577,
1537
+ "learning_rate": 0.0003857894736842105,
1538
+ "loss": 0.0281,
1539
+ "step": 217
1540
+ },
1541
+ {
1542
+ "epoch": 1.1473684210526316,
1543
+ "grad_norm": 0.4035644829273224,
1544
+ "learning_rate": 0.0003852631578947369,
1545
+ "loss": 0.0243,
1546
+ "step": 218
1547
+ },
1548
+ {
1549
+ "epoch": 1.1526315789473685,
1550
+ "grad_norm": 0.6057082414627075,
1551
+ "learning_rate": 0.0003847368421052632,
1552
+ "loss": 0.0288,
1553
+ "step": 219
1554
+ },
1555
+ {
1556
+ "epoch": 1.1578947368421053,
1557
+ "grad_norm": 2.3672714233398438,
1558
+ "learning_rate": 0.00038421052631578946,
1559
+ "loss": 0.0941,
1560
+ "step": 220
1561
+ },
1562
+ {
1563
+ "epoch": 1.1631578947368422,
1564
+ "grad_norm": 0.5687212347984314,
1565
+ "learning_rate": 0.0003836842105263158,
1566
+ "loss": 0.0305,
1567
+ "step": 221
1568
+ },
1569
+ {
1570
+ "epoch": 1.168421052631579,
1571
+ "grad_norm": 0.41877779364585876,
1572
+ "learning_rate": 0.0003831578947368421,
1573
+ "loss": 0.0355,
1574
+ "step": 222
1575
+ },
1576
+ {
1577
+ "epoch": 1.1736842105263159,
1578
+ "grad_norm": 0.2985477149486542,
1579
+ "learning_rate": 0.00038263157894736843,
1580
+ "loss": 0.0343,
1581
+ "step": 223
1582
+ },
1583
+ {
1584
+ "epoch": 1.1789473684210527,
1585
+ "grad_norm": 0.7928400635719299,
1586
+ "learning_rate": 0.0003821052631578947,
1587
+ "loss": 0.0904,
1588
+ "step": 224
1589
+ },
1590
+ {
1591
+ "epoch": 1.1842105263157894,
1592
+ "grad_norm": 0.9386544823646545,
1593
+ "learning_rate": 0.00038157894736842105,
1594
+ "loss": 0.0814,
1595
+ "step": 225
1596
+ },
1597
+ {
1598
+ "epoch": 1.1894736842105262,
1599
+ "grad_norm": 0.8019982576370239,
1600
+ "learning_rate": 0.0003810526315789474,
1601
+ "loss": 0.0852,
1602
+ "step": 226
1603
+ },
1604
+ {
1605
+ "epoch": 1.194736842105263,
1606
+ "grad_norm": 0.7978726029396057,
1607
+ "learning_rate": 0.0003805263157894737,
1608
+ "loss": 0.0579,
1609
+ "step": 227
1610
+ },
1611
+ {
1612
+ "epoch": 1.2,
1613
+ "grad_norm": 0.5441069006919861,
1614
+ "learning_rate": 0.00038,
1615
+ "loss": 0.0298,
1616
+ "step": 228
1617
+ },
1618
+ {
1619
+ "epoch": 1.2052631578947368,
1620
+ "grad_norm": 0.7516114711761475,
1621
+ "learning_rate": 0.0003794736842105263,
1622
+ "loss": 0.0916,
1623
+ "step": 229
1624
+ },
1625
+ {
1626
+ "epoch": 1.2105263157894737,
1627
+ "grad_norm": 0.6469661593437195,
1628
+ "learning_rate": 0.00037894736842105265,
1629
+ "loss": 0.081,
1630
+ "step": 230
1631
+ },
1632
+ {
1633
+ "epoch": 1.2157894736842105,
1634
+ "grad_norm": 0.5822259783744812,
1635
+ "learning_rate": 0.00037842105263157893,
1636
+ "loss": 0.0767,
1637
+ "step": 231
1638
+ },
1639
+ {
1640
+ "epoch": 1.2210526315789474,
1641
+ "grad_norm": 0.6925140023231506,
1642
+ "learning_rate": 0.00037789473684210527,
1643
+ "loss": 0.0797,
1644
+ "step": 232
1645
+ },
1646
+ {
1647
+ "epoch": 1.2263157894736842,
1648
+ "grad_norm": 0.7924796938896179,
1649
+ "learning_rate": 0.00037736842105263156,
1650
+ "loss": 0.0663,
1651
+ "step": 233
1652
+ },
1653
+ {
1654
+ "epoch": 1.231578947368421,
1655
+ "grad_norm": 0.5165220499038696,
1656
+ "learning_rate": 0.0003768421052631579,
1657
+ "loss": 0.0314,
1658
+ "step": 234
1659
+ },
1660
+ {
1661
+ "epoch": 1.236842105263158,
1662
+ "grad_norm": 0.6821334362030029,
1663
+ "learning_rate": 0.00037631578947368424,
1664
+ "loss": 0.1264,
1665
+ "step": 235
1666
+ },
1667
+ {
1668
+ "epoch": 1.2421052631578948,
1669
+ "grad_norm": 0.6426858305931091,
1670
+ "learning_rate": 0.0003757894736842105,
1671
+ "loss": 0.0647,
1672
+ "step": 236
1673
+ },
1674
+ {
1675
+ "epoch": 1.2473684210526317,
1676
+ "grad_norm": 0.620343804359436,
1677
+ "learning_rate": 0.00037526315789473686,
1678
+ "loss": 0.0401,
1679
+ "step": 237
1680
+ },
1681
+ {
1682
+ "epoch": 1.2526315789473683,
1683
+ "grad_norm": 0.7430979609489441,
1684
+ "learning_rate": 0.00037473684210526315,
1685
+ "loss": 0.0834,
1686
+ "step": 238
1687
+ },
1688
+ {
1689
+ "epoch": 1.2578947368421054,
1690
+ "grad_norm": 0.7153907418251038,
1691
+ "learning_rate": 0.0003742105263157895,
1692
+ "loss": 0.0648,
1693
+ "step": 239
1694
+ },
1695
+ {
1696
+ "epoch": 1.263157894736842,
1697
+ "grad_norm": 0.6082773208618164,
1698
+ "learning_rate": 0.0003736842105263158,
1699
+ "loss": 0.0862,
1700
+ "step": 240
1701
+ },
1702
+ {
1703
+ "epoch": 1.268421052631579,
1704
+ "grad_norm": 0.43621256947517395,
1705
+ "learning_rate": 0.0003731578947368421,
1706
+ "loss": 0.0445,
1707
+ "step": 241
1708
+ },
1709
+ {
1710
+ "epoch": 1.2736842105263158,
1711
+ "grad_norm": 0.5313287377357483,
1712
+ "learning_rate": 0.00037263157894736846,
1713
+ "loss": 0.0633,
1714
+ "step": 242
1715
+ },
1716
+ {
1717
+ "epoch": 1.2789473684210526,
1718
+ "grad_norm": 0.5271344184875488,
1719
+ "learning_rate": 0.00037210526315789474,
1720
+ "loss": 0.052,
1721
+ "step": 243
1722
+ },
1723
+ {
1724
+ "epoch": 1.2842105263157895,
1725
+ "grad_norm": 0.4209420382976532,
1726
+ "learning_rate": 0.0003715789473684211,
1727
+ "loss": 0.0495,
1728
+ "step": 244
1729
+ },
1730
+ {
1731
+ "epoch": 1.2894736842105263,
1732
+ "grad_norm": 0.530073344707489,
1733
+ "learning_rate": 0.00037105263157894737,
1734
+ "loss": 0.0331,
1735
+ "step": 245
1736
+ },
1737
+ {
1738
+ "epoch": 1.2947368421052632,
1739
+ "grad_norm": 0.5666402578353882,
1740
+ "learning_rate": 0.0003705263157894737,
1741
+ "loss": 0.0414,
1742
+ "step": 246
1743
+ },
1744
+ {
1745
+ "epoch": 1.3,
1746
+ "grad_norm": 0.6282495260238647,
1747
+ "learning_rate": 0.00037,
1748
+ "loss": 0.0428,
1749
+ "step": 247
1750
+ },
1751
+ {
1752
+ "epoch": 1.305263157894737,
1753
+ "grad_norm": 0.6240766644477844,
1754
+ "learning_rate": 0.00036947368421052633,
1755
+ "loss": 0.0348,
1756
+ "step": 248
1757
+ },
1758
+ {
1759
+ "epoch": 1.3105263157894738,
1760
+ "grad_norm": 0.6795949935913086,
1761
+ "learning_rate": 0.0003689473684210526,
1762
+ "loss": 0.0625,
1763
+ "step": 249
1764
+ },
1765
+ {
1766
+ "epoch": 1.3157894736842106,
1767
+ "grad_norm": 0.9656326174736023,
1768
+ "learning_rate": 0.00036842105263157896,
1769
+ "loss": 0.073,
1770
+ "step": 250
1771
+ },
1772
+ {
1773
+ "epoch": 1.3210526315789473,
1774
+ "grad_norm": 0.387407511472702,
1775
+ "learning_rate": 0.0003678947368421053,
1776
+ "loss": 0.0186,
1777
+ "step": 251
1778
+ },
1779
+ {
1780
+ "epoch": 1.3263157894736843,
1781
+ "grad_norm": 0.4958190619945526,
1782
+ "learning_rate": 0.0003673684210526316,
1783
+ "loss": 0.0501,
1784
+ "step": 252
1785
+ },
1786
+ {
1787
+ "epoch": 1.331578947368421,
1788
+ "grad_norm": 0.9461203217506409,
1789
+ "learning_rate": 0.0003668421052631579,
1790
+ "loss": 0.0444,
1791
+ "step": 253
1792
+ },
1793
+ {
1794
+ "epoch": 1.3368421052631578,
1795
+ "grad_norm": 0.5634720325469971,
1796
+ "learning_rate": 0.0003663157894736842,
1797
+ "loss": 0.0529,
1798
+ "step": 254
1799
+ },
1800
+ {
1801
+ "epoch": 1.3421052631578947,
1802
+ "grad_norm": 0.4822929799556732,
1803
+ "learning_rate": 0.00036578947368421055,
1804
+ "loss": 0.0602,
1805
+ "step": 255
1806
+ },
1807
+ {
1808
+ "epoch": 1.3473684210526315,
1809
+ "grad_norm": 0.6798676252365112,
1810
+ "learning_rate": 0.00036526315789473684,
1811
+ "loss": 0.0552,
1812
+ "step": 256
1813
+ },
1814
+ {
1815
+ "epoch": 1.3526315789473684,
1816
+ "grad_norm": 0.791389524936676,
1817
+ "learning_rate": 0.0003647368421052631,
1818
+ "loss": 0.0994,
1819
+ "step": 257
1820
+ },
1821
+ {
1822
+ "epoch": 1.3578947368421053,
1823
+ "grad_norm": 0.7977035641670227,
1824
+ "learning_rate": 0.0003642105263157895,
1825
+ "loss": 0.0767,
1826
+ "step": 258
1827
+ },
1828
+ {
1829
+ "epoch": 1.3631578947368421,
1830
+ "grad_norm": 0.2511521875858307,
1831
+ "learning_rate": 0.0003636842105263158,
1832
+ "loss": 0.016,
1833
+ "step": 259
1834
+ },
1835
+ {
1836
+ "epoch": 1.368421052631579,
1837
+ "grad_norm": 0.5453305840492249,
1838
+ "learning_rate": 0.00036315789473684214,
1839
+ "loss": 0.0386,
1840
+ "step": 260
1841
+ },
1842
+ {
1843
+ "epoch": 1.3736842105263158,
1844
+ "grad_norm": 0.6291806697845459,
1845
+ "learning_rate": 0.00036263157894736843,
1846
+ "loss": 0.059,
1847
+ "step": 261
1848
+ },
1849
+ {
1850
+ "epoch": 1.3789473684210527,
1851
+ "grad_norm": 0.2653387486934662,
1852
+ "learning_rate": 0.00036210526315789477,
1853
+ "loss": 0.012,
1854
+ "step": 262
1855
+ },
1856
+ {
1857
+ "epoch": 1.3842105263157896,
1858
+ "grad_norm": 0.17621006071567535,
1859
+ "learning_rate": 0.00036157894736842106,
1860
+ "loss": 0.0086,
1861
+ "step": 263
1862
+ },
1863
+ {
1864
+ "epoch": 1.3894736842105262,
1865
+ "grad_norm": 0.6944013833999634,
1866
+ "learning_rate": 0.00036105263157894734,
1867
+ "loss": 0.0802,
1868
+ "step": 264
1869
+ },
1870
+ {
1871
+ "epoch": 1.3947368421052633,
1872
+ "grad_norm": 0.6588537096977234,
1873
+ "learning_rate": 0.0003605263157894737,
1874
+ "loss": 0.045,
1875
+ "step": 265
1876
+ },
1877
+ {
1878
+ "epoch": 1.4,
1879
+ "grad_norm": 0.5538493990898132,
1880
+ "learning_rate": 0.00035999999999999997,
1881
+ "loss": 0.0296,
1882
+ "step": 266
1883
+ },
1884
+ {
1885
+ "epoch": 1.4052631578947368,
1886
+ "grad_norm": 0.49943503737449646,
1887
+ "learning_rate": 0.00035947368421052636,
1888
+ "loss": 0.0728,
1889
+ "step": 267
1890
+ },
1891
+ {
1892
+ "epoch": 1.4105263157894736,
1893
+ "grad_norm": 0.8903251886367798,
1894
+ "learning_rate": 0.00035894736842105265,
1895
+ "loss": 0.0362,
1896
+ "step": 268
1897
+ },
1898
+ {
1899
+ "epoch": 1.4157894736842105,
1900
+ "grad_norm": 1.4392952919006348,
1901
+ "learning_rate": 0.000358421052631579,
1902
+ "loss": 0.054,
1903
+ "step": 269
1904
+ },
1905
+ {
1906
+ "epoch": 1.4210526315789473,
1907
+ "grad_norm": 0.5288906693458557,
1908
+ "learning_rate": 0.0003578947368421053,
1909
+ "loss": 0.0427,
1910
+ "step": 270
1911
+ },
1912
+ {
1913
+ "epoch": 1.4263157894736842,
1914
+ "grad_norm": 0.4974476397037506,
1915
+ "learning_rate": 0.00035736842105263156,
1916
+ "loss": 0.0325,
1917
+ "step": 271
1918
+ },
1919
+ {
1920
+ "epoch": 1.431578947368421,
1921
+ "grad_norm": 0.5362409949302673,
1922
+ "learning_rate": 0.0003568421052631579,
1923
+ "loss": 0.0471,
1924
+ "step": 272
1925
+ },
1926
+ {
1927
+ "epoch": 1.436842105263158,
1928
+ "grad_norm": 0.485411137342453,
1929
+ "learning_rate": 0.0003563157894736842,
1930
+ "loss": 0.0403,
1931
+ "step": 273
1932
+ },
1933
+ {
1934
+ "epoch": 1.4421052631578948,
1935
+ "grad_norm": 0.4615587592124939,
1936
+ "learning_rate": 0.0003557894736842105,
1937
+ "loss": 0.0241,
1938
+ "step": 274
1939
+ },
1940
+ {
1941
+ "epoch": 1.4473684210526316,
1942
+ "grad_norm": 0.7625806927680969,
1943
+ "learning_rate": 0.00035526315789473687,
1944
+ "loss": 0.0485,
1945
+ "step": 275
1946
+ },
1947
+ {
1948
+ "epoch": 1.4526315789473685,
1949
+ "grad_norm": 0.5750721096992493,
1950
+ "learning_rate": 0.0003547368421052632,
1951
+ "loss": 0.0717,
1952
+ "step": 276
1953
+ },
1954
+ {
1955
+ "epoch": 1.4578947368421051,
1956
+ "grad_norm": 0.6047337651252747,
1957
+ "learning_rate": 0.0003542105263157895,
1958
+ "loss": 0.0394,
1959
+ "step": 277
1960
+ },
1961
+ {
1962
+ "epoch": 1.4631578947368422,
1963
+ "grad_norm": 0.4338988959789276,
1964
+ "learning_rate": 0.0003536842105263158,
1965
+ "loss": 0.0289,
1966
+ "step": 278
1967
+ },
1968
+ {
1969
+ "epoch": 1.4684210526315788,
1970
+ "grad_norm": 0.6003149747848511,
1971
+ "learning_rate": 0.0003531578947368421,
1972
+ "loss": 0.0376,
1973
+ "step": 279
1974
+ },
1975
+ {
1976
+ "epoch": 1.4736842105263157,
1977
+ "grad_norm": 0.576666533946991,
1978
+ "learning_rate": 0.0003526315789473684,
1979
+ "loss": 0.0404,
1980
+ "step": 280
1981
+ },
1982
+ {
1983
+ "epoch": 1.4789473684210526,
1984
+ "grad_norm": 0.5296128392219543,
1985
+ "learning_rate": 0.00035210526315789474,
1986
+ "loss": 0.038,
1987
+ "step": 281
1988
+ },
1989
+ {
1990
+ "epoch": 1.4842105263157894,
1991
+ "grad_norm": 0.507158637046814,
1992
+ "learning_rate": 0.00035157894736842103,
1993
+ "loss": 0.0317,
1994
+ "step": 282
1995
+ },
1996
+ {
1997
+ "epoch": 1.4894736842105263,
1998
+ "grad_norm": 0.5600894689559937,
1999
+ "learning_rate": 0.0003510526315789474,
2000
+ "loss": 0.0331,
2001
+ "step": 283
2002
+ },
2003
+ {
2004
+ "epoch": 1.4947368421052631,
2005
+ "grad_norm": 0.5067355036735535,
2006
+ "learning_rate": 0.0003505263157894737,
2007
+ "loss": 0.0483,
2008
+ "step": 284
2009
+ },
2010
+ {
2011
+ "epoch": 1.5,
2012
+ "grad_norm": 0.2595311105251312,
2013
+ "learning_rate": 0.00035,
2014
+ "loss": 0.0076,
2015
+ "step": 285
2016
+ },
2017
+ {
2018
+ "epoch": 1.5052631578947369,
2019
+ "grad_norm": 1.583303451538086,
2020
+ "learning_rate": 0.00034947368421052634,
2021
+ "loss": 0.177,
2022
+ "step": 286
2023
+ },
2024
+ {
2025
+ "epoch": 1.5105263157894737,
2026
+ "grad_norm": 0.4873698353767395,
2027
+ "learning_rate": 0.0003489473684210526,
2028
+ "loss": 0.0206,
2029
+ "step": 287
2030
+ },
2031
+ {
2032
+ "epoch": 1.5157894736842106,
2033
+ "grad_norm": 0.40088966488838196,
2034
+ "learning_rate": 0.00034842105263157896,
2035
+ "loss": 0.0263,
2036
+ "step": 288
2037
+ },
2038
+ {
2039
+ "epoch": 1.5210526315789474,
2040
+ "grad_norm": 0.6531310677528381,
2041
+ "learning_rate": 0.00034789473684210525,
2042
+ "loss": 0.0895,
2043
+ "step": 289
2044
+ },
2045
+ {
2046
+ "epoch": 1.526315789473684,
2047
+ "grad_norm": 0.5846711993217468,
2048
+ "learning_rate": 0.0003473684210526316,
2049
+ "loss": 0.0429,
2050
+ "step": 290
2051
+ },
2052
+ {
2053
+ "epoch": 1.5315789473684212,
2054
+ "grad_norm": 0.3327924311161041,
2055
+ "learning_rate": 0.00034684210526315793,
2056
+ "loss": 0.0404,
2057
+ "step": 291
2058
+ },
2059
+ {
2060
+ "epoch": 1.5368421052631578,
2061
+ "grad_norm": 0.6832515597343445,
2062
+ "learning_rate": 0.0003463157894736842,
2063
+ "loss": 0.0733,
2064
+ "step": 292
2065
+ },
2066
+ {
2067
+ "epoch": 1.5421052631578949,
2068
+ "grad_norm": 0.83530592918396,
2069
+ "learning_rate": 0.00034578947368421055,
2070
+ "loss": 0.0728,
2071
+ "step": 293
2072
+ },
2073
+ {
2074
+ "epoch": 1.5473684210526315,
2075
+ "grad_norm": 0.5762038230895996,
2076
+ "learning_rate": 0.00034526315789473684,
2077
+ "loss": 0.0377,
2078
+ "step": 294
2079
+ },
2080
+ {
2081
+ "epoch": 1.5526315789473686,
2082
+ "grad_norm": 0.7385838627815247,
2083
+ "learning_rate": 0.0003447368421052632,
2084
+ "loss": 0.0821,
2085
+ "step": 295
2086
+ },
2087
+ {
2088
+ "epoch": 1.5578947368421052,
2089
+ "grad_norm": 0.35716795921325684,
2090
+ "learning_rate": 0.00034421052631578947,
2091
+ "loss": 0.0444,
2092
+ "step": 296
2093
+ },
2094
+ {
2095
+ "epoch": 1.563157894736842,
2096
+ "grad_norm": 0.656604528427124,
2097
+ "learning_rate": 0.0003436842105263158,
2098
+ "loss": 0.043,
2099
+ "step": 297
2100
+ },
2101
+ {
2102
+ "epoch": 1.568421052631579,
2103
+ "grad_norm": 0.4570469260215759,
2104
+ "learning_rate": 0.0003431578947368421,
2105
+ "loss": 0.0423,
2106
+ "step": 298
2107
+ },
2108
+ {
2109
+ "epoch": 1.5736842105263158,
2110
+ "grad_norm": 0.4714101552963257,
2111
+ "learning_rate": 0.0003426315789473684,
2112
+ "loss": 0.0306,
2113
+ "step": 299
2114
+ },
2115
+ {
2116
+ "epoch": 1.5789473684210527,
2117
+ "grad_norm": 0.6644315719604492,
2118
+ "learning_rate": 0.00034210526315789477,
2119
+ "loss": 0.0679,
2120
+ "step": 300
2121
+ },
2122
+ {
2123
+ "epoch": 1.5842105263157895,
2124
+ "grad_norm": 1.8450918197631836,
2125
+ "learning_rate": 0.00034157894736842106,
2126
+ "loss": 0.1879,
2127
+ "step": 301
2128
+ },
2129
+ {
2130
+ "epoch": 1.5894736842105264,
2131
+ "grad_norm": 0.7267522215843201,
2132
+ "learning_rate": 0.0003410526315789474,
2133
+ "loss": 0.0417,
2134
+ "step": 302
2135
+ },
2136
+ {
2137
+ "epoch": 1.594736842105263,
2138
+ "grad_norm": 0.46764013171195984,
2139
+ "learning_rate": 0.0003405263157894737,
2140
+ "loss": 0.0211,
2141
+ "step": 303
2142
+ },
2143
+ {
2144
+ "epoch": 1.6,
2145
+ "grad_norm": 0.8650890588760376,
2146
+ "learning_rate": 0.00034,
2147
+ "loss": 0.0719,
2148
+ "step": 304
2149
+ },
2150
+ {
2151
+ "epoch": 1.6052631578947367,
2152
+ "grad_norm": 0.4164888858795166,
2153
+ "learning_rate": 0.0003394736842105263,
2154
+ "loss": 0.0531,
2155
+ "step": 305
2156
+ },
2157
+ {
2158
+ "epoch": 1.6105263157894738,
2159
+ "grad_norm": 0.44695788621902466,
2160
+ "learning_rate": 0.0003389473684210526,
2161
+ "loss": 0.0674,
2162
+ "step": 306
2163
+ },
2164
+ {
2165
+ "epoch": 1.6157894736842104,
2166
+ "grad_norm": 0.8715166449546814,
2167
+ "learning_rate": 0.00033842105263157894,
2168
+ "loss": 0.0415,
2169
+ "step": 307
2170
+ },
2171
+ {
2172
+ "epoch": 1.6210526315789475,
2173
+ "grad_norm": 0.8586774468421936,
2174
+ "learning_rate": 0.0003378947368421053,
2175
+ "loss": 0.0864,
2176
+ "step": 308
2177
+ },
2178
+ {
2179
+ "epoch": 1.6263157894736842,
2180
+ "grad_norm": 0.7089522480964661,
2181
+ "learning_rate": 0.0003373684210526316,
2182
+ "loss": 0.038,
2183
+ "step": 309
2184
+ },
2185
+ {
2186
+ "epoch": 1.631578947368421,
2187
+ "grad_norm": 0.6505339741706848,
2188
+ "learning_rate": 0.0003368421052631579,
2189
+ "loss": 0.0334,
2190
+ "step": 310
2191
+ },
2192
+ {
2193
+ "epoch": 1.6368421052631579,
2194
+ "grad_norm": 0.8886998891830444,
2195
+ "learning_rate": 0.00033631578947368424,
2196
+ "loss": 0.0661,
2197
+ "step": 311
2198
+ },
2199
+ {
2200
+ "epoch": 1.6421052631578947,
2201
+ "grad_norm": 0.6641944050788879,
2202
+ "learning_rate": 0.00033578947368421053,
2203
+ "loss": 0.0713,
2204
+ "step": 312
2205
+ },
2206
+ {
2207
+ "epoch": 1.6473684210526316,
2208
+ "grad_norm": 0.3028227984905243,
2209
+ "learning_rate": 0.0003352631578947368,
2210
+ "loss": 0.0163,
2211
+ "step": 313
2212
+ },
2213
+ {
2214
+ "epoch": 1.6526315789473685,
2215
+ "grad_norm": 0.4122330844402313,
2216
+ "learning_rate": 0.00033473684210526315,
2217
+ "loss": 0.0394,
2218
+ "step": 314
2219
+ },
2220
+ {
2221
+ "epoch": 1.6578947368421053,
2222
+ "grad_norm": 0.629173994064331,
2223
+ "learning_rate": 0.00033421052631578944,
2224
+ "loss": 0.0639,
2225
+ "step": 315
2226
+ },
2227
+ {
2228
+ "epoch": 1.663157894736842,
2229
+ "grad_norm": 0.6487518548965454,
2230
+ "learning_rate": 0.00033368421052631583,
2231
+ "loss": 0.0317,
2232
+ "step": 316
2233
+ },
2234
+ {
2235
+ "epoch": 1.668421052631579,
2236
+ "grad_norm": 0.45958709716796875,
2237
+ "learning_rate": 0.0003331578947368421,
2238
+ "loss": 0.0354,
2239
+ "step": 317
2240
+ },
2241
+ {
2242
+ "epoch": 1.6736842105263157,
2243
+ "grad_norm": 0.41501811146736145,
2244
+ "learning_rate": 0.00033263157894736846,
2245
+ "loss": 0.0174,
2246
+ "step": 318
2247
+ },
2248
+ {
2249
+ "epoch": 1.6789473684210527,
2250
+ "grad_norm": 0.41986000537872314,
2251
+ "learning_rate": 0.00033210526315789475,
2252
+ "loss": 0.0356,
2253
+ "step": 319
2254
+ },
2255
+ {
2256
+ "epoch": 1.6842105263157894,
2257
+ "grad_norm": 0.45858511328697205,
2258
+ "learning_rate": 0.00033157894736842103,
2259
+ "loss": 0.0328,
2260
+ "step": 320
2261
+ },
2262
+ {
2263
+ "epoch": 1.6894736842105265,
2264
+ "grad_norm": 0.48091039061546326,
2265
+ "learning_rate": 0.00033105263157894737,
2266
+ "loss": 0.0533,
2267
+ "step": 321
2268
+ },
2269
+ {
2270
+ "epoch": 1.694736842105263,
2271
+ "grad_norm": 0.9891145825386047,
2272
+ "learning_rate": 0.00033052631578947366,
2273
+ "loss": 0.0302,
2274
+ "step": 322
2275
+ },
2276
+ {
2277
+ "epoch": 1.7,
2278
+ "grad_norm": 0.413960337638855,
2279
+ "learning_rate": 0.00033,
2280
+ "loss": 0.0607,
2281
+ "step": 323
2282
+ },
2283
+ {
2284
+ "epoch": 1.7052631578947368,
2285
+ "grad_norm": 0.4374825656414032,
2286
+ "learning_rate": 0.00032947368421052634,
2287
+ "loss": 0.0159,
2288
+ "step": 324
2289
+ },
2290
+ {
2291
+ "epoch": 1.7105263157894737,
2292
+ "grad_norm": 0.35670799016952515,
2293
+ "learning_rate": 0.0003289473684210527,
2294
+ "loss": 0.02,
2295
+ "step": 325
2296
+ },
2297
+ {
2298
+ "epoch": 1.7157894736842105,
2299
+ "grad_norm": 0.5692275762557983,
2300
+ "learning_rate": 0.00032842105263157896,
2301
+ "loss": 0.0524,
2302
+ "step": 326
2303
+ },
2304
+ {
2305
+ "epoch": 1.7210526315789474,
2306
+ "grad_norm": 0.39579054713249207,
2307
+ "learning_rate": 0.00032789473684210525,
2308
+ "loss": 0.037,
2309
+ "step": 327
2310
+ },
2311
+ {
2312
+ "epoch": 1.7263157894736842,
2313
+ "grad_norm": 0.2939654588699341,
2314
+ "learning_rate": 0.0003273684210526316,
2315
+ "loss": 0.0137,
2316
+ "step": 328
2317
+ },
2318
+ {
2319
+ "epoch": 1.731578947368421,
2320
+ "grad_norm": 0.8391321301460266,
2321
+ "learning_rate": 0.0003268421052631579,
2322
+ "loss": 0.0421,
2323
+ "step": 329
2324
+ },
2325
+ {
2326
+ "epoch": 1.736842105263158,
2327
+ "grad_norm": 0.35206350684165955,
2328
+ "learning_rate": 0.0003263157894736842,
2329
+ "loss": 0.0226,
2330
+ "step": 330
2331
+ },
2332
+ {
2333
+ "epoch": 1.7421052631578946,
2334
+ "grad_norm": 0.5110854506492615,
2335
+ "learning_rate": 0.0003257894736842105,
2336
+ "loss": 0.0291,
2337
+ "step": 331
2338
+ },
2339
+ {
2340
+ "epoch": 1.7473684210526317,
2341
+ "grad_norm": 0.5658808946609497,
2342
+ "learning_rate": 0.0003252631578947369,
2343
+ "loss": 0.0543,
2344
+ "step": 332
2345
+ },
2346
+ {
2347
+ "epoch": 1.7526315789473683,
2348
+ "grad_norm": 0.9136970043182373,
2349
+ "learning_rate": 0.0003247368421052632,
2350
+ "loss": 0.085,
2351
+ "step": 333
2352
+ },
2353
+ {
2354
+ "epoch": 1.7578947368421054,
2355
+ "grad_norm": 0.6130772829055786,
2356
+ "learning_rate": 0.00032421052631578947,
2357
+ "loss": 0.0797,
2358
+ "step": 334
2359
+ },
2360
+ {
2361
+ "epoch": 1.763157894736842,
2362
+ "grad_norm": 0.22287864983081818,
2363
+ "learning_rate": 0.0003236842105263158,
2364
+ "loss": 0.0099,
2365
+ "step": 335
2366
+ },
2367
+ {
2368
+ "epoch": 1.768421052631579,
2369
+ "grad_norm": 0.38135984539985657,
2370
+ "learning_rate": 0.0003231578947368421,
2371
+ "loss": 0.0279,
2372
+ "step": 336
2373
+ },
2374
+ {
2375
+ "epoch": 1.7736842105263158,
2376
+ "grad_norm": 0.17216333746910095,
2377
+ "learning_rate": 0.00032263157894736843,
2378
+ "loss": 0.0099,
2379
+ "step": 337
2380
+ },
2381
+ {
2382
+ "epoch": 1.7789473684210526,
2383
+ "grad_norm": 0.5448169112205505,
2384
+ "learning_rate": 0.0003221052631578947,
2385
+ "loss": 0.0541,
2386
+ "step": 338
2387
+ },
2388
+ {
2389
+ "epoch": 1.7842105263157895,
2390
+ "grad_norm": 0.5146567225456238,
2391
+ "learning_rate": 0.00032157894736842106,
2392
+ "loss": 0.0239,
2393
+ "step": 339
2394
+ },
2395
+ {
2396
+ "epoch": 1.7894736842105263,
2397
+ "grad_norm": 0.9499697685241699,
2398
+ "learning_rate": 0.0003210526315789474,
2399
+ "loss": 0.0841,
2400
+ "step": 340
2401
+ },
2402
+ {
2403
+ "epoch": 1.7947368421052632,
2404
+ "grad_norm": 1.1383264064788818,
2405
+ "learning_rate": 0.0003205263157894737,
2406
+ "loss": 0.0354,
2407
+ "step": 341
2408
+ },
2409
+ {
2410
+ "epoch": 1.8,
2411
+ "grad_norm": 0.7471795678138733,
2412
+ "learning_rate": 0.00032,
2413
+ "loss": 0.0441,
2414
+ "step": 342
2415
+ },
2416
+ {
2417
+ "epoch": 1.805263157894737,
2418
+ "grad_norm": 0.5698598027229309,
2419
+ "learning_rate": 0.0003194736842105263,
2420
+ "loss": 0.0655,
2421
+ "step": 343
2422
+ },
2423
+ {
2424
+ "epoch": 1.8105263157894735,
2425
+ "grad_norm": 0.615290105342865,
2426
+ "learning_rate": 0.00031894736842105265,
2427
+ "loss": 0.0237,
2428
+ "step": 344
2429
+ },
2430
+ {
2431
+ "epoch": 1.8157894736842106,
2432
+ "grad_norm": 0.6663447022438049,
2433
+ "learning_rate": 0.00031842105263157894,
2434
+ "loss": 0.0589,
2435
+ "step": 345
2436
+ },
2437
+ {
2438
+ "epoch": 1.8210526315789473,
2439
+ "grad_norm": 0.5681447386741638,
2440
+ "learning_rate": 0.0003178947368421053,
2441
+ "loss": 0.024,
2442
+ "step": 346
2443
+ },
2444
+ {
2445
+ "epoch": 1.8263157894736843,
2446
+ "grad_norm": 0.6610841751098633,
2447
+ "learning_rate": 0.00031736842105263156,
2448
+ "loss": 0.0672,
2449
+ "step": 347
2450
+ },
2451
+ {
2452
+ "epoch": 1.831578947368421,
2453
+ "grad_norm": 0.3079202175140381,
2454
+ "learning_rate": 0.00031684210526315785,
2455
+ "loss": 0.0188,
2456
+ "step": 348
2457
+ },
2458
+ {
2459
+ "epoch": 1.836842105263158,
2460
+ "grad_norm": 0.7476277947425842,
2461
+ "learning_rate": 0.00031631578947368424,
2462
+ "loss": 0.0673,
2463
+ "step": 349
2464
+ },
2465
+ {
2466
+ "epoch": 1.8421052631578947,
2467
+ "grad_norm": 0.31603574752807617,
2468
+ "learning_rate": 0.00031578947368421053,
2469
+ "loss": 0.0181,
2470
+ "step": 350
2471
+ },
2472
+ {
2473
+ "epoch": 1.8473684210526315,
2474
+ "grad_norm": 0.5879719257354736,
2475
+ "learning_rate": 0.00031526315789473687,
2476
+ "loss": 0.0166,
2477
+ "step": 351
2478
+ },
2479
+ {
2480
+ "epoch": 1.8526315789473684,
2481
+ "grad_norm": 0.6690747141838074,
2482
+ "learning_rate": 0.00031473684210526316,
2483
+ "loss": 0.0468,
2484
+ "step": 352
2485
+ },
2486
+ {
2487
+ "epoch": 1.8578947368421053,
2488
+ "grad_norm": 0.31422004103660583,
2489
+ "learning_rate": 0.0003142105263157895,
2490
+ "loss": 0.0292,
2491
+ "step": 353
2492
+ },
2493
+ {
2494
+ "epoch": 1.8631578947368421,
2495
+ "grad_norm": 0.9119225740432739,
2496
+ "learning_rate": 0.0003136842105263158,
2497
+ "loss": 0.051,
2498
+ "step": 354
2499
+ },
2500
+ {
2501
+ "epoch": 1.868421052631579,
2502
+ "grad_norm": 0.5634626150131226,
2503
+ "learning_rate": 0.00031315789473684207,
2504
+ "loss": 0.031,
2505
+ "step": 355
2506
+ },
2507
+ {
2508
+ "epoch": 1.8736842105263158,
2509
+ "grad_norm": 0.4134940803050995,
2510
+ "learning_rate": 0.0003126315789473684,
2511
+ "loss": 0.0295,
2512
+ "step": 356
2513
+ },
2514
+ {
2515
+ "epoch": 1.8789473684210525,
2516
+ "grad_norm": 1.009355068206787,
2517
+ "learning_rate": 0.00031210526315789475,
2518
+ "loss": 0.0958,
2519
+ "step": 357
2520
+ },
2521
+ {
2522
+ "epoch": 1.8842105263157896,
2523
+ "grad_norm": 0.5140488147735596,
2524
+ "learning_rate": 0.0003115789473684211,
2525
+ "loss": 0.0242,
2526
+ "step": 358
2527
+ },
2528
+ {
2529
+ "epoch": 1.8894736842105262,
2530
+ "grad_norm": 0.656700074672699,
2531
+ "learning_rate": 0.0003110526315789474,
2532
+ "loss": 0.0369,
2533
+ "step": 359
2534
+ },
2535
+ {
2536
+ "epoch": 1.8947368421052633,
2537
+ "grad_norm": 0.27617648243904114,
2538
+ "learning_rate": 0.0003105263157894737,
2539
+ "loss": 0.015,
2540
+ "step": 360
2541
+ },
2542
+ {
2543
+ "epoch": 1.9,
2544
+ "grad_norm": 0.5509806871414185,
2545
+ "learning_rate": 0.00031,
2546
+ "loss": 0.0303,
2547
+ "step": 361
2548
+ },
2549
+ {
2550
+ "epoch": 1.905263157894737,
2551
+ "grad_norm": 0.312589168548584,
2552
+ "learning_rate": 0.0003094736842105263,
2553
+ "loss": 0.0173,
2554
+ "step": 362
2555
+ },
2556
+ {
2557
+ "epoch": 1.9105263157894736,
2558
+ "grad_norm": 0.37649065256118774,
2559
+ "learning_rate": 0.0003089473684210526,
2560
+ "loss": 0.0229,
2561
+ "step": 363
2562
+ },
2563
+ {
2564
+ "epoch": 1.9157894736842105,
2565
+ "grad_norm": 0.49866557121276855,
2566
+ "learning_rate": 0.0003084210526315789,
2567
+ "loss": 0.0369,
2568
+ "step": 364
2569
+ },
2570
+ {
2571
+ "epoch": 1.9210526315789473,
2572
+ "grad_norm": 0.5967050790786743,
2573
+ "learning_rate": 0.0003078947368421053,
2574
+ "loss": 0.1047,
2575
+ "step": 365
2576
+ },
2577
+ {
2578
+ "epoch": 1.9263157894736842,
2579
+ "grad_norm": 0.5491074919700623,
2580
+ "learning_rate": 0.0003073684210526316,
2581
+ "loss": 0.0527,
2582
+ "step": 366
2583
+ },
2584
+ {
2585
+ "epoch": 1.931578947368421,
2586
+ "grad_norm": 0.5987709760665894,
2587
+ "learning_rate": 0.00030684210526315793,
2588
+ "loss": 0.0413,
2589
+ "step": 367
2590
+ },
2591
+ {
2592
+ "epoch": 1.936842105263158,
2593
+ "grad_norm": 0.5473771691322327,
2594
+ "learning_rate": 0.0003063157894736842,
2595
+ "loss": 0.0575,
2596
+ "step": 368
2597
+ },
2598
+ {
2599
+ "epoch": 1.9421052631578948,
2600
+ "grad_norm": 0.6479553580284119,
2601
+ "learning_rate": 0.0003057894736842105,
2602
+ "loss": 0.0406,
2603
+ "step": 369
2604
+ },
2605
+ {
2606
+ "epoch": 1.9473684210526314,
2607
+ "grad_norm": 0.4294752776622772,
2608
+ "learning_rate": 0.00030526315789473684,
2609
+ "loss": 0.0221,
2610
+ "step": 370
2611
+ },
2612
+ {
2613
+ "epoch": 1.9526315789473685,
2614
+ "grad_norm": 0.592293381690979,
2615
+ "learning_rate": 0.00030473684210526313,
2616
+ "loss": 0.0405,
2617
+ "step": 371
2618
+ },
2619
+ {
2620
+ "epoch": 1.9578947368421051,
2621
+ "grad_norm": 0.3438422977924347,
2622
+ "learning_rate": 0.00030421052631578947,
2623
+ "loss": 0.0186,
2624
+ "step": 372
2625
+ },
2626
+ {
2627
+ "epoch": 1.9631578947368422,
2628
+ "grad_norm": 0.4528750479221344,
2629
+ "learning_rate": 0.0003036842105263158,
2630
+ "loss": 0.0457,
2631
+ "step": 373
2632
+ },
2633
+ {
2634
+ "epoch": 1.9684210526315788,
2635
+ "grad_norm": 0.4549511671066284,
2636
+ "learning_rate": 0.00030315789473684215,
2637
+ "loss": 0.031,
2638
+ "step": 374
2639
+ },
2640
+ {
2641
+ "epoch": 1.973684210526316,
2642
+ "grad_norm": 0.8639032244682312,
2643
+ "learning_rate": 0.00030263157894736844,
2644
+ "loss": 0.1158,
2645
+ "step": 375
2646
+ },
2647
+ {
2648
+ "epoch": 1.9789473684210526,
2649
+ "grad_norm": 0.8165035843849182,
2650
+ "learning_rate": 0.0003021052631578947,
2651
+ "loss": 0.0593,
2652
+ "step": 376
2653
+ },
2654
+ {
2655
+ "epoch": 1.9842105263157894,
2656
+ "grad_norm": 0.3303014039993286,
2657
+ "learning_rate": 0.00030157894736842106,
2658
+ "loss": 0.014,
2659
+ "step": 377
2660
+ },
2661
+ {
2662
+ "epoch": 1.9894736842105263,
2663
+ "grad_norm": 0.7385947108268738,
2664
+ "learning_rate": 0.00030105263157894735,
2665
+ "loss": 0.0876,
2666
+ "step": 378
2667
+ },
2668
+ {
2669
+ "epoch": 1.9947368421052631,
2670
+ "grad_norm": 0.5052048563957214,
2671
+ "learning_rate": 0.0003005263157894737,
2672
+ "loss": 0.0376,
2673
+ "step": 379
2674
+ },
2675
+ {
2676
+ "epoch": 2.0,
2677
+ "grad_norm": 0.959463357925415,
2678
+ "learning_rate": 0.0003,
2679
+ "loss": 0.0503,
2680
+ "step": 380
2681
+ },
2682
+ {
2683
+ "epoch": 2.0,
2684
+ "eval_cer": 20.32831737346101,
2685
+ "eval_loss": 1.3509594202041626,
2686
+ "eval_pred": "| i | Label | Prediction |\n| --- | --- | --- |\n| 0 | I think that in the people in the river. They because I\u2026 | I ... that in the people in the river they they ... I\u2026 |\n| 1 | This is a restaurant because there has | I ... ... ... because they ... ... |\n| 2 | food. | S |\n| 3 | The people in the picture are playing soccer. I\u2019ve played soccer twice before in physical education class and I liked it. Well, mostly because I have really strong muscles in my legs from running, so I have a lot of advantages in soccer. If I was a parent, I would I would agree for my kid to play soccer. Mostly because playing a sport helps you stay healthy and fit and that\u2019s what ??? society thinks you should do. Stay fit and healthy. | The people in the picture are playing soccer. I play\ufffdve play soccer twice before in physical education class and I liked it well Well, mostly because I have really strong muscles in my legs from running so so I have a lot of advantages in soccer. If I was a parent, I ... ... would agree for my kid to play soccer mostly Mostly because playing a sport helps you stay healthy and fit, that's\ufffds what society society thinks you should do, Stay fit and healthy |\n| 4 | And it\u2019s also good for your health too. You can have lower a lower risk of getting any diseases from body fat. Also, the people in this picture mostly are wearing jerseys and shorts. Some of them are wearing knee-high socks. And all of them are wearing sneakers. And the details in here. There are a lot of trees, which I really like. And really beautiful grass. And there are also two buildings in the background. | And it\u2019s also good for your help,, You could have lower ... ... risk of getting any diseases from body fat. Also, the people in this picture mostly are wearing jerseys and shorts. Some of them are wearing ne highhigh socks, And all of them are wearing sneakers. And the details in here, There are a lot of trees which which I really like. And the beautiful grass. And there are also two buildings in the background |\n| 5 | Also a bridge. There is a silver car silver car and there are multiple soccer balls which um means that they are probably probably practicing and not playing against each other. The people um Also there are benches in the background which also indicates that they ??? might might be in a park instead of a soccer field. And there are | umso, bridge, There is a silver car ... car and there are multiple soccer balls which um means that they are probably ... practicing and not playing against each other The The people um also there are benches in the background which also indicates that they might might be be in the park instead of a soccer field. And there are\u2026 |\n| 6 | some\u2026 | There ... |\n| 7 | um I think people in the picture is playing soccer, if I\u2019m not wrong. Yes, eN they are playing soccer. And um did I? I did. When I was in elementary school, we we had we had a class we had a PE class and um the teacher taught us how to play soccer before. But, honestly, I\u2019m very poor at at um at sports, so I\u2019m not really enjoy it. But I did see some people, like my some of my classmate, really do know how to play soccer. | um I think people in the picture is playing soccer, if I\u2019m not wrong, They, they they they are playing soccer and And um did ... ... I did um I I was in my school, we ... ... a ... a class ... ... a PE class and the the teacher taught us how to play soccer before, But honestly honestly, I am\ufffdm very poor at ... ... at sport so so I\u2019m not really enjoy it. But I did say some people like like my ... of my classmatesmates really really do know how to play soccer |\n| 8 | I I was like, oh my god, this is a very good, very, very um phenomenal cause um it's like it\u2019s a it\u2019s very hard to see some for some the students in Taiwan to play soccer, so as I think it\u2019s quite quite cool. And if I am parents, well, um um because I\u2019m not really interested in this, so, um if they want to, of course, I would encourage them, but um if they doesn\u2019t like that, I I won\u2019t force them to do it cause I think it\u2019s not really um, it\u2019s alright. It doesn\u2019t like it\u2019s not | I ... was ..., oh, god, this is a very good ... very ... very um phenomenal because um it\ufffd like,'s\ufffds a ...'s\ufffds very hard to see some ... some the students in Taiwan to play soccer so so it I think it's\ufffds quite ... cool and And if I am parents, well, um because because I am\ufffdm not really interested in this, so um um if they want to, of course I I will encourage them, but um if they doesn\u2019t like that, I ... won't\ufffdt force them to do it, I think it's\ufffds not really um it it\u2019s all, It's\u2019s like ...'s\ufffds not |\n| 9 | necessary and just like if they want, I will. And um if I have time, um so people and it\u2019s um only boys. Why? ish In the picture, they should have girls, but um in the picture, um the only boys in the pictures and uN they separate divided into two groups, is it? And all they\u2019re wearing this long socks is quite cool and um it\u2019s quite a beautiful place. It\u2019s a really beautiful place and um I think they enjoyed very | necessary and just like if they want to I will and And um if I have time, um so people in ...'s\ufffds the the boys why Why ... In ... in the picture they they should have girls, but um in the picture um um the only boys in the pictures, um they they separate ... into two groups, is it? And um they're\ufffdre wearing this long socks is quite cool and um it\u2019s quite a beautiful place, It\u2019s a really beautiful place and um I think they enjoyed very |\n| 10 | much. It's quite\u2026 | um ... it It's a |\n| 11 | Um I think the picture is taken um at a park and it\u2019s a very bright sunny day. And um there are some people are in the park and they are painting. And and there is a woman uh on the right of the picture. She\u2019s sitting on the chair and she is she has short hair and wearing some, wearing dress. And she | um I think the picture is taken um at a park and it\u2019s a very um, day and And um there are some people are in the park and they are painting. And um um is a woman um on the right of the picture, She's\ufffds sitting on the chair and she is ... has short hair and wearing some ... wearing dress and And she um um um |\n| 12 | is painting some trees and I really like the picture. It\u2019s beautiful. And there are a bags beside the woman. I think there\u2019s uh the wore the pants or some color in the bags that she wants to to draw. And there are also um lots of people near nearby the the girl the woman | She painting some trees and I really like the picture it It\u2019s beautiful and And there are a bags uh a woman I I think there's\ufffds uh the ... the ... ... some color in the bags that she wants to ... draw. And there are also um lots of people near ... the ... girl ... woman |\n| 13 | are painting painting and there are two people which has a bags and others are besides her and they are discussing um how to draw the picture. And I think uh the advantage to join a park is that you can really near the the picture you want to draw and it\u2019s in | are ... the the ... are two people which has a backs and others are besides her and I are disgusting um how to draw the picture and And uh think uh the advantage to draw the park is that you can really near the ... picture you want to draw and it's\ufffds ... |\n| 14 | nature scenery\u2026 | um nature scenery |\n| 15 | Well, I see at least nine people in the picture, and I can see that all they are all young men, and they are probably professional soccer players, or a soccer team at school, since they are all wearing sports wears that look quite professional. And, the man at the back of the picture is has a funky look, while he has spiky haircut. | Well, I see at least nine people in the picture and and I can see that all ... are all young men, and they are probably professional soccer players or or a soccer team at school, since they're all wearing sports wear that look quite professional, And the the ... at the back of the picture is ... a funky look, well he has spunkyy hair |\n| 16 | And he\u2019s wearing a white short sleeved t-shirt and like he\u2019s wearing blue shirts and long socks which soccer players usually wear, and he\u2019s also wearing blue sneakers. He\u2019s trying to chase the yellow soccer ball. um I see many people wearing long soccer socks, which really impressed me, and they\u2019re all wearing like um red | And he\u2019s wearing a white--ved t-shirt and like he\u2019s wearing blue shirts and long socks which soccer players usually wear and and he\u2019s also wearing blue sneakers, He\u2019s trying to chase the yellow soccer ball. Um I see many people wearing long soccer socks which which really impressed me and and they're\ufffdre all wearing like um red |\n| 17 | shorts or eh like green shirts with numbers on it. So they might be quite professional. And the weather looks good. At the back of the picture, I could see many trees too. And I could even see the MRT. eh Also, I see two buildings. And I\u2019m not really good at soccer, but I like watching soccer games. So I hope maybe | Sh ... or I I green shirts with numbers on it so So they might be quite professional and And the weather looks good. At the back of the picture I I could see many trees too, And I could even see the MRT. I also I I see two buildings and And I\u2019m not really good at soccer, but I like watching soccer games. So I hope maybe |\n| 18 | I can\u2026 | un ... ... |\n| 19 | I think this might be a room up a up in a up in the top building in the city because the windows out outside the windows there\u2019s a lot of colorful buil buildings and it\u2019s also really high up on the ground. The woman in the middle is playing her violin to the to the guests and and lots of people are taking pictures of her. I think this is a good place to have a celebration because it is really it is really | I think this might be a room up ... ... in the ... in the top building in the city because the windows ... ... the windows there\u2019s a lot of colorful buildings buildings buildings and is's\ufffds also really hype ... the the ground the The woman in the middle is playing her violin to the ... ... guests and ... lots of people are taking pictures of her. I think this is a good place to en a celebration because it is ... is really |\n| 20 | um it looks very comfortable and the food and the food must taste really good. I think um the woman on the the woman on the left is wearing a red skirt red dress and is wearing black heels. She\u2019s looking happily at the woman playing the violin while filming her on the phone. And then there is a waiter on the on the top right corner serving food to the guest that that lives beside him. There is also a man | um It looks really comfortable and the food ... the food must taste really good I I think um the woman on the ... ... on the ... ... wearing a red ... ... dress and is wearing black.. She's\ufffds looking happily at the woman playing the violin while filming her on the phone and And then there is a waiter on ... ... the top right corner serving food to the guests that ... leads it him. There is also um man |\n| 21 | with a with a white T sh with a white shirt smiling happily. And behind the man, there is there is one guy on his iPad while another while another guy looks at him. There is four pers There's four people in total that\u2019s looking at the violin looking at the woman playing the violin, and they seem very satisfied with it. Um On the table, there are wines and different drinks for them. I don\u2019t see any | And ... ... a ... ... ...hh a white shirt smiling happily andAnd behind the men there there ... ... is one guy on his iPad while another ... another guy looks at him there There is four person there's four people in total that's\ufffds looking at the ... ... at the woman playing the violin and and the seem very satisfied with it. On on the table there there are wines and different drinks for the I I don't\ufffdt see any |\n| 22 | food yet, so maybe\u2026 | If, so so maybe |\n| 23 | In the picture, I can see peoples having dinner in a marvelous restaurant. And the woman woman standing in front of the picture are playing the violin. And the people around her are taking photos of of her. They are all smiling and happy with it. I think this | In the picture, I can see people having dinner in a marvelous restaurant and And the woman ... ... in front of the picture are ... the violin, And the people around her are taking photos of ... her. They are all smiling and happy with it. I think |\n| 24 | place is good for people having marriage or having birthday party cause this place can can be full of people. Many people can go to the restaurant at once the same time. And it\u2019s fun | This is ... for people having marriage or having birthday party because this place can ... be full of people ... Many people can go to the restaurant at once ... same time and And it\u2019s fun |\n| 25 | to have so many people celebrate your birthday or marriage. mm And I don\u2019t think there are any children in the picture. I think it might be the all-adults party. And the waiters are | To have so many people celebrate your birthday or a mm And I I don\u2019t think there are any children in the picture I I think it might be the adultsadouts party and And the waiters are |\n| 26 | busy. | Iy |\n| 27 | Here is a river, and the river must in the country because you wouldn't you it's impossible to see this appearance in the city. And if you want to play uh in the river or beside the river, you have to be rea careful that you might get drowned. You have to find some river who has many lifeguard, and you can\u2019t go it uh by yourself alone uh because it's rea really dangerous that if you drown, | Here is a river and and the river must in the country because you ...\ufffd ... ...'s impossible to see this appearance in the city and And if you want to play uh in the river or beside the river, you have to uh careful careful careful that you might get drawn, You have to find some river who has many lifeguard and and you can go\ufffdt go it uh by yourself alone uh because it's re danger re dangerous that if you drown |\n| 28 | no one can rescue you. And I think the place in the picture it's summer because everybody wear t-shirt and shirts and on only summer you will go to river to play because it\u2019s really uh comfortable and cool. And uh in the picture I think there are a big family. There are grandpa, grandma uh s sitting on on the rock in the river and there are many child | No one can rescue you andAnd I think the ... ... the picture is\ufffd summer because everybody wear t-shirt and shirts and ... ... summer you will go to the to play because it's\ufffds really comfortable comfortable and cool. And uh in the picture I think there are a big family there There are grandpa, grandma uh sitting sitting on ... the rock in the river and there are a child |\n| 29 | many child play in the river. uh the river seems really cool and they have a great time to play with uh in it. uh because it is summer, so they everybody want to uh get into the cool river water and play. And there is a old man uh his hand holding a camera or vi camera to take picture. uh he might want | M many child play in the river uh Uh The river thinks really cool and they have a great time to play with uh in it uh uh because it is summer so so they ... want to uh get into the cool river water and play and And there's the old man uh he hand holding a camera ... ... ... to take picture uh uh he might want |\n| 30 | to.... | To ... |\n| 31 | I think the people in the picture are playing soccer. And I ever did the such activities before. And I really like it because playing soccer can make me make a lot of friend. And I when I play soccer, I feel very happy. And if I were a parent, I would encourage my children to do this activities | I think the people in the picture are playing soccer and And I ever did the such activities before and And I really like it because play soccer can make me make a lot of friends. And I ... I play soccer, I feel very happy and And if I were parents parent, I would encourage my children to do this activities |\n| 32 | because I hope they can make they can develop the habit of re doing regular exercise. And I also hope them can know what is the teamwork. And in the picture, I see a man with green clothes with 48 number. He is more | Because I hope they can ... ... can develop the habit of doing doing regular exercise and And I also hope them can know what is the teamwork. And in the picture, I see a man with green clothes, ... s, He is more |\n| 33 | black than others people. And I think a yellow ball floats on the air and a man with white clothes and blue pants is ready for running to get the ball, I think. And in the according to the picture, I think the place is in the | oh ... other people and And I think a yellow ball floats on the air and a man with white clothes and blue pants is ready for running to get the ball, I think. And in the ... to the picture, I think the place is in the |\n| 34 | park. | The |\n| 35 | I think this picture is in the park because there are a lot of tree and a lot of people in there. The girl who sit in in the right of the picture may might look might drawing eh some trees. eh I think through this activity you can enjoy some outside activity and you can smell some | I think this picture is in the park because there are a lot of tree and a lot of people in there ... The girl who sit in ... the ... of the picture may ... look ... drawing a some trees, Uh I think through this activity, can enjoy some outside activity and you can smell some |\n| 36 | fresh airs in the park. And I in this picture, I look some people drawing and looking their pictures. And s there are still another still other people running in front of them. And I think it is a good place to walk and do other activities. | umresh airs in the park and And I ... this picture, I look some people drawing and looking their pictures, And there there are still another ... other people running in front of them. I I think this is a good place to work and to other activities |\n| 37 | There is a girl who still she drawing her pictures that is very beautiful and a lot of trees on it. It is very fresh and very good and very beautiful. I think this is a good activity that you should go to there. If I | There is a girl who still ... drawing her pictures that is very beautiful and a lot of trees on it ... It is very fresh and very good and very beautiful I I think this is a good activity that you should go to there if I |\n| 38 | have a opportunity, I would do that maybe be\u2026 | I a ... I I would do that maybe ... |\n| 39 | This place might be someone is getting married because they look some look someone look really normal and it\u2019s a restaurant. But and the the picture and the middle of the picture, the lady is playing a violin. And a lot of people are taking pictures of him is taking pictures of her. And I don\u2019t think this place is really great for to hold this | This place might be someone is getting married because they look ... ... ... look really normal and it\u2019s a restaurant but But ... the ... picture ... the middle of the picture the the ... is playing a violin and And a lot of people are taking pictures of him ... ... pictures of her and And I don\u2019t think this place is really great for ... ... this |\n| 40 | kind of activity because there are other people in the restaurants too. So I think if they can change to a another place that don\u2019t have too much people. This this will be better. And there are some there are some people in the in the labs taking photos and pictures of the lady playing violin. And the lady | This of activity because there are other people in the restaurants too so So I think if they can change to a ... place, ...\u2019t have too much people, This ... will be better and And there are some ... are some people in the ... the labs taking photos and pictures of the ... playing violin and And the lady |\n| 41 | playing violin is wearing high heels and there are some waiters waiters behind to serve the meals. And there are other customers too. And and the pink and a lady on the left is wearing a pink dress. And others are wearing normal except except of the waiter and the lady playing violin. | un ... is wearing high heels and there are some waiterers ...ers behind to serve the meals and And there are other customers too. And ... a pink ... the lady on the left is wearing a pink dress and And others are wearing normal is of the the waiter and ... ... playing violinun |\n| 42 | It may be a restaurant. Equally some food on the table. The lady stand in the center is playing music. And a man in left is ... | It's be a restaurant who Heally some food on the table. The ... ... ... the center is playing music and And the ... in ... is ... |\n| 43 | It\u2019s cameras. | It's\ufffds ... |\n| 44 | mm This is a restaurant and and she there and she is playing violin. And maybe there | um This is a restaurant and ... ... ... ... ... is playing violin and And maybe there |\n| 45 | And maybe there can | And ... there can\u2026 |\n| 46 | be\u2026 | um ... |\n| 47 | I think thi this there is this is wedding conference because we see many people sit around sit around the table. And I think the mi, the wo | I think the ... ... is ... is wedding conference because we see many people sit around ... around the table and And I think the ... the the ... |\n| 48 | the woman stand in front of everybody is playing violin. And I think this is good to good to hold the campaign in here. | um ... sent in ... of everybody is playing ... and And I think this is good to ... to hold the in ... |\n| 49 | And and and some people are taking pictures the woman playing violin. | And ... ... ... people are taking pictures of woman playing violin |\n| 50 | And the waiters... | And eh way \u2026 |\n| 51 | It may be a restaurant because there are so many temple tem table and some ?? chairs. I think the lady in stand in the middle is playing the pia piano, the violin yes. I think it\u2019s suitable of holding a important and | It may be a restaurant because there are so many temple ... ... and some they they I I think the ... in ... ... the media is playing the ...iano ..., the violin,, I think is's\ufffds suitable of holding a ...\u2026 |\n| 52 | celebrate event because it\u2019s it's their turn enough to show their show their respect. And they all wear a ties dress or dress up with a nature dre dress. And they have a romance romance light and ??? at | um ...ate events because it ...\ufffds ... ... ... true enough to show they ... ... and And they ... a ... ... ... dress are with a ... ... dress and And they are a ... ... ... and ... ... |\n| 53 | head. They are all ??? to to show out in this place. And they are all They have a ??? clothes and they all hold a everyone\u2019s hands are all | They ... They ... all friends friends ... show out in this press and And um ... ... have a responsible and and they all ... a everyone's\ufffds ... are all |\n| 54 | a\u2026 | oh ... |\n| 55 | This is a rest room room because there have a table, round round uh round table and chairs. And this is have this uh the woman is playing uh music instrument and uh this this place uh uh to to this place is | This is the restaurant of because because they have a table ... wrong ... ... round table and the and And eh ... ... ... ... the woman is ... music music instrument and uh this ... place uh ... this ... ... place is |\n| 56 | very good to a red wedding place eh because of uh the there have carpet, there have a high high glass and plate. And everyone uh have a suit and uh uh have a good good manner. This is a brighter and brighter and good a brighter and good place have a shine shine | It good to uh ... waiting place uh because of you the ... have a companies they they a ... ... ... and the and And the uh have a suit and ... have have a ... ... ... and And is a ... and the and the ... brighter and good place. the ... ... |\n| 57 | light and uh very good uh view. eh Everyone is very happy and talk uh enjoy the music and uh have a good uh have a good life. And everyone uh is talking each other heN. I think there is a very good place. And the woman | un and the very good uh view and Everyone everyone is very happy and uh ... enjoy the music and uh have a good ... ... a good life and And uh uh is talking to other eh ... I I think uh is a very good place and And the woman |\n| 58 | play... | um ... |\n| 59 | I think here is Wulai because this there are there is a beautiful s river there and everyone are playing water happily and we should no take the raincoat down because | I think we is a live I because this ... ... ... is a beautiful ... uh there and everyone are playing water happily and we should no take the ... ... done because |\n| 60 | if rain pour and the river will full of water and it might be a flood. Un The perfect season to come here was is summer. Summer is very hot hot so everyone will be play will be | If ... and the river will ... of water and ... ... be a flood um The The perfect season to come here when ... ... ... Summer is very hard pot, everyone will be play, be |\n| 61 | happy to come to come here. And we come to Wulai because this last Sunday was my grandma\u2019s birthday and every everyone in my family planned a surprise to to celebrate at | We to ... ... ... here andAnd we come to like I because this ... Sunday was my grandma\u2019s birthday and every ... in my family plan a surprise to ... celebrate ... |\n| 62 | my\u2026 | My ... |\n| 63 | I think they are in the riverbank because it has water and stones and and have a tree. And you need to and they need to notice that if there if rains, it may it might be a lot of water and may and they might be be caught in they. And I | I think they are in the riverbank because it has to and stones and ... have a tree and And you need to ... they need to notice that if ... ... r, it ... ... might be a lot of water and ... ... they might be ... caught in ... ... And I |\n| 64 | think this place and I think this is is summer because the water is cold and you can enjoy the water and and with your family to spend the afternoon and and the old guy is | I this place ... I think this is ... ... because the ... is cold and you can enjoy the water and ... with your family to spend the afternoon and ... the old guy is |\n| 65 | taking a picture and they seem to have fun in the riverbank and and there are 11 people in there. There're might be a family a family trip. | It a picture and they think to have from in the riverbank and the ... ... ... people in there ... There are ... be a family ... ... creep |\n| 66 | eh\u2026 | And ... |\n| 67 | This place might be a restaurant and and it\u2019s because it\u2019s having a wedding wedding ceremonies. The middle the middle woman is uh woman standing in the middle is playing violins. The restaurant is the best place to having this ceremony because everybody can have great great | This place might be a restaurant and ... it's\ufffds because he's\ufffds having a wedding ... ..., The ... ... ... woman is ... the standing in the middle is playing violins. The restaurant is the best place to having this ceremony because everybody can have great ... |\n| 68 | because everybody can have have to share their happiness to everyone. The middle woman standing in the middle is wearing black black dress and black shoes. The woman stand in the left side is wearing a red red dress. The man beside the red dress woman is wearing a | Because everybody can have ... to share their happiness to everyone The The ... ... standing in the middle is wearing black ... dress and the shoes. The woman stand ... ... ... side is wearing a red ... dress ... The man beside the red dress woman is wearing a |\n| 69 | blue jeans blue jeans and brown color brown pants. And he's heh he is holding his cellphone to take a picture. | uh jeans jeans ... jeans and the color ... pants and And he's ... he ... ... his cellphone to take a picture |\n| 70 | The people in the picture is playing the soccer. I used to playing the soccer in my junior high school exercise class. And I like to playing ball because I prefer to playing outside or outdoor rather than uh staying at home indoor. | The people in the picture is playing the soccer I I used to playing the soccer in my un high school exercise class and And I like to playing ball because I prefer to playing outside or outdoor rather than ... stay at home ... |\n| 71 | If I a prac If I a parent, I would I would push my child child to engage this sports or act activity because the sport is healthy and have some pleasure and can | If I ... ... ... I a present, I ... ... would push my child to to engage this sports or activity because the sport is healthy and have some pleasure and can |\n| 72 | can promote your health. The figure in the pictures is wearing a soccer and a soccer shirt and and foot. They wearing the blue | un ... your health ... The ... in the pictures is wearing the soccer and ... ... shit and ... ... they They were the blog |\n| 73 | clothes and\u2026 | Clothes is ... |\n| 74 | This may be the river eh near the river. And because there are a lot of waters in there, and rocks, they can like, bank in there. And this this this place, you need to notice that the the the mountains upstairs that up up there, you above there, you | This may be the river ... near the river and And because there are a lot of ... in there, and the they they can ... bank bank in there. And this ... ... ..., you need to notice that the ... ... mountains upstairs, ... ... there, you above there, the |\n| 75 | may have the rains and the down the river may have lots of lots of flood uh flooded there. So you may notice that for for the summer. And for or you'll be dead or something. eh You may you can come there in the summer or | You have the rains and the down the river may have lots of ... of floods ... there's So you may notice that for ... the summer and And you ... you will be dead or something, Uh you may ... can come there in the uh or |\n| 76 | spring. Because this quite cold you can there you can be cold there and you don\u2019t really need the\u2026 You can be be ca calm down there and it\u2019s very reliev relieves and just just calm down calm down and and Winter come there is too cold | uh because Because it ... cold, can there ... can be cold there and ... ...\u2019t really need the ... You can be ... ... um down there and it is\ufffds very reallyive ves and just ... come down ... down and ... uh come there it too cold |\n| 77 | and they all\u2026 | And the ... |\n",
2687
+ "eval_runtime": 11.5151,
2688
+ "eval_samples_per_second": 6.774,
2689
+ "eval_steps_per_second": 0.868,
2690
+ "eval_wer": 32.53177602198557,
2691
+ "step": 380
2692
  }
2693
  ],
2694
  "logging_steps": 1,
 
2708
  "attributes": {}
2709
  }
2710
  },
2711
+ "total_flos": 1.07208914632704e+19,
2712
  "train_batch_size": 16,
2713
  "trial_name": null,
2714
  "trial_params": null