RoBERTa-legal-de-cased_German_legal_SQuAD_part_augmented_1000

This model was trained from scratch; the training dataset is not recorded on this card (it is listed as "None"). It achieves the following results on the evaluation set:

  • Loss: 1.5096 (validation loss after the final epoch, epoch 1000)

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 128
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 1000
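
To make the schedule concrete, here is a minimal sketch of the linear decay implied by the settings above. It assumes zero warmup steps (the card lists none) and takes the total of 4000 optimizer steps from the last row of the training results. `linear_lr` is an illustrative helper, not a library function.

```python
BASE_LR = 2e-05       # learning_rate from the list above
TOTAL_STEPS = 4000    # 1000 epochs x 4 steps per epoch, per the training results
WARMUP_STEPS = 0      # assumption: no warmup is listed on the card


def linear_lr(step: int) -> float:
    """Learning rate at `step`: linear warmup (if any), then linear decay to zero."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # Print the rate at a few points along the run.
    for step in (0, 1000, 2000, 3000, 4000):
        print(f"step {step:>4}: lr = {linear_lr(step):.2e}")
```

Under these assumptions the rate falls to zero at step 4000, which would be consistent with the validation loss barely changing over the last few dozen epochs.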

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 4 6.2837
No log 2.0 8 6.2533
No log 3.0 12 5.9481
No log 4.0 16 5.3996
No log 5.0 20 5.1070
No log 6.0 24 4.8339
No log 7.0 28 4.4593
No log 8.0 32 4.2903
No log 9.0 36 4.0889
No log 10.0 40 3.9262
No log 11.0 44 3.6990
No log 12.0 48 3.4630
No log 13.0 52 3.3548
No log 14.0 56 3.2229
No log 15.0 60 3.0759
No log 16.0 64 2.9493
No log 17.0 68 2.8026
No log 18.0 72 2.7271
No log 19.0 76 2.6537
No log 20.0 80 2.4722
No log 21.0 84 2.4787
No log 22.0 88 2.2904
No log 23.0 92 2.2285
No log 24.0 96 2.0506
No log 25.0 100 2.0258
No log 26.0 104 2.0884
No log 27.0 108 1.7258
No log 28.0 112 1.8528
No log 29.0 116 1.6107
No log 30.0 120 1.8070
No log 31.0 124 1.6551
No log 32.0 128 1.5060
No log 33.0 132 1.4780
No log 34.0 136 1.5360
No log 35.0 140 1.5973
No log 36.0 144 1.5081
No log 37.0 148 1.5293
No log 38.0 152 1.5699
No log 39.0 156 1.3789
No log 40.0 160 1.3797
No log 41.0 164 1.4293
No log 42.0 168 1.3601
No log 43.0 172 1.3191
No log 44.0 176 1.2775
No log 45.0 180 1.3186
No log 46.0 184 1.2806
No log 47.0 188 1.3256
No log 48.0 192 1.3248
No log 49.0 196 1.3389
No log 50.0 200 1.2985
No log 51.0 204 1.2470
No log 52.0 208 1.2510
No log 53.0 212 1.3762
No log 54.0 216 1.2690
No log 55.0 220 1.3169
No log 56.0 224 1.3142
No log 57.0 228 1.2948
No log 58.0 232 1.3216
No log 59.0 236 1.2613
No log 60.0 240 1.3178
No log 61.0 244 1.2888
No log 62.0 248 1.2605
No log 63.0 252 1.2280
No log 64.0 256 1.2609
No log 65.0 260 1.3002
No log 66.0 264 1.2784
No log 67.0 268 1.1941
No log 68.0 272 1.2645
No log 69.0 276 1.2529
No log 70.0 280 1.2423
No log 71.0 284 1.2375
No log 72.0 288 1.2572
No log 73.0 292 1.2493
No log 74.0 296 1.2512
No log 75.0 300 1.2128
No log 76.0 304 1.2246
No log 77.0 308 1.2307
No log 78.0 312 1.2836
No log 79.0 316 1.2728
No log 80.0 320 1.2418
No log 81.0 324 1.2433
No log 82.0 328 1.2358
No log 83.0 332 1.2394
No log 84.0 336 1.2541
No log 85.0 340 1.3406
No log 86.0 344 1.2982
No log 87.0 348 1.2103
No log 88.0 352 1.3101
No log 89.0 356 1.3908
No log 90.0 360 1.3210
No log 91.0 364 1.3641
No log 92.0 368 1.3017
No log 93.0 372 1.3622
No log 94.0 376 1.2857
No log 95.0 380 1.3221
No log 96.0 384 1.2996
No log 97.0 388 1.3561
No log 98.0 392 1.2376
No log 99.0 396 1.3170
No log 100.0 400 1.3494
No log 101.0 404 1.3560
No log 102.0 408 1.2526
No log 103.0 412 1.3123
No log 104.0 416 1.4024
No log 105.0 420 1.3408
No log 106.0 424 1.3860
No log 107.0 428 1.4732
No log 108.0 432 1.3537
No log 109.0 436 1.3521
No log 110.0 440 1.3370
No log 111.0 444 1.4081
No log 112.0 448 1.3721
No log 113.0 452 1.4037
No log 114.0 456 1.3577
No log 115.0 460 1.3530
No log 116.0 464 1.3035
No log 117.0 468 1.3942
No log 118.0 472 1.4689
No log 119.0 476 1.4912
No log 120.0 480 1.3490
No log 121.0 484 1.4376
No log 122.0 488 1.5024
No log 123.0 492 1.4275
No log 124.0 496 1.3304
1.1769 125.0 500 1.3018
1.1769 126.0 504 1.3479
1.1769 127.0 508 1.2967
1.1769 128.0 512 1.3512
1.1769 129.0 516 1.2752
1.1769 130.0 520 1.3445
1.1769 131.0 524 1.4843
1.1769 132.0 528 1.4238
1.1769 133.0 532 1.3526
1.1769 134.0 536 1.4211
1.1769 135.0 540 1.3692
1.1769 136.0 544 1.4308
1.1769 137.0 548 1.4661
1.1769 138.0 552 1.4091
1.1769 139.0 556 1.4392
1.1769 140.0 560 1.4283
1.1769 141.0 564 1.3185
1.1769 142.0 568 1.3383
1.1769 143.0 572 1.5038
1.1769 144.0 576 1.5369
1.1769 145.0 580 1.5651
1.1769 146.0 584 1.4471
1.1769 147.0 588 1.3823
1.1769 148.0 592 1.4221
1.1769 149.0 596 1.4198
1.1769 150.0 600 1.4250
1.1769 151.0 604 1.3998
1.1769 152.0 608 1.3911
1.1769 153.0 612 1.3563
1.1769 154.0 616 1.4110
1.1769 155.0 620 1.4465
1.1769 156.0 624 1.4382
1.1769 157.0 628 1.3671
1.1769 158.0 632 1.3565
1.1769 159.0 636 1.3593
1.1769 160.0 640 1.3381
1.1769 161.0 644 1.4077
1.1769 162.0 648 1.5401
1.1769 163.0 652 1.5009
1.1769 164.0 656 1.3751
1.1769 165.0 660 1.4234
1.1769 166.0 664 1.4498
1.1769 167.0 668 1.3968
1.1769 168.0 672 1.4349
1.1769 169.0 676 1.3392
1.1769 170.0 680 1.3445
1.1769 171.0 684 1.4489
1.1769 172.0 688 1.4760
1.1769 173.0 692 1.3868
1.1769 174.0 696 1.4113
1.1769 175.0 700 1.4980
1.1769 176.0 704 1.4751
1.1769 177.0 708 1.4916
1.1769 178.0 712 1.4633
1.1769 179.0 716 1.3422
1.1769 180.0 720 1.4274
1.1769 181.0 724 1.5459
1.1769 182.0 728 1.5025
1.1769 183.0 732 1.4013
1.1769 184.0 736 1.4751
1.1769 185.0 740 1.5027
1.1769 186.0 744 1.4690
1.1769 187.0 748 1.4428
1.1769 188.0 752 1.5449
1.1769 189.0 756 1.6188
1.1769 190.0 760 1.4704
1.1769 191.0 764 1.4760
1.1769 192.0 768 1.6057
1.1769 193.0 772 1.5716
1.1769 194.0 776 1.4465
1.1769 195.0 780 1.4373
1.1769 196.0 784 1.3818
1.1769 197.0 788 1.4692
1.1769 198.0 792 1.5798
1.1769 199.0 796 1.5460
1.1769 200.0 800 1.4693
1.1769 201.0 804 1.4566
1.1769 202.0 808 1.4881
1.1769 203.0 812 1.4454
1.1769 204.0 816 1.3877
1.1769 205.0 820 1.3506
1.1769 206.0 824 1.4355
1.1769 207.0 828 1.4553
1.1769 208.0 832 1.4904
1.1769 209.0 836 1.5774
1.1769 210.0 840 1.6123
1.1769 211.0 844 1.6387
1.1769 212.0 848 1.5709
1.1769 213.0 852 1.4701
1.1769 214.0 856 1.4261
1.1769 215.0 860 1.4665
1.1769 216.0 864 1.5617
1.1769 217.0 868 1.6086
1.1769 218.0 872 1.6149
1.1769 219.0 876 1.6129
1.1769 220.0 880 1.4981
1.1769 221.0 884 1.4594
1.1769 222.0 888 1.5126
1.1769 223.0 892 1.5631
1.1769 224.0 896 1.5409
1.1769 225.0 900 1.5660
1.1769 226.0 904 1.5185
1.1769 227.0 908 1.4195
1.1769 228.0 912 1.4034
1.1769 229.0 916 1.3628
1.1769 230.0 920 1.3470
1.1769 231.0 924 1.4427
1.1769 232.0 928 1.4713
1.1769 233.0 932 1.4139
1.1769 234.0 936 1.3960
1.1769 235.0 940 1.4397
1.1769 236.0 944 1.4789
1.1769 237.0 948 1.5018
1.1769 238.0 952 1.4602
1.1769 239.0 956 1.4550
1.1769 240.0 960 1.4691
1.1769 241.0 964 1.5159
1.1769 242.0 968 1.5266
1.1769 243.0 972 1.4622
1.1769 244.0 976 1.4635
1.1769 245.0 980 1.4470
1.1769 246.0 984 1.4726
1.1769 247.0 988 1.4950
1.1769 248.0 992 1.4698
1.1769 249.0 996 1.4429
0.4184 250.0 1000 1.4376
0.4184 251.0 1004 1.3997
0.4184 252.0 1008 1.3256
0.4184 253.0 1012 1.3644
0.4184 254.0 1016 1.4549
0.4184 255.0 1020 1.4765
0.4184 256.0 1024 1.4533
0.4184 257.0 1028 1.4265
0.4184 258.0 1032 1.4132
0.4184 259.0 1036 1.4635
0.4184 260.0 1040 1.5026
0.4184 261.0 1044 1.4552
0.4184 262.0 1048 1.4154
0.4184 263.0 1052 1.4397
0.4184 264.0 1056 1.4343
0.4184 265.0 1060 1.4089
0.4184 266.0 1064 1.4080
0.4184 267.0 1068 1.4571
0.4184 268.0 1072 1.5073
0.4184 269.0 1076 1.5459
0.4184 270.0 1080 1.5103
0.4184 271.0 1084 1.4620
0.4184 272.0 1088 1.4497
0.4184 273.0 1092 1.4794
0.4184 274.0 1096 1.5676
0.4184 275.0 1100 1.5996
0.4184 276.0 1104 1.5515
0.4184 277.0 1108 1.5303
0.4184 278.0 1112 1.4773
0.4184 279.0 1116 1.4489
0.4184 280.0 1120 1.4761
0.4184 281.0 1124 1.5169
0.4184 282.0 1128 1.5055
0.4184 283.0 1132 1.4855
0.4184 284.0 1136 1.4738
0.4184 285.0 1140 1.4803
0.4184 286.0 1144 1.5080
0.4184 287.0 1148 1.4692
0.4184 288.0 1152 1.4427
0.4184 289.0 1156 1.4967
0.4184 290.0 1160 1.6093
0.4184 291.0 1164 1.7077
0.4184 292.0 1168 1.7230
0.4184 293.0 1172 1.6213
0.4184 294.0 1176 1.4616
0.4184 295.0 1180 1.4572
0.4184 296.0 1184 1.5369
0.4184 297.0 1188 1.5490
0.4184 298.0 1192 1.5599
0.4184 299.0 1196 1.5460
0.4184 300.0 1200 1.5229
0.4184 301.0 1204 1.4836
0.4184 302.0 1208 1.4673
0.4184 303.0 1212 1.4770
0.4184 304.0 1216 1.4569
0.4184 305.0 1220 1.4321
0.4184 306.0 1224 1.4390
0.4184 307.0 1228 1.4474
0.4184 308.0 1232 1.5174
0.4184 309.0 1236 1.5615
0.4184 310.0 1240 1.5636
0.4184 311.0 1244 1.5368
0.4184 312.0 1248 1.5083
0.4184 313.0 1252 1.4660
0.4184 314.0 1256 1.4645
0.4184 315.0 1260 1.5187
0.4184 316.0 1264 1.6259
0.4184 317.0 1268 1.6653
0.4184 318.0 1272 1.6218
0.4184 319.0 1276 1.5372
0.4184 320.0 1280 1.5218
0.4184 321.0 1284 1.5923
0.4184 322.0 1288 1.6150
0.4184 323.0 1292 1.6162
0.4184 324.0 1296 1.6154
0.4184 325.0 1300 1.5934
0.4184 326.0 1304 1.5683
0.4184 327.0 1308 1.5512
0.4184 328.0 1312 1.5416
0.4184 329.0 1316 1.5535
0.4184 330.0 1320 1.6359
0.4184 331.0 1324 1.7002
0.4184 332.0 1328 1.6695
0.4184 333.0 1332 1.6352
0.4184 334.0 1336 1.6134
0.4184 335.0 1340 1.6155
0.4184 336.0 1344 1.6026
0.4184 337.0 1348 1.5935
0.4184 338.0 1352 1.5867
0.4184 339.0 1356 1.5870
0.4184 340.0 1360 1.6008
0.4184 341.0 1364 1.5968
0.4184 342.0 1368 1.6040
0.4184 343.0 1372 1.6181
0.4184 344.0 1376 1.6164
0.4184 345.0 1380 1.6434
0.4184 346.0 1384 1.6501
0.4184 347.0 1388 1.6337
0.4184 348.0 1392 1.5782
0.4184 349.0 1396 1.5265
0.4184 350.0 1400 1.5207
0.4184 351.0 1404 1.5469
0.4184 352.0 1408 1.5381
0.4184 353.0 1412 1.5145
0.4184 354.0 1416 1.5194
0.4184 355.0 1420 1.5902
0.4184 356.0 1424 1.6511
0.4184 357.0 1428 1.6339
0.4184 358.0 1432 1.6251
0.4184 359.0 1436 1.6172
0.4184 360.0 1440 1.6048
0.4184 361.0 1444 1.5671
0.4184 362.0 1448 1.5452
0.4184 363.0 1452 1.5462
0.4184 364.0 1456 1.5718
0.4184 365.0 1460 1.6247
0.4184 366.0 1464 1.6518
0.4184 367.0 1468 1.6479
0.4184 368.0 1472 1.5929
0.4184 369.0 1476 1.5627
0.4184 370.0 1480 1.5416
0.4184 371.0 1484 1.5658
0.4184 372.0 1488 1.5853
0.4184 373.0 1492 1.5762
0.4184 374.0 1496 1.5747
0.403 375.0 1500 1.6016
0.403 376.0 1504 1.5876
0.403 377.0 1508 1.5634
0.403 378.0 1512 1.5487
0.403 379.0 1516 1.5550
0.403 380.0 1520 1.5482
0.403 381.0 1524 1.5366
0.403 382.0 1528 1.4986
0.403 383.0 1532 1.4762
0.403 384.0 1536 1.4525
0.403 385.0 1540 1.4381
0.403 386.0 1544 1.4430
0.403 387.0 1548 1.5026
0.403 388.0 1552 1.5585
0.403 389.0 1556 1.5639
0.403 390.0 1560 1.5419
0.403 391.0 1564 1.5009
0.403 392.0 1568 1.4566
0.403 393.0 1572 1.4410
0.403 394.0 1576 1.4463
0.403 395.0 1580 1.5042
0.403 396.0 1584 1.5396
0.403 397.0 1588 1.5156
0.403 398.0 1592 1.4753
0.403 399.0 1596 1.4929
0.403 400.0 1600 1.5262
0.403 401.0 1604 1.5380
0.403 402.0 1608 1.5223
0.403 403.0 1612 1.5137
0.403 404.0 1616 1.5164
0.403 405.0 1620 1.5077
0.403 406.0 1624 1.4870
0.403 407.0 1628 1.4976
0.403 408.0 1632 1.5187
0.403 409.0 1636 1.5094
0.403 410.0 1640 1.5285
0.403 411.0 1644 1.5810
0.403 412.0 1648 1.6046
0.403 413.0 1652 1.6018
0.403 414.0 1656 1.5656
0.403 415.0 1660 1.5039
0.403 416.0 1664 1.4686
0.403 417.0 1668 1.5452
0.403 418.0 1672 1.5991
0.403 419.0 1676 1.6294
0.403 420.0 1680 1.6170
0.403 421.0 1684 1.5730
0.403 422.0 1688 1.5304
0.403 423.0 1692 1.4830
0.403 424.0 1696 1.4814
0.403 425.0 1700 1.5355
0.403 426.0 1704 1.5737
0.403 427.0 1708 1.5733
0.403 428.0 1712 1.5686
0.403 429.0 1716 1.6145
0.403 430.0 1720 1.6108
0.403 431.0 1724 1.5805
0.403 432.0 1728 1.5336
0.403 433.0 1732 1.5152
0.403 434.0 1736 1.5305
0.403 435.0 1740 1.5345
0.403 436.0 1744 1.5277
0.403 437.0 1748 1.5199
0.403 438.0 1752 1.5255
0.403 439.0 1756 1.5212
0.403 440.0 1760 1.5225
0.403 441.0 1764 1.5280
0.403 442.0 1768 1.5732
0.403 443.0 1772 1.5911
0.403 444.0 1776 1.5708
0.403 445.0 1780 1.5290
0.403 446.0 1784 1.4492
0.403 447.0 1788 1.3949
0.403 448.0 1792 1.3873
0.403 449.0 1796 1.3988
0.403 450.0 1800 1.4091
0.403 451.0 1804 1.4142
0.403 452.0 1808 1.4216
0.403 453.0 1812 1.4320
0.403 454.0 1816 1.4388
0.403 455.0 1820 1.4414
0.403 456.0 1824 1.4931
0.403 457.0 1828 1.5570
0.403 458.0 1832 1.5959
0.403 459.0 1836 1.5853
0.403 460.0 1840 1.5050
0.403 461.0 1844 1.4635
0.403 462.0 1848 1.4427
0.403 463.0 1852 1.4934
0.403 464.0 1856 1.5395
0.403 465.0 1860 1.5815
0.403 466.0 1864 1.5800
0.403 467.0 1868 1.5742
0.403 468.0 1872 1.5501
0.403 469.0 1876 1.5096
0.403 470.0 1880 1.4732
0.403 471.0 1884 1.4491
0.403 472.0 1888 1.4311
0.403 473.0 1892 1.4276
0.403 474.0 1896 1.4415
0.403 475.0 1900 1.4440
0.403 476.0 1904 1.4470
0.403 477.0 1908 1.4525
0.403 478.0 1912 1.4505
0.403 479.0 1916 1.5002
0.403 480.0 1920 1.6120
0.403 481.0 1924 1.6583
0.403 482.0 1928 1.6150
0.403 483.0 1932 1.5865
0.403 484.0 1936 1.5523
0.403 485.0 1940 1.5007
0.403 486.0 1944 1.4920
0.403 487.0 1948 1.4982
0.403 488.0 1952 1.5340
0.403 489.0 1956 1.5504
0.403 490.0 1960 1.5587
0.403 491.0 1964 1.5618
0.403 492.0 1968 1.5556
0.403 493.0 1972 1.5658
0.403 494.0 1976 1.5470
0.403 495.0 1980 1.5113
0.403 496.0 1984 1.4983
0.403 497.0 1988 1.5022
0.403 498.0 1992 1.5066
0.403 499.0 1996 1.5115
0.4024 500.0 2000 1.5116
0.4024 501.0 2004 1.5218
0.4024 502.0 2008 1.5240
0.4024 503.0 2012 1.4856
0.4024 504.0 2016 1.4586
0.4024 505.0 2020 1.4481
0.4024 506.0 2024 1.4423
0.4024 507.0 2028 1.4435
0.4024 508.0 2032 1.4508
0.4024 509.0 2036 1.4464
0.4024 510.0 2040 1.4419
0.4024 511.0 2044 1.4568
0.4024 512.0 2048 1.4571
0.4024 513.0 2052 1.4427
0.4024 514.0 2056 1.4459
0.4024 515.0 2060 1.4544
0.4024 516.0 2064 1.4665
0.4024 517.0 2068 1.4824
0.4024 518.0 2072 1.4981
0.4024 519.0 2076 1.4992
0.4024 520.0 2080 1.4854
0.4024 521.0 2084 1.4697
0.4024 522.0 2088 1.4499
0.4024 523.0 2092 1.4268
0.4024 524.0 2096 1.4076
0.4024 525.0 2100 1.3958
0.4024 526.0 2104 1.3957
0.4024 527.0 2108 1.3953
0.4024 528.0 2112 1.4094
0.4024 529.0 2116 1.4228
0.4024 530.0 2120 1.4343
0.4024 531.0 2124 1.4362
0.4024 532.0 2128 1.4271
0.4024 533.0 2132 1.4120
0.4024 534.0 2136 1.4023
0.4024 535.0 2140 1.3896
0.4024 536.0 2144 1.4249
0.4024 537.0 2148 1.4613
0.4024 538.0 2152 1.4893
0.4024 539.0 2156 1.5004
0.4024 540.0 2160 1.4948
0.4024 541.0 2164 1.4828
0.4024 542.0 2168 1.4530
0.4024 543.0 2172 1.4373
0.4024 544.0 2176 1.4257
0.4024 545.0 2180 1.4385
0.4024 546.0 2184 1.4539
0.4024 547.0 2188 1.4566
0.4024 548.0 2192 1.4535
0.4024 549.0 2196 1.4526
0.4024 550.0 2200 1.4510
0.4024 551.0 2204 1.4496
0.4024 552.0 2208 1.4515
0.4024 553.0 2212 1.4530
0.4024 554.0 2216 1.4621
0.4024 555.0 2220 1.4715
0.4024 556.0 2224 1.4751
0.4024 557.0 2228 1.4794
0.4024 558.0 2232 1.4853
0.4024 559.0 2236 1.4989
0.4024 560.0 2240 1.5042
0.4024 561.0 2244 1.5067
0.4024 562.0 2248 1.5144
0.4024 563.0 2252 1.5396
0.4024 564.0 2256 1.5792
0.4024 565.0 2260 1.5966
0.4024 566.0 2264 1.5967
0.4024 567.0 2268 1.5786
0.4024 568.0 2272 1.5526
0.4024 569.0 2276 1.5258
0.4024 570.0 2280 1.5030
0.4024 571.0 2284 1.4949
0.4024 572.0 2288 1.4925
0.4024 573.0 2292 1.4942
0.4024 574.0 2296 1.5047
0.4024 575.0 2300 1.5211
0.4024 576.0 2304 1.5308
0.4024 577.0 2308 1.5393
0.4024 578.0 2312 1.5351
0.4024 579.0 2316 1.5045
0.4024 580.0 2320 1.4750
0.4024 581.0 2324 1.4815
0.4024 582.0 2328 1.4967
0.4024 583.0 2332 1.4975
0.4024 584.0 2336 1.4911
0.4024 585.0 2340 1.4834
0.4024 586.0 2344 1.4816
0.4024 587.0 2348 1.4838
0.4024 588.0 2352 1.4683
0.4024 589.0 2356 1.4546
0.4024 590.0 2360 1.4426
0.4024 591.0 2364 1.4359
0.4024 592.0 2368 1.4394
0.4024 593.0 2372 1.4694
0.4024 594.0 2376 1.4913
0.4024 595.0 2380 1.5092
0.4024 596.0 2384 1.5184
0.4024 597.0 2388 1.5213
0.4024 598.0 2392 1.5543
0.4024 599.0 2396 1.5954
0.4024 600.0 2400 1.6103
0.4024 601.0 2404 1.6098
0.4024 602.0 2408 1.5797
0.4024 603.0 2412 1.5508
0.4024 604.0 2416 1.5505
0.4024 605.0 2420 1.5737
0.4024 606.0 2424 1.6098
0.4024 607.0 2428 1.6140
0.4024 608.0 2432 1.5859
0.4024 609.0 2436 1.5638
0.4024 610.0 2440 1.5348
0.4024 611.0 2444 1.4999
0.4024 612.0 2448 1.4936
0.4024 613.0 2452 1.4992
0.4024 614.0 2456 1.4965
0.4024 615.0 2460 1.4976
0.4024 616.0 2464 1.5261
0.4024 617.0 2468 1.5552
0.4024 618.0 2472 1.5636
0.4024 619.0 2476 1.5605
0.4024 620.0 2480 1.5478
0.4024 621.0 2484 1.5333
0.4024 622.0 2488 1.5142
0.4024 623.0 2492 1.5076
0.4024 624.0 2496 1.5063
0.4015 625.0 2500 1.5049
0.4015 626.0 2504 1.5068
0.4015 627.0 2508 1.5088
0.4015 628.0 2512 1.5203
0.4015 629.0 2516 1.5283
0.4015 630.0 2520 1.5407
0.4015 631.0 2524 1.5442
0.4015 632.0 2528 1.5426
0.4015 633.0 2532 1.5418
0.4015 634.0 2536 1.5692
0.4015 635.0 2540 1.5733
0.4015 636.0 2544 1.5586
0.4015 637.0 2548 1.5299
0.4015 638.0 2552 1.5009
0.4015 639.0 2556 1.4912
0.4015 640.0 2560 1.4850
0.4015 641.0 2564 1.4784
0.4015 642.0 2568 1.4744
0.4015 643.0 2572 1.4697
0.4015 644.0 2576 1.4705
0.4015 645.0 2580 1.4801
0.4015 646.0 2584 1.5035
0.4015 647.0 2588 1.5196
0.4015 648.0 2592 1.5159
0.4015 649.0 2596 1.4930
0.4015 650.0 2600 1.4696
0.4015 651.0 2604 1.4524
0.4015 652.0 2608 1.4457
0.4015 653.0 2612 1.4458
0.4015 654.0 2616 1.4538
0.4015 655.0 2620 1.4621
0.4015 656.0 2624 1.4501
0.4015 657.0 2628 1.4451
0.4015 658.0 2632 1.4440
0.4015 659.0 2636 1.4448
0.4015 660.0 2640 1.4493
0.4015 661.0 2644 1.4610
0.4015 662.0 2648 1.4716
0.4015 663.0 2652 1.4873
0.4015 664.0 2656 1.4878
0.4015 665.0 2660 1.4846
0.4015 666.0 2664 1.4751
0.4015 667.0 2668 1.4794
0.4015 668.0 2672 1.4909
0.4015 669.0 2676 1.4962
0.4015 670.0 2680 1.4947
0.4015 671.0 2684 1.4937
0.4015 672.0 2688 1.4947
0.4015 673.0 2692 1.5155
0.4015 674.0 2696 1.5275
0.4015 675.0 2700 1.5334
0.4015 676.0 2704 1.5314
0.4015 677.0 2708 1.5217
0.4015 678.0 2712 1.5107
0.4015 679.0 2716 1.4861
0.4015 680.0 2720 1.4675
0.4015 681.0 2724 1.4513
0.4015 682.0 2728 1.4430
0.4015 683.0 2732 1.4404
0.4015 684.0 2736 1.4411
0.4015 685.0 2740 1.4473
0.4015 686.0 2744 1.4560
0.4015 687.0 2748 1.4980
0.4015 688.0 2752 1.5234
0.4015 689.0 2756 1.4941
0.4015 690.0 2760 1.4641
0.4015 691.0 2764 1.4380
0.4015 692.0 2768 1.4186
0.4015 693.0 2772 1.4153
0.4015 694.0 2776 1.4187
0.4015 695.0 2780 1.4162
0.4015 696.0 2784 1.4143
0.4015 697.0 2788 1.4212
0.4015 698.0 2792 1.4432
0.4015 699.0 2796 1.4601
0.4015 700.0 2800 1.4692
0.4015 701.0 2804 1.4744
0.4015 702.0 2808 1.4713
0.4015 703.0 2812 1.4607
0.4015 704.0 2816 1.4542
0.4015 705.0 2820 1.4502
0.4015 706.0 2824 1.4510
0.4015 707.0 2828 1.4470
0.4015 708.0 2832 1.4681
0.4015 709.0 2836 1.4942
0.4015 710.0 2840 1.5013
0.4015 711.0 2844 1.4965
0.4015 712.0 2848 1.4914
0.4015 713.0 2852 1.4880
0.4015 714.0 2856 1.4832
0.4015 715.0 2860 1.4706
0.4015 716.0 2864 1.4516
0.4015 717.0 2868 1.4337
0.4015 718.0 2872 1.4228
0.4015 719.0 2876 1.4346
0.4015 720.0 2880 1.4548
0.4015 721.0 2884 1.4671
0.4015 722.0 2888 1.4739
0.4015 723.0 2892 1.4782
0.4015 724.0 2896 1.4797
0.4015 725.0 2900 1.4808
0.4015 726.0 2904 1.4876
0.4015 727.0 2908 1.4954
0.4015 728.0 2912 1.4947
0.4015 729.0 2916 1.5069
0.4015 730.0 2920 1.5065
0.4015 731.0 2924 1.5027
0.4015 732.0 2928 1.4995
0.4015 733.0 2932 1.4904
0.4015 734.0 2936 1.4799
0.4015 735.0 2940 1.4717
0.4015 736.0 2944 1.4581
0.4015 737.0 2948 1.4548
0.4015 738.0 2952 1.4662
0.4015 739.0 2956 1.4771
0.4015 740.0 2960 1.4662
0.4015 741.0 2964 1.4586
0.4015 742.0 2968 1.4509
0.4015 743.0 2972 1.4447
0.4015 744.0 2976 1.4529
0.4015 745.0 2980 1.4662
0.4015 746.0 2984 1.4690
0.4015 747.0 2988 1.4660
0.4015 748.0 2992 1.4582
0.4015 749.0 2996 1.4479
0.4001 750.0 3000 1.4423
0.4001 751.0 3004 1.4392
0.4001 752.0 3008 1.4525
0.4001 753.0 3012 1.5241
0.4001 754.0 3016 1.5544
0.4001 755.0 3020 1.5489
0.4001 756.0 3024 1.5279
0.4001 757.0 3028 1.5274
0.4001 758.0 3032 1.5301
0.4001 759.0 3036 1.5359
0.4001 760.0 3040 1.5737
0.4001 761.0 3044 1.5974
0.4001 762.0 3048 1.5996
0.4001 763.0 3052 1.6004
0.4001 764.0 3056 1.6019
0.4001 765.0 3060 1.5847
0.4001 766.0 3064 1.5686
0.4001 767.0 3068 1.5514
0.4001 768.0 3072 1.5324
0.4001 769.0 3076 1.5123
0.4001 770.0 3080 1.4964
0.4001 771.0 3084 1.4862
0.4001 772.0 3088 1.4782
0.4001 773.0 3092 1.4780
0.4001 774.0 3096 1.4762
0.4001 775.0 3100 1.4691
0.4001 776.0 3104 1.4632
0.4001 777.0 3108 1.4611
0.4001 778.0 3112 1.4725
0.4001 779.0 3116 1.4805
0.4001 780.0 3120 1.4796
0.4001 781.0 3124 1.4769
0.4001 782.0 3128 1.4737
0.4001 783.0 3132 1.4679
0.4001 784.0 3136 1.4578
0.4001 785.0 3140 1.4514
0.4001 786.0 3144 1.4493
0.4001 787.0 3148 1.4479
0.4001 788.0 3152 1.4517
0.4001 789.0 3156 1.4652
0.4001 790.0 3160 1.4743
0.4001 791.0 3164 1.4811
0.4001 792.0 3168 1.4871
0.4001 793.0 3172 1.4907
0.4001 794.0 3176 1.4923
0.4001 795.0 3180 1.4931
0.4001 796.0 3184 1.4951
0.4001 797.0 3188 1.4946
0.4001 798.0 3192 1.5247
0.4001 799.0 3196 1.5451
0.4001 800.0 3200 1.5432
0.4001 801.0 3204 1.5257
0.4001 802.0 3208 1.5064
0.4001 803.0 3212 1.4682
0.4001 804.0 3216 1.4288
0.4001 805.0 3220 1.4012
0.4001 806.0 3224 1.3906
0.4001 807.0 3228 1.3962
0.4001 808.0 3232 1.4104
0.4001 809.0 3236 1.4105
0.4001 810.0 3240 1.4086
0.4001 811.0 3244 1.4171
0.4001 812.0 3248 1.4244
0.4001 813.0 3252 1.4366
0.4001 814.0 3256 1.4451
0.4001 815.0 3260 1.4541
0.4001 816.0 3264 1.4575
0.4001 817.0 3268 1.4570
0.4001 818.0 3272 1.4586
0.4001 819.0 3276 1.4581
0.4001 820.0 3280 1.4575
0.4001 821.0 3284 1.4642
0.4001 822.0 3288 1.4704
0.4001 823.0 3292 1.4701
0.4001 824.0 3296 1.4691
0.4001 825.0 3300 1.4685
0.4001 826.0 3304 1.4694
0.4001 827.0 3308 1.4711
0.4001 828.0 3312 1.4721
0.4001 829.0 3316 1.4674
0.4001 830.0 3320 1.4622
0.4001 831.0 3324 1.4573
0.4001 832.0 3328 1.4482
0.4001 833.0 3332 1.4359
0.4001 834.0 3336 1.4232
0.4001 835.0 3340 1.4170
0.4001 836.0 3344 1.4198
0.4001 837.0 3348 1.4202
0.4001 838.0 3352 1.4206
0.4001 839.0 3356 1.4196
0.4001 840.0 3360 1.4172
0.4001 841.0 3364 1.4142
0.4001 842.0 3368 1.4109
0.4001 843.0 3372 1.4082
0.4001 844.0 3376 1.4037
0.4001 845.0 3380 1.4000
0.4001 846.0 3384 1.3975
0.4001 847.0 3388 1.3969
0.4001 848.0 3392 1.3926
0.4001 849.0 3396 1.3887
0.4001 850.0 3400 1.3847
0.4001 851.0 3404 1.4020
0.4001 852.0 3408 1.4151
0.4001 853.0 3412 1.4249
0.4001 854.0 3416 1.4314
0.4001 855.0 3420 1.4412
0.4001 856.0 3424 1.4491
0.4001 857.0 3428 1.4530
0.4001 858.0 3432 1.4535
0.4001 859.0 3436 1.4521
0.4001 860.0 3440 1.4490
0.4001 861.0 3444 1.4458
0.4001 862.0 3448 1.4413
0.4001 863.0 3452 1.4338
0.4001 864.0 3456 1.4350
0.4001 865.0 3460 1.4348
0.4001 866.0 3464 1.4384
0.4001 867.0 3468 1.4412
0.4001 868.0 3472 1.4432
0.4001 869.0 3476 1.4470
0.4001 870.0 3480 1.4485
0.4001 871.0 3484 1.4459
0.4001 872.0 3488 1.4425
0.4001 873.0 3492 1.4419
0.4001 874.0 3496 1.4416
0.4008 875.0 3500 1.4396
0.4008 876.0 3504 1.4383
0.4008 877.0 3508 1.4387
0.4008 878.0 3512 1.4361
0.4008 879.0 3516 1.4319
0.4008 880.0 3520 1.4762
0.4008 881.0 3524 1.5073
0.4008 882.0 3528 1.5217
0.4008 883.0 3532 1.5300
0.4008 884.0 3536 1.5353
0.4008 885.0 3540 1.5350
0.4008 886.0 3544 1.5311
0.4008 887.0 3548 1.5258
0.4008 888.0 3552 1.5221
0.4008 889.0 3556 1.5203
0.4008 890.0 3560 1.5197
0.4008 891.0 3564 1.5183
0.4008 892.0 3568 1.5177
0.4008 893.0 3572 1.5159
0.4008 894.0 3576 1.5123
0.4008 895.0 3580 1.5079
0.4008 896.0 3584 1.5260
0.4008 897.0 3588 1.5390
0.4008 898.0 3592 1.5489
0.4008 899.0 3596 1.5523
0.4008 900.0 3600 1.5526
0.4008 901.0 3604 1.5522
0.4008 902.0 3608 1.5511
0.4008 903.0 3612 1.5479
0.4008 904.0 3616 1.5427
0.4008 905.0 3620 1.5378
0.4008 906.0 3624 1.5306
0.4008 907.0 3628 1.5252
0.4008 908.0 3632 1.5225
0.4008 909.0 3636 1.5154
0.4008 910.0 3640 1.5022
0.4008 911.0 3644 1.4939
0.4008 912.0 3648 1.4860
0.4008 913.0 3652 1.4916
0.4008 914.0 3656 1.5086
0.4008 915.0 3660 1.5295
0.4008 916.0 3664 1.5443
0.4008 917.0 3668 1.5529
0.4008 918.0 3672 1.5568
0.4008 919.0 3676 1.5596
0.4008 920.0 3680 1.5665
0.4008 921.0 3684 1.5707
0.4008 922.0 3688 1.5719
0.4008 923.0 3692 1.5714
0.4008 924.0 3696 1.5701
0.4008 925.0 3700 1.5691
0.4008 926.0 3704 1.5677
0.4008 927.0 3708 1.5668
0.4008 928.0 3712 1.5655
0.4008 929.0 3716 1.5664
0.4008 930.0 3720 1.5673
0.4008 931.0 3724 1.5684
0.4008 932.0 3728 1.5683
0.4008 933.0 3732 1.5666
0.4008 934.0 3736 1.5654
0.4008 935.0 3740 1.5645
0.4008 936.0 3744 1.5642
0.4008 937.0 3748 1.5630
0.4008 938.0 3752 1.5632
0.4008 939.0 3756 1.5597
0.4008 940.0 3760 1.5533
0.4008 941.0 3764 1.5481
0.4008 942.0 3768 1.5442
0.4008 943.0 3772 1.5421
0.4008 944.0 3776 1.5434
0.4008 945.0 3780 1.5432
0.4008 946.0 3784 1.5444
0.4008 947.0 3788 1.5443
0.4008 948.0 3792 1.5428
0.4008 949.0 3796 1.5417
0.4008 950.0 3800 1.5401
0.4008 951.0 3804 1.5420
0.4008 952.0 3808 1.5430
0.4008 953.0 3812 1.5433
0.4008 954.0 3816 1.5433
0.4008 955.0 3820 1.5433
0.4008 956.0 3824 1.5440
0.4008 957.0 3828 1.5432
0.4008 958.0 3832 1.5411
0.4008 959.0 3836 1.5377
0.4008 960.0 3840 1.5324
0.4008 961.0 3844 1.5274
0.4008 962.0 3848 1.5238
0.4008 963.0 3852 1.5214
0.4008 964.0 3856 1.5190
0.4008 965.0 3860 1.5161
0.4008 966.0 3864 1.5143
0.4008 967.0 3868 1.5137
0.4008 968.0 3872 1.5126
0.4008 969.0 3876 1.5112
0.4008 970.0 3880 1.5092
0.4008 971.0 3884 1.5077
0.4008 972.0 3888 1.5072
0.4008 973.0 3892 1.5065
0.4008 974.0 3896 1.5063
0.4008 975.0 3900 1.5062
0.4008 976.0 3904 1.5061
0.4008 977.0 3908 1.5068
0.4008 978.0 3912 1.5077
0.4008 979.0 3916 1.5080
0.4008 980.0 3920 1.5080
0.4008 981.0 3924 1.5081
0.4008 982.0 3928 1.5080
0.4008 983.0 3932 1.5086
0.4008 984.0 3936 1.5093
0.4008 985.0 3940 1.5096
0.4008 986.0 3944 1.5095
0.4008 987.0 3948 1.5096
0.4008 988.0 3952 1.5097
0.4008 989.0 3956 1.5099
0.4008 990.0 3960 1.5095
0.4008 991.0 3964 1.5092
0.4008 992.0 3968 1.5090
0.4008 993.0 3972 1.5090
0.4008 994.0 3976 1.5090
0.4008 995.0 3980 1.5090
0.4008 996.0 3984 1.5092
0.4008 997.0 3988 1.5093
0.4008 998.0 3992 1.5094
0.4008 999.0 3996 1.5095
0.4015 1000.0 4000 1.5096
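
Because the table is long, the row with the lowest validation loss is easy to miss. The sketch below parses rows in the whitespace-separated layout used above and picks that row. `parse_row` and `best_row` are illustrative names, and the embedded sample is only a handful of rows copied from the table. Scanning the rows listed above, the lowest validation loss appears to be 1.1941 at epoch 67 (step 268), well below the final 1.5096.

```python
from typing import Iterable, NamedTuple, Optional


class Row(NamedTuple):
    train_loss: Optional[float]  # None where the table says "No log"
    epoch: float
    step: int
    val_loss: float


def parse_row(line: str) -> Row:
    """Parse one table row such as 'No log 67.0 268 1.1941'."""
    parts = line.split()
    # The last three fields are always epoch, step, validation loss.
    epoch, step, val_loss = parts[-3:]
    head = " ".join(parts[:-3])
    train_loss = None if head == "No log" else float(head)
    return Row(train_loss, float(epoch), int(step), float(val_loss))


def best_row(lines: Iterable[str]) -> Row:
    """Return the row with the lowest validation loss, skipping blank lines."""
    rows = (parse_row(line) for line in lines if line.strip())
    return min(rows, key=lambda r: r.val_loss)


# A few rows copied from the table above; paste the full table to scan all of it.
SAMPLE = """\
No log 1.0 4 6.2837
No log 67.0 268 1.1941
1.1769 125.0 500 1.3018
0.4184 250.0 1000 1.4376
0.4015 1000.0 4000 1.5096
""".splitlines()

if __name__ == "__main__":
    best = best_row(SAMPLE)
    print(f"lowest validation loss {best.val_loss} at epoch {best.epoch:g} (step {best.step})")
```

The training loss column stays near 0.40 from step 1000 onward while validation loss hovers around 1.4 to 1.6, so a checkpoint from an earlier epoch may generalize better than the final one. The card does not say which checkpoint was uploaded.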

Framework versions

  • Transformers 4.36.2
  • Pytorch 2.1.2+cu121
  • Datasets 2.14.7
  • Tokenizers 0.15.0

Model size

  • Parameters: 124M
  • Tensor type: F32
  • Weights format: Safetensors