Edit model card

Research_paper_MLM_Final_Label_400k_v2

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 2.3527
  • Accuracy: 0.8610
  • F1: 0.8593
  • Precision: 0.8744
  • Recall: 0.8610

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-06
  • train_batch_size: 32
  • eval_batch_size: 64
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 100
  • num_epochs: 5
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Accuracy F1 Precision Recall
0.9494 0.01 100 0.8736 0.5144 0.4356 0.5601 0.5144
0.6266 0.02 200 0.5864 0.7277 0.7222 0.7414 0.7277
0.2747 0.02 300 0.3926 0.8644 0.8624 0.8812 0.8644
0.1963 0.03 400 0.4653 0.8693 0.8673 0.8876 0.8693
0.1517 0.04 500 0.5235 0.8678 0.8656 0.8877 0.8678
0.1335 0.05 600 0.5410 0.8712 0.8692 0.8892 0.8712
0.1286 0.06 700 0.6130 0.8652 0.8628 0.8856 0.8652
0.097 0.06 800 0.5954 0.8731 0.8717 0.8857 0.8731
0.1051 0.07 900 0.6787 0.8735 0.8717 0.8897 0.8735
0.1243 0.08 1000 0.6462 0.8724 0.8709 0.8844 0.8724
0.1007 0.09 1100 0.7043 0.8674 0.8655 0.8839 0.8674
0.0864 0.1 1200 0.7269 0.8690 0.8671 0.8850 0.8690
0.1039 0.1 1300 0.7651 0.8671 0.8656 0.8792 0.8671
0.0881 0.11 1400 0.7508 0.8693 0.8675 0.8855 0.8693
0.0947 0.12 1500 0.8400 0.8633 0.8613 0.8801 0.8633
0.1046 0.13 1600 0.8959 0.8663 0.8642 0.8849 0.8663
0.0838 0.14 1700 0.8912 0.8667 0.8648 0.8823 0.8667
0.1 0.14 1800 0.9090 0.8671 0.8653 0.8816 0.8671
0.0965 0.15 1900 0.9310 0.8640 0.8621 0.8799 0.8640
0.0995 0.16 2000 1.0193 0.8671 0.8649 0.8860 0.8671
0.1363 0.17 2100 0.9694 0.8705 0.8689 0.8844 0.8705
0.1287 0.18 2200 0.9761 0.8671 0.8654 0.8812 0.8671
0.1438 0.18 2300 1.1352 0.8637 0.8612 0.8848 0.8637
0.1081 0.19 2400 1.1154 0.8640 0.8621 0.8802 0.8640
0.088 0.2 2500 1.1275 0.8659 0.8639 0.8831 0.8659
0.1119 0.21 2600 1.2741 0.8599 0.8572 0.8826 0.8599
0.163 0.22 2700 1.2057 0.8663 0.8642 0.8847 0.8663
0.1742 0.22 2800 1.2599 0.8637 0.8614 0.8834 0.8637
0.1145 0.23 2900 1.2909 0.8618 0.8601 0.8752 0.8618
0.1451 0.24 3000 1.2917 0.8663 0.8643 0.8839 0.8663
0.1979 0.25 3100 1.2644 0.8637 0.8616 0.8804 0.8637
0.0965 0.26 3200 1.3021 0.8618 0.8599 0.8768 0.8618
0.1252 0.26 3300 1.1827 0.8663 0.8644 0.8828 0.8663
0.1343 0.27 3400 1.3309 0.8659 0.8638 0.8841 0.8659
0.1038 0.28 3500 1.3382 0.8618 0.8602 0.8743 0.8618
0.1641 0.29 3600 1.3133 0.8629 0.8612 0.8763 0.8629
0.1577 0.3 3700 1.3196 0.8633 0.8614 0.8786 0.8633
0.1354 0.3 3800 1.3578 0.8614 0.8596 0.8754 0.8614
0.1212 0.31 3900 1.4582 0.8606 0.8588 0.8755 0.8606
0.1821 0.32 4000 1.3076 0.8640 0.8621 0.8797 0.8640
0.1494 0.33 4100 1.3707 0.8644 0.8625 0.8800 0.8644
0.1252 0.34 4200 1.4447 0.8588 0.8570 0.8720 0.8588
0.1451 0.34 4300 1.4365 0.8656 0.8636 0.8820 0.8656
0.1528 0.35 4400 1.4891 0.8629 0.8611 0.8776 0.8629
0.1448 0.36 4500 1.5234 0.8618 0.8599 0.8766 0.8618
0.2075 0.37 4600 1.4235 0.8633 0.8614 0.8784 0.8633
0.1502 0.38 4700 1.5434 0.8629 0.8610 0.8786 0.8629
0.1492 0.38 4800 1.6726 0.8603 0.8583 0.8760 0.8603
0.1921 0.39 4900 1.6339 0.8633 0.8614 0.8791 0.8633
0.1515 0.4 5000 1.6651 0.8640 0.8622 0.8794 0.8640
0.2271 0.41 5100 1.7037 0.8656 0.8634 0.8841 0.8656
0.2935 0.42 5200 1.5967 0.8656 0.8638 0.8805 0.8656
0.1641 0.42 5300 1.5818 0.8656 0.8636 0.8825 0.8656
0.1563 0.43 5400 1.6788 0.8652 0.8632 0.8818 0.8652
0.2386 0.44 5500 1.6893 0.8648 0.8629 0.8802 0.8648
0.1644 0.45 5600 1.7304 0.8637 0.8618 0.8787 0.8637
0.1917 0.46 5700 1.7029 0.8663 0.8646 0.8802 0.8663
0.1534 0.46 5800 1.8255 0.8652 0.8633 0.8808 0.8652
0.2468 0.47 5900 1.7937 0.8629 0.8610 0.8781 0.8629
0.1443 0.48 6000 1.8242 0.8629 0.8612 0.8769 0.8629
0.1968 0.49 6100 1.8104 0.8637 0.8619 0.8780 0.8637
0.2546 0.5 6200 1.8293 0.8640 0.8621 0.8802 0.8640
0.1756 0.5 6300 1.9527 0.8629 0.8610 0.8786 0.8629
0.277 0.51 6400 1.9285 0.8603 0.8586 0.8729 0.8603
0.2327 0.52 6500 1.9621 0.8629 0.8610 0.8786 0.8629
0.2526 0.53 6600 1.9757 0.8633 0.8614 0.8791 0.8633
0.2439 0.54 6700 2.0490 0.8599 0.8582 0.8731 0.8599
0.2546 0.54 6800 1.9591 0.8637 0.8619 0.8777 0.8637
0.2197 0.55 6900 2.0416 0.8659 0.8641 0.8813 0.8659
0.2329 0.56 7000 2.0248 0.8640 0.8622 0.8787 0.8640
0.2158 0.57 7100 2.0538 0.8637 0.8618 0.8784 0.8637
0.2372 0.58 7200 2.0786 0.8622 0.8601 0.8791 0.8622
0.247 0.58 7300 2.0921 0.8622 0.8601 0.8786 0.8622
0.2151 0.59 7400 2.0345 0.8652 0.8635 0.8784 0.8652
0.1967 0.6 7500 2.1306 0.8610 0.8589 0.8780 0.8610
0.1728 0.61 7600 2.0889 0.8644 0.8625 0.8805 0.8644
0.2496 0.62 7700 2.1031 0.8629 0.8611 0.8772 0.8629
0.2479 0.62 7800 2.0390 0.8614 0.8597 0.8747 0.8614
0.2503 0.63 7900 2.0854 0.8652 0.8633 0.8810 0.8652
0.2469 0.64 8000 2.1326 0.8599 0.8582 0.8731 0.8599
0.2286 0.65 8100 2.0677 0.8648 0.8629 0.8807 0.8648
0.2223 0.66 8200 2.0680 0.8625 0.8607 0.8776 0.8625
0.248 0.66 8300 2.0952 0.8640 0.8622 0.8794 0.8640
0.2402 0.67 8400 2.1070 0.8629 0.8609 0.8799 0.8629
0.2417 0.68 8500 2.0570 0.8652 0.8634 0.8795 0.8652
0.2305 0.69 8600 2.1236 0.8606 0.8591 0.8728 0.8606
0.184 0.7 8700 2.1277 0.8629 0.8611 0.8779 0.8629
0.1861 0.7 8800 2.1695 0.8640 0.8621 0.8802 0.8640
0.2246 0.71 8900 2.1301 0.8644 0.8626 0.8790 0.8644
0.2213 0.72 9000 2.1211 0.8629 0.8609 0.8791 0.8629
0.2087 0.73 9100 2.1119 0.8663 0.8647 0.8792 0.8663
0.2665 0.74 9200 2.1326 0.8644 0.8627 0.8788 0.8644
0.1916 0.74 9300 2.1792 0.8603 0.8581 0.8780 0.8603
0.1771 0.75 9400 2.1212 0.8652 0.8634 0.8800 0.8652
0.2424 0.76 9500 2.0911 0.8644 0.8626 0.8795 0.8644
0.2198 0.77 9600 2.1190 0.8633 0.8615 0.8782 0.8633
0.1696 0.78 9700 2.1271 0.8644 0.8627 0.8785 0.8644
0.181 0.78 9800 2.1595 0.8633 0.8614 0.8784 0.8633
0.2164 0.79 9900 2.1076 0.8640 0.8624 0.8776 0.8640
0.193 0.8 10000 2.2109 0.8622 0.8602 0.8783 0.8622
0.2687 0.81 10100 2.0889 0.8640 0.8623 0.8778 0.8640
0.1753 0.82 10200 2.1338 0.8629 0.8612 0.8769 0.8629
0.2145 0.82 10300 2.1685 0.8618 0.8598 0.8778 0.8618
0.2245 0.83 10400 2.1522 0.8633 0.8614 0.8791 0.8633
0.2426 0.84 10500 2.1791 0.8618 0.8598 0.8778 0.8618
0.1354 0.85 10600 2.1929 0.8637 0.8615 0.8815 0.8637
0.2212 0.86 10700 2.2136 0.8618 0.8598 0.8780 0.8618
0.1629 0.86 10800 2.2222 0.8640 0.8622 0.8790 0.8640
0.1885 0.87 10900 2.1754 0.8637 0.8618 0.8787 0.8637
0.1804 0.88 11000 2.1812 0.8659 0.8641 0.8808 0.8659
0.1578 0.89 11100 2.2018 0.8629 0.8611 0.8774 0.8629
0.28 0.9 11200 2.2446 0.8633 0.8616 0.8772 0.8633
0.343 0.9 11300 2.1378 0.8625 0.8609 0.8753 0.8625
0.2371 0.91 11400 2.2008 0.8633 0.8615 0.8777 0.8633
0.2227 0.92 11500 2.2469 0.8618 0.8600 0.8754 0.8618
0.2136 0.93 11600 2.2088 0.8614 0.8598 0.8740 0.8614
0.1769 0.94 11700 2.2464 0.8599 0.8582 0.8731 0.8599
0.1731 0.94 11800 2.2234 0.8614 0.8598 0.8736 0.8614
0.1478 0.95 11900 2.2484 0.8622 0.8605 0.8755 0.8622
0.1965 0.96 12000 2.2631 0.8606 0.8587 0.8760 0.8606
0.1724 0.97 12100 2.2277 0.8629 0.8612 0.8767 0.8629
0.1743 0.98 12200 2.2287 0.8625 0.8607 0.8769 0.8625
0.2682 0.98 12300 2.2316 0.8610 0.8590 0.8770 0.8610
0.1671 0.99 12400 2.2474 0.8622 0.8604 0.8762 0.8622
0.187 1.0 12500 2.2459 0.8633 0.8614 0.8789 0.8633
0.2311 1.01 12600 2.2434 0.8622 0.8603 0.8766 0.8622
0.2085 1.02 12700 2.2845 0.8618 0.8599 0.8768 0.8618
0.1421 1.02 12800 2.2292 0.8625 0.8607 0.8771 0.8625
0.2476 1.03 12900 2.2084 0.8629 0.8610 0.8781 0.8629
0.2092 1.04 13000 2.2049 0.8622 0.8602 0.8776 0.8622
0.1621 1.05 13100 2.2588 0.8603 0.8586 0.8734 0.8603
0.2178 1.06 13200 2.2491 0.8595 0.8580 0.8711 0.8595
0.1559 1.06 13300 2.3370 0.8584 0.8568 0.8702 0.8584
0.235 1.07 13400 2.3373 0.8603 0.8583 0.8760 0.8603
0.1646 1.08 13500 2.2559 0.8618 0.8599 0.8766 0.8618
0.1687 1.09 13600 2.3108 0.8618 0.8599 0.8771 0.8618
0.2368 1.1 13700 2.2649 0.8618 0.8600 0.8761 0.8618
0.1429 1.1 13800 2.3160 0.8614 0.8597 0.8751 0.8614
0.2345 1.11 13900 2.3249 0.8599 0.8581 0.8738 0.8599
0.18 1.12 14000 2.2549 0.8606 0.8587 0.8762 0.8606
0.1592 1.13 14100 2.2581 0.8622 0.8604 0.8762 0.8622
0.244 1.14 14200 2.2547 0.8625 0.8607 0.8774 0.8625
0.1482 1.14 14300 2.2739 0.8614 0.8594 0.8773 0.8614
0.1872 1.15 14400 2.2612 0.8614 0.8595 0.8768 0.8614
0.1457 1.16 14500 2.2575 0.8629 0.8610 0.8781 0.8629
0.1829 1.17 14600 2.2488 0.8610 0.8592 0.8755 0.8610
0.1839 1.18 14700 2.2327 0.8618 0.8600 0.8759 0.8618
0.1211 1.18 14800 2.2643 0.8614 0.8596 0.8754 0.8614
0.2126 1.19 14900 2.2206 0.8618 0.8600 0.8761 0.8618
0.1645 1.2 15000 2.2473 0.8633 0.8615 0.8777 0.8633
0.2154 1.21 15100 2.2857 0.8629 0.8611 0.8776 0.8629
0.1771 1.22 15200 2.2785 0.8606 0.8590 0.8732 0.8606
0.1715 1.22 15300 2.2312 0.8618 0.8601 0.8747 0.8618
0.1862 1.23 15400 2.2762 0.8614 0.8595 0.8770 0.8614
0.2334 1.24 15500 2.2271 0.8644 0.8627 0.8788 0.8644
0.2085 1.25 15600 2.2593 0.8618 0.8600 0.8761 0.8618
0.1895 1.26 15700 2.2594 0.8637 0.8619 0.8777 0.8637
0.2338 1.26 15800 2.2837 0.8614 0.8597 0.8751 0.8614
0.2084 1.27 15900 2.2176 0.8618 0.8601 0.8747 0.8618
0.2107 1.28 16000 2.2536 0.8610 0.8592 0.8751 0.8610
0.1682 1.29 16100 2.2652 0.8614 0.8597 0.8745 0.8614
0.175 1.3 16200 2.2509 0.8614 0.8597 0.8747 0.8614
0.2036 1.3 16300 2.2475 0.8606 0.8589 0.8739 0.8606
0.1902 1.31 16400 2.2726 0.8622 0.8605 0.8748 0.8622
0.211 1.32 16500 2.2659 0.8618 0.8600 0.8754 0.8618
0.1791 1.33 16600 2.2408 0.8622 0.8606 0.8737 0.8622
0.1741 1.34 16700 2.3162 0.8599 0.8584 0.8712 0.8599
0.212 1.34 16800 2.3139 0.8618 0.8600 0.8756 0.8618
0.2437 1.35 16900 2.2578 0.8625 0.8609 0.8753 0.8625
0.2018 1.36 17000 2.3055 0.8606 0.8590 0.8732 0.8606
0.2045 1.37 17100 2.2606 0.8618 0.8601 0.8752 0.8618
0.2036 1.38 17200 2.2477 0.8606 0.8590 0.8732 0.8606
0.1797 1.38 17300 2.2420 0.8610 0.8594 0.8735 0.8610
0.2031 1.39 17400 2.3146 0.8614 0.8597 0.8747 0.8614
0.1626 1.4 17500 2.2869 0.8625 0.8607 0.8769 0.8625
0.2573 1.41 17600 2.2997 0.8606 0.8590 0.8732 0.8606
0.1446 1.42 17700 2.3100 0.8595 0.8578 0.8730 0.8595
0.1898 1.42 17800 2.2288 0.8629 0.8613 0.8754 0.8629
0.2456 1.43 17900 2.2901 0.8603 0.8586 0.8729 0.8603
0.1586 1.44 18000 2.2949 0.8603 0.8586 0.8734 0.8603
0.1511 1.45 18100 2.2993 0.8603 0.8586 0.8734 0.8603
0.151 1.46 18200 2.2904 0.8588 0.8570 0.8718 0.8588
0.1978 1.46 18300 2.2994 0.8591 0.8575 0.8719 0.8591
0.1914 1.47 18400 2.2923 0.8606 0.8589 0.8743 0.8606
0.1333 1.48 18500 2.3371 0.8588 0.8571 0.8716 0.8588
0.1946 1.49 18600 2.2922 0.8606 0.8588 0.8750 0.8606
0.2382 1.5 18700 2.2843 0.8603 0.8586 0.8729 0.8603
0.1299 1.5 18800 2.2607 0.8603 0.8585 0.8738 0.8603
0.1677 1.51 18900 2.3156 0.8610 0.8593 0.8746 0.8610
0.146 1.52 19000 2.2914 0.8610 0.8592 0.8753 0.8610
0.1716 1.53 19100 2.3144 0.8599 0.8580 0.8747 0.8599
0.1753 1.54 19200 2.2928 0.8622 0.8602 0.8776 0.8622
0.145 1.54 19300 2.2965 0.8610 0.8591 0.8765 0.8610
0.2447 1.55 19400 2.2429 0.8614 0.8595 0.8763 0.8614
0.2159 1.56 19500 2.2536 0.8606 0.8590 0.8737 0.8606
0.1941 1.57 19600 2.2970 0.8595 0.8578 0.8726 0.8595
0.0558 1.58 19700 2.3184 0.8614 0.8596 0.8754 0.8614
0.1884 1.58 19800 2.3360 0.8584 0.8565 0.8727 0.8584
0.1889 1.59 19900 2.2758 0.8591 0.8574 0.8721 0.8591
0.1889 1.6 20000 2.2841 0.8599 0.8582 0.8731 0.8599
0.174 1.61 20100 2.3137 0.8588 0.8570 0.8718 0.8588
0.1344 1.62 20200 2.2792 0.8614 0.8597 0.8742 0.8614
0.1469 1.62 20300 2.2522 0.8606 0.8591 0.8726 0.8606
0.2515 1.63 20400 2.3051 0.8614 0.8596 0.8758 0.8614
0.1137 1.64 20500 2.3196 0.8595 0.8579 0.8719 0.8595
0.2077 1.65 20600 2.2956 0.8591 0.8575 0.8714 0.8591
0.2118 1.66 20700 2.3010 0.8606 0.8589 0.8743 0.8606
0.1813 1.66 20800 2.2543 0.8610 0.8593 0.8749 0.8610
0.1543 1.67 20900 2.2966 0.8606 0.8587 0.8757 0.8606
0.1499 1.68 21000 2.2502 0.8606 0.8590 0.8734 0.8606
0.1875 1.69 21100 2.2973 0.8618 0.8599 0.8763 0.8618
0.227 1.7 21200 2.2575 0.8606 0.8589 0.8743 0.8606
0.2004 1.7 21300 2.2163 0.8618 0.8601 0.8750 0.8618
0.1592 1.71 21400 2.2668 0.8614 0.8597 0.8749 0.8614
0.1827 1.72 21500 2.2783 0.8622 0.8605 0.8752 0.8622
0.1806 1.73 21600 2.2336 0.8606 0.8589 0.8743 0.8606
0.1381 1.74 21700 2.2785 0.8618 0.8601 0.8747 0.8618
0.2109 1.74 21800 2.2627 0.8640 0.8624 0.8773 0.8640
0.2175 1.75 21900 2.2620 0.8629 0.8612 0.8763 0.8629
0.1265 1.76 22000 2.3233 0.8614 0.8597 0.8745 0.8614
0.1779 1.77 22100 2.3258 0.8610 0.8594 0.8733 0.8610
0.1492 1.78 22200 2.3090 0.8614 0.8598 0.8740 0.8614
0.2162 1.78 22300 2.2683 0.8614 0.8599 0.8732 0.8614
0.1461 1.79 22400 2.3485 0.8603 0.8585 0.8743 0.8603
0.1814 1.8 22500 2.2756 0.8618 0.8601 0.8747 0.8618
0.2056 1.81 22600 2.3039 0.8599 0.8580 0.8742 0.8599
0.1752 1.82 22700 2.3411 0.8588 0.8569 0.8732 0.8588
0.1703 1.82 22800 2.3312 0.8591 0.8575 0.8717 0.8591
0.1948 1.83 22900 2.3066 0.8595 0.8578 0.8726 0.8595
0.196 1.84 23000 2.3609 0.8591 0.8575 0.8712 0.8591
0.154 1.85 23100 2.3389 0.8588 0.8571 0.8709 0.8588
0.1699 1.86 23200 2.3253 0.8599 0.8583 0.8722 0.8599
0.1633 1.86 23300 2.3693 0.8576 0.8558 0.8719 0.8576
0.1504 1.87 23400 2.3270 0.8599 0.8581 0.8733 0.8599
0.2036 1.88 23500 2.3290 0.8591 0.8574 0.8725 0.8591
0.1523 1.89 23600 2.3689 0.8599 0.8581 0.8738 0.8599
0.1951 1.9 23700 2.2765 0.8606 0.8590 0.8732 0.8606
0.1382 1.9 23800 2.3362 0.8599 0.8582 0.8729 0.8599
0.1118 1.91 23900 2.3666 0.8595 0.8578 0.8724 0.8595
0.2331 1.92 24000 2.3684 0.8595 0.8578 0.8728 0.8595
0.2376 1.93 24100 2.3604 0.8599 0.8581 0.8740 0.8599
0.2144 1.94 24200 2.2634 0.8614 0.8598 0.8734 0.8614
0.1365 1.94 24300 2.3800 0.8588 0.8569 0.8729 0.8588
0.148 1.95 24400 2.3321 0.8595 0.8579 0.8717 0.8595
0.1822 1.96 24500 2.3240 0.8599 0.8580 0.8745 0.8599
0.1875 1.97 24600 2.3408 0.8599 0.8582 0.8731 0.8599
0.1956 1.98 24700 2.3142 0.8606 0.8590 0.8737 0.8606
0.1704 1.98 24800 2.3011 0.8622 0.8603 0.8766 0.8622
0.2133 1.99 24900 2.3037 0.8610 0.8594 0.8737 0.8610
0.1373 2.0 25000 2.2866 0.8618 0.8602 0.8743 0.8618
0.1834 2.01 25100 2.3310 0.8603 0.8586 0.8732 0.8603
0.2341 2.02 25200 2.2910 0.8606 0.8590 0.8730 0.8606
0.188 2.02 25300 2.2973 0.8610 0.8594 0.8733 0.8610
0.1795 2.03 25400 2.2980 0.8614 0.8597 0.8747 0.8614
0.1217 2.04 25500 2.3586 0.8610 0.8593 0.8746 0.8610
0.1521 2.05 25600 2.3181 0.8610 0.8594 0.8733 0.8610
0.1713 2.06 25700 2.3195 0.8595 0.8579 0.8717 0.8595
0.1543 2.06 25800 2.3433 0.8588 0.8571 0.8714 0.8588
0.1276 2.07 25900 2.3511 0.8588 0.8571 0.8714 0.8588
0.1927 2.08 26000 2.3367 0.8584 0.8567 0.8711 0.8584
0.1687 2.09 26100 2.3132 0.8606 0.8590 0.8737 0.8606
0.182 2.1 26200 2.3801 0.8591 0.8575 0.8717 0.8591
0.1969 2.1 26300 2.3655 0.8610 0.8592 0.8751 0.8610
0.1285 2.11 26400 2.3745 0.8599 0.8581 0.8736 0.8599
0.1332 2.12 26500 2.3669 0.8603 0.8585 0.8736 0.8603
0.0973 2.13 26600 2.3738 0.8595 0.8577 0.8740 0.8595
0.1619 2.14 26700 2.3852 0.8584 0.8567 0.8709 0.8584
0.1902 2.14 26800 2.3698 0.8591 0.8575 0.8719 0.8591
0.1587 2.15 26900 2.3686 0.8591 0.8574 0.8728 0.8591
0.1653 2.16 27000 2.3625 0.8591 0.8574 0.8723 0.8591
0.123 2.17 27100 2.3334 0.8588 0.8571 0.8709 0.8588
0.1623 2.18 27200 2.3450 0.8614 0.8595 0.8763 0.8614
0.2265 2.18 27300 2.2938 0.8591 0.8576 0.8706 0.8591
0.147 2.19 27400 2.3616 0.8599 0.8582 0.8729 0.8599
0.154 2.2 27500 2.3458 0.8606 0.8589 0.8739 0.8606
0.174 2.21 27600 2.3778 0.8591 0.8576 0.8708 0.8591
0.1132 2.22 27700 2.3620 0.8603 0.8586 0.8729 0.8603
0.1243 2.22 27800 2.4019 0.8588 0.8571 0.8712 0.8588
0.1529 2.23 27900 2.3913 0.8584 0.8567 0.8709 0.8584
0.1056 2.24 28000 2.3855 0.8603 0.8586 0.8734 0.8603
0.1316 2.25 28100 2.4150 0.8595 0.8578 0.8728 0.8595
0.1121 2.26 28200 2.3840 0.8584 0.8566 0.8717 0.8584
0.2022 2.26 28300 2.3457 0.8591 0.8575 0.8710 0.8591
0.1859 2.27 28400 2.3294 0.8595 0.8578 0.8722 0.8595
0.1638 2.28 28500 2.3811 0.8565 0.8549 0.8684 0.8565
0.1461 2.29 28600 2.4119 0.8554 0.8536 0.8679 0.8554
0.1758 2.3 28700 2.3863 0.8576 0.8560 0.8694 0.8576
0.1637 2.3 28800 2.4175 0.8573 0.8556 0.8692 0.8573
0.1902 2.31 28900 2.3778 0.8591 0.8574 0.8721 0.8591
0.0955 2.32 29000 2.3967 0.8588 0.8570 0.8723 0.8588
0.1419 2.33 29100 2.3850 0.8584 0.8566 0.8715 0.8584
0.1614 2.34 29200 2.3960 0.8584 0.8566 0.8722 0.8584
0.2069 2.34 29300 2.3357 0.8584 0.8567 0.8709 0.8584
0.1547 2.35 29400 2.3682 0.8565 0.8547 0.8699 0.8565
0.1487 2.36 29500 2.3847 0.8573 0.8556 0.8694 0.8573
0.1044 2.37 29600 2.4106 0.8584 0.8567 0.8713 0.8584
0.2181 2.38 29700 2.3857 0.8576 0.8559 0.8703 0.8576
0.2294 2.38 29800 2.3231 0.8591 0.8576 0.8702 0.8591
0.1921 2.39 29900 2.3138 0.8606 0.8590 0.8734 0.8606
0.2334 2.4 30000 2.2727 0.8606 0.8588 0.8748 0.8606
0.1703 2.41 30100 2.3384 0.8591 0.8574 0.8721 0.8591
0.1713 2.42 30200 2.3356 0.8610 0.8593 0.8746 0.8610
0.1686 2.42 30300 2.3269 0.8603 0.8584 0.8748 0.8603
0.151 2.43 30400 2.3478 0.8599 0.8582 0.8727 0.8599
0.1872 2.44 30500 2.3963 0.8573 0.8554 0.8711 0.8573
0.1196 2.45 30600 2.3817 0.8595 0.8578 0.8726 0.8595
0.1545 2.46 30700 2.3864 0.8580 0.8563 0.8708 0.8580
0.193 2.46 30800 2.3527 0.8610 0.8593 0.8744 0.8610

Framework versions

  • Transformers 4.37.0
  • Pytorch 2.1.0+cu121
  • Datasets 2.16.1
  • Tokenizers 0.15.1
Downloads last month
6
Safetensors
Model size
125M params
Tensor type
F32
·
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.