dahe827/mpnet-base-airlines-news-multi-label

@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/mpnet-base](https://huggingface.co/microsoft/mpnet-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2478
-- F1: 0.8938
-- Roc Auc: 0.6465
 ## Model description
@@ -43,52 +43,77 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 40
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     | Roc Auc |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| No log        | 1.0   | 57   | 0.3726          | 0.8319 | 0.5     |
-| No log        | 2.0   | 114  | 0.3361          | 0.8319 | 0.5     |
-| No log        | 3.0   | 171  | 0.3303          | 0.8319 | 0.5     |
-| No log        | 4.0   | 228  | 0.3249          | 0.8319 | 0.5     |
-| No log        | 5.0   | 285  | 0.3188          | 0.8319 | 0.5     |
-| No log        | 6.0   | 342  | 0.3141          | 0.8319 | 0.5     |
-| No log        | 7.0   | 399  | 0.3089          | 0.8319 | 0.5     |
-| No log        | 8.0   | 456  | 0.3042          | 0.8319 | 0.5     |
-| 0.3595        | 9.0   | 513  | 0.2997          | 0.8319 | 0.5     |
-| 0.3595        | 10.0  | 570  | 0.2940          | 0.8319 | 0.5     |
-| 0.3595        | 11.0  | 627  | 0.2898          | 0.8319 | 0.5     |
-| 0.3595        | 12.0  | 684  | 0.2856          | 0.8463 | 0.5032  |
-| 0.3595        | 13.0  | 741  | 0.2819          | 0.8593 | 0.5096  |
-| 0.3595        | 14.0  | 798  | 0.2789          | 0.8600 | 0.5128  |
-| 0.3595        | 15.0  | 855  | 0.2757          | 0.8701 | 0.5220  |
-| 0.3595        | 16.0  | 912  | 0.2723          | 0.8733 | 0.5312  |
-| 0.3595        | 17.0  | 969  | 0.2698          | 0.8733 | 0.5312  |
-| 0.2983        | 18.0  | 1026 | 0.2670          | 0.8808 | 0.5629  |
-| 0.2983        | 19.0  | 1083 | 0.2652          | 0.8814 | 0.5661  |
-| 0.2983        | 20.0  | 1140 | 0.2630          | 0.8786 | 0.5744  |
-| 0.2983        | 21.0  | 1197 | 0.2612          | 0.8807 | 0.5840  |
-| 0.2983        | 22.0  | 1254 | 0.2596          | 0.8818 | 0.5900  |
-| 0.2983        | 23.0  | 1311 | 0.2580          | 0.8841 | 0.6024  |
-| 0.2983        | 24.0  | 1368 | 0.2562          | 0.8878 | 0.6153  |
-| 0.2983        | 25.0  | 1425 | 0.2555          | 0.8851 | 0.6056  |
-| 0.2983        | 26.0  | 1482 | 0.2544          | 0.8860 | 0.6088  |
-| 0.2747        | 27.0  | 1539 | 0.2535          | 0.8868 | 0.6148  |
-| 0.2747        | 28.0  | 1596 | 0.2527          | 0.8878 | 0.6153  |
-| 0.2747        | 29.0  | 1653 | 0.2519          | 0.8869 | 0.6121  |
-| 0.2747        | 30.0  | 1710 | 0.2512          | 0.8875 | 0.6180  |
-| 0.2747        | 31.0  | 1767 | 0.2501          | 0.8900 | 0.6277  |
-| 0.2747        | 32.0  | 1824 | 0.2495          | 0.8923 | 0.6401  |
-| 0.2747        | 33.0  | 1881 | 0.2492          | 0.8907 | 0.6337  |
-| 0.2747        | 34.0  | 1938 | 0.2488          | 0.8922 | 0.6401  |
-| 0.2747        | 35.0  | 1995 | 0.2485          | 0.8915 | 0.6369  |
-| 0.2633        | 36.0  | 2052 | 0.2480          | 0.8922 | 0.6401  |
-| 0.2633        | 37.0  | 2109 | 0.2478          | 0.8938 | 0.6465  |
-| 0.2633        | 38.0  | 2166 | 0.2477          | 0.8930 | 0.6433  |
-| 0.2633        | 39.0  | 2223 | 0.2476          | 0.8938 | 0.6465  |
-| 0.2633        | 40.0  | 2280 | 0.2476          | 0.8938 | 0.6465  |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/mpnet-base](https://huggingface.co/microsoft/mpnet-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2601
+- F1: 0.8921
+- Roc Auc: 0.6253
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 65
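
The Step column of the training results advances by 57 per epoch, so the two runs end at 40 × 57 = 2280 and 65 × 57 = 3705 steps. The Training Loss column first fills in at epoch 9 (step 513) and changes again at epochs 18, 27, 36, 44, 53 and 62. That pattern is consistent with a training loss logged every 500 steps. The 500-step interval is an assumption inferred from the table, not something the card states. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
import math

STEPS_PER_EPOCH = 57     # read off the Step column: step 57 at epoch 1.0
LOGGING_INTERVAL = 500   # assumption: training loss is logged every 500 steps


def total_steps(num_epochs, steps_per_epoch=STEPS_PER_EPOCH):
    """Total optimizer steps for a run of `num_epochs` epochs."""
    return num_epochs * steps_per_epoch


def first_epoch_showing_log(k, steps_per_epoch=STEPS_PER_EPOCH,
                            interval=LOGGING_INTERVAL):
    """Epoch whose evaluation row is the first to show the k-th logged loss."""
    return math.ceil(k * interval / steps_per_epoch)


if __name__ == "__main__":
    print(total_steps(40), total_steps(65))
    print([first_epoch_showing_log(k) for k in range(1, 8)])
```

Under these assumptions the "No log" rows for epochs 1 to 8 simply mean that step 500 had not been reached yet, not that training failed to produce a loss.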
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     | Roc Auc |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+| No log        | 1.0   | 57   | 0.3852          | 0.8161 | 0.5     |
+| No log        | 2.0   | 114  | 0.3612          | 0.8161 | 0.5     |
+| No log        | 3.0   | 171  | 0.3569          | 0.8161 | 0.5     |
+| No log        | 4.0   | 228  | 0.3515          | 0.8161 | 0.5     |
+| No log        | 5.0   | 285  | 0.3453          | 0.8161 | 0.5     |
+| No log        | 6.0   | 342  | 0.3403          | 0.8161 | 0.5     |
+| No log        | 7.0   | 399  | 0.3345          | 0.8161 | 0.5     |
+| No log        | 8.0   | 456  | 0.3292          | 0.8161 | 0.5     |
+| 0.3585        | 9.0   | 513  | 0.3252          | 0.8161 | 0.5     |
+| 0.3585        | 10.0  | 570  | 0.3175          | 0.8161 | 0.5     |
+| 0.3585        | 11.0  | 627  | 0.3129          | 0.8161 | 0.5     |
+| 0.3585        | 12.0  | 684  | 0.3076          | 0.8351 | 0.5029  |
+| 0.3585        | 13.0  | 741  | 0.3024          | 0.8425 | 0.5109  |
+| 0.3585        | 14.0  | 798  | 0.2995          | 0.8516 | 0.5163  |
+| 0.3585        | 15.0  | 855  | 0.2953          | 0.8528 | 0.5221  |
+| 0.3585        | 16.0  | 912  | 0.2904          | 0.8744 | 0.5426  |
+| 0.3585        | 17.0  | 969  | 0.2875          | 0.8738 | 0.5451  |
+| 0.2943        | 18.0  | 1026 | 0.2835          | 0.8833 | 0.5798  |
+| 0.2943        | 19.0  | 1083 | 0.2811          | 0.8799 | 0.5710  |
+| 0.2943        | 20.0  | 1140 | 0.2786          | 0.8815 | 0.5873  |
+| 0.2943        | 21.0  | 1197 | 0.2761          | 0.8815 | 0.5873  |
+| 0.2943        | 22.0  | 1254 | 0.2750          | 0.8838 | 0.5906  |
+| 0.2943        | 23.0  | 1311 | 0.2705          | 0.8905 | 0.6194  |
+| 0.2943        | 24.0  | 1368 | 0.2687          | 0.8911 | 0.6224  |
+| 0.2943        | 25.0  | 1425 | 0.2674          | 0.8895 | 0.6165  |
+| 0.2943        | 26.0  | 1482 | 0.2652          | 0.8911 | 0.6224  |
+| 0.2666        | 27.0  | 1539 | 0.2642          | 0.8911 | 0.6224  |
+| 0.2666        | 28.0  | 1596 | 0.2634          | 0.8903 | 0.6194  |
+| 0.2666        | 29.0  | 1653 | 0.2612          | 0.8903 | 0.6194  |
+| 0.2666        | 30.0  | 1710 | 0.2601          | 0.8921 | 0.6253  |
+| 0.2666        | 31.0  | 1767 | 0.2583          | 0.8913 | 0.6328  |
+| 0.2666        | 32.0  | 1824 | 0.2568          | 0.8864 | 0.6319  |
+| 0.2666        | 33.0  | 1881 | 0.2563          | 0.8861 | 0.6319  |
+| 0.2666        | 34.0  | 1938 | 0.2552          | 0.8869 | 0.6349  |
+| 0.2666        | 35.0  | 1995 | 0.2544          | 0.8884 | 0.6378  |
+| 0.2516        | 36.0  | 2052 | 0.2530          | 0.8875 | 0.6374  |
+| 0.2516        | 37.0  | 2109 | 0.2523          | 0.8876 | 0.6374  |
+| 0.2516        | 38.0  | 2166 | 0.2514          | 0.8889 | 0.6432  |
+| 0.2516        | 39.0  | 2223 | 0.2504          | 0.8874 | 0.6453  |
+| 0.2516        | 40.0  | 2280 | 0.2502          | 0.8892 | 0.6432  |
+| 0.2516        | 41.0  | 2337 | 0.2495          | 0.8862 | 0.6419  |
+| 0.2516        | 42.0  | 2394 | 0.2490          | 0.8867 | 0.6445  |
+| 0.2516        | 43.0  | 2451 | 0.2491          | 0.8859 | 0.6365  |
+| 0.2442        | 44.0  | 2508 | 0.2480          | 0.8906 | 0.6511  |
+| 0.2442        | 45.0  | 2565 | 0.2476          | 0.8894 | 0.6457  |
+| 0.2442        | 46.0  | 2622 | 0.2476          | 0.8888 | 0.6478  |
+| 0.2442        | 47.0  | 2679 | 0.2474          | 0.8906 | 0.6511  |
+| 0.2442        | 48.0  | 2736 | 0.2462          | 0.8890 | 0.6507  |
+| 0.2442        | 49.0  | 2793 | 0.2461          | 0.8920 | 0.6545  |
+| 0.2442        | 50.0  | 2850 | 0.2455          | 0.8894 | 0.6532  |
+| 0.2442        | 51.0  | 2907 | 0.2457          | 0.8897 | 0.6507  |
+| 0.2442        | 52.0  | 2964 | 0.2452          | 0.8894 | 0.6532  |
+| 0.238         | 53.0  | 3021 | 0.2449          | 0.8903 | 0.6536  |
+| 0.238         | 54.0  | 3078 | 0.2447          | 0.8894 | 0.6532  |
+| 0.238         | 55.0  | 3135 | 0.2446          | 0.8894 | 0.6532  |
+| 0.238         | 56.0  | 3192 | 0.2446          | 0.8904 | 0.6536  |
+| 0.238         | 57.0  | 3249 | 0.2443          | 0.8894 | 0.6532  |
+| 0.238         | 58.0  | 3306 | 0.2441          | 0.8894 | 0.6532  |
+| 0.238         | 59.0  | 3363 | 0.2440          | 0.8911 | 0.6566  |
+| 0.238         | 60.0  | 3420 | 0.2440          | 0.8911 | 0.6566  |
+| 0.238         | 61.0  | 3477 | 0.2439          | 0.8903 | 0.6536  |
+| 0.2353        | 62.0  | 3534 | 0.2437          | 0.8911 | 0.6566  |
+| 0.2353        | 63.0  | 3591 | 0.2438          | 0.8911 | 0.6566  |
+| 0.2353        | 64.0  | 3648 | 0.2437          | 0.8911 | 0.6566  |
+| 0.2353        | 65.0  | 3705 | 0.2437          | 0.8911 | 0.6566  |
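
The first eleven rows of the table pair a fixed F1 (0.8161) with a Roc Auc of exactly 0.5. The card does not say how either metric was computed, so the following is only one possible reading: if the metrics are taken on thresholded 0/1 predictions, a predictor that outputs the same value for every example gets an ROC AUC of 0.5 for that label, while its F1 can still be high when positives dominate. The pure-Python sketch below illustrates this on made-up labels; the function names and the data are hypothetical and not taken from the training script.

```python
def confusion(y_true, y_pred):
    """Return (tp, fp, fn, tn) for two equal-length lists of 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def f1_score_binary(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    tp, fp, fn, _ = confusion(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc_hard(y_true, y_pred):
    """ROC AUC of hard 0/1 predictions, which reduces to (TPR + TNR) / 2.

    Requires at least one positive and one negative in y_true.
    """
    tp, fp, fn, tn = confusion(y_true, y_pred)
    tpr = tp / (tp + fn)
    tnr = tn / (tn + fp)
    return (tpr + tnr) / 2


# Made-up example: 7 positives, 3 negatives, predictor always says "1".
y_true = [1] * 7 + [0] * 3
always_positive = [1] * len(y_true)

if __name__ == "__main__":
    print(f1_score_binary(y_true, always_positive))
    print(roc_auc_hard(y_true, always_positive))
```

Read this way, the rise of Roc Auc above 0.5 from epoch 12 onward would mark the point where thresholded predictions start to differ between examples. Without the metric code this remains an interpretation.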
 ### Framework versions

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e48e5c04022531c9afce686f13e6957e74f8c64b316e8be49e0448aab3e9719b
 size 438775128

 version https://git-lfs.github.com/spec/v1
+oid sha256:cfa2699e8ad82e853a2faff23f77a1255bee2ec63cb3cce9fdf437fca59c4e88
 size 438775128

training_args.bin

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6722ff36b427261af23977edbfc6bab9cc1ace7c105f229a78d5b79025104570
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:95fb0aae5bd2dd0697810cf4b690f56c8abfebbe77aa48905002bed6530c7eea
 size 5176
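
Both binary files are stored as Git LFS pointers: three text lines giving the spec version, a `sha256` object id, and the size in bytes. In each diff only the `oid` changes while the size stays the same (438775128 and 5176), so the files were rewritten with new contents of identical length. The sketch below shows one way to parse such a pointer and check a downloaded file against it. The helper names are hypothetical, and it assumes the file has already been fetched locally.

```python
import hashlib
from pathlib import Path

# New pointer for model.safetensors, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:cfa2699e8ad82e853a2faff23f77a1255bee2ec63cb3cce9fdf437fca59c4e88
size 438775128
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing a wrong-sized file
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_POINTER)
    print(pointer["algo"], pointer["size"])
    # matches_pointer("model.safetensors", pointer)  # needs the real file
```

Checking the size before hashing keeps the common mismatch case fast for a file of roughly 439 MB.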