stulcrad
/

CNEC2_0_Supertypes_xlm-roberta-large

@@ -25,16 +25,16 @@ model-index:
     metrics:
     - name: Precision
       type: precision
-      value: 0.8022359290670779
     - name: Recall
       type: recall
-      value: 0.8549712407559573
     - name: F1
       type: f1
-      value: 0.8277645186953062
     - name: Accuracy
       type: accuracy
-      value: 0.9616810519608411
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -44,11 +44,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the cnec dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2033
-- Precision: 0.8022
-- Recall: 0.8550
-- F1: 0.8278
-- Accuracy: 0.9617
 ## Model description
@@ -68,33 +68,25 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.6981        | 0.56  | 500  | 0.3042          | 0.5141    | 0.6652 | 0.5800 | 0.9121   |
-| 0.2782        | 1.11  | 1000 | 0.2128          | 0.7078    | 0.8159 | 0.7580 | 0.9495   |
-| 0.2247        | 1.67  | 1500 | 0.2200          | 0.7055    | 0.8081 | 0.7534 | 0.9450   |
-| 0.1986        | 2.22  | 2000 | 0.2291          | 0.6569    | 0.8110 | 0.7259 | 0.9460   |
-| 0.1697        | 2.78  | 2500 | 0.1819          | 0.7520    | 0.8184 | 0.7838 | 0.9548   |
-| 0.1415        | 3.33  | 3000 | 0.1873          | 0.7341    | 0.7975 | 0.7645 | 0.9527   |
-| 0.1284        | 3.89  | 3500 | 0.1752          | 0.7618    | 0.8578 | 0.8070 | 0.9590   |
-| 0.1073        | 4.44  | 4000 | 0.1903          | 0.7793    | 0.8488 | 0.8126 | 0.9586   |
-| 0.1006        | 5.0   | 4500 | 0.1741          | 0.7922    | 0.8661 | 0.8275 | 0.9610   |
-| 0.0788        | 5.56  | 5000 | 0.1830          | 0.7995    | 0.8537 | 0.8258 | 0.9623   |
-| 0.0838        | 6.11  | 5500 | 0.2096          | 0.8018    | 0.8509 | 0.8256 | 0.9610   |
-| 0.0617        | 6.67  | 6000 | 0.1978          | 0.8056    | 0.8632 | 0.8334 | 0.9627   |
-| 0.0515        | 7.22  | 6500 | 0.2020          | 0.8061    | 0.8521 | 0.8284 | 0.9616   |
-| 0.0455        | 7.78  | 7000 | 0.2033          | 0.8022    | 0.8550 | 0.8278 | 0.9617   |
 ### Framework versions

     metrics:
     - name: Precision
       type: precision
+      value: 0.8325581395348837
     - name: Recall
       type: recall
+      value: 0.8824979457682827
     - name: F1
       type: f1
+      value: 0.8568009573195053
     - name: Accuracy
       type: accuracy
+      value: 0.965938712854081
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the cnec dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1992
+- Precision: 0.8326
+- Recall: 0.8825
+- F1: 0.8568
+- Accuracy: 0.9659
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.5321        | 2.22  | 500  | 0.1641          | 0.7159    | 0.8065 | 0.7585 | 0.9566   |
+| 0.1512        | 4.44  | 1000 | 0.1831          | 0.7886    | 0.8611 | 0.8233 | 0.9591   |
+| 0.0967        | 6.67  | 1500 | 0.1866          | 0.7628    | 0.8628 | 0.8097 | 0.9596   |
+| 0.0637        | 8.89  | 2000 | 0.1586          | 0.8054    | 0.8841 | 0.8429 | 0.9648   |
+| 0.0422        | 11.11 | 2500 | 0.1777          | 0.8294    | 0.8648 | 0.8467 | 0.9654   |
+| 0.0292        | 13.33 | 3000 | 0.1992          | 0.8326    | 0.8825 | 0.8568 | 0.9659   |
 ### Framework versions

runs/Mar07_19-15-06_g01/events.out.tfevents.1709835307.g01.769784.6 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0b1918e326f5ad913815ef2a936e28d0de615e527a8bf102944f8f723e0cbbc8
-size 8757

 version https://git-lfs.github.com/spec/v1
+oid sha256:995a8a9c3725ca25c80d0640df884eef6f077945004c462bb190a4d8a4f1b560
+size 9111