End of training

Files changed (6)

README.md +20 -18
config.json +4 -24
model.safetensors +2 -2
runs/Jul03_10-14-35_31ca19578870/events.out.tfevents.1720001694.31ca19578870.2037.0 +3 -0
runs/Jul03_10-14-35_31ca19578870/events.out.tfevents.1720001954.31ca19578870.2037.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1471
-- Precision: 0.7995
-- Recall: 0.9088
-- F1: 0.8506
-- Accuracy: 0.9605
 ## Model description
@@ -44,27 +44,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 10
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 1.7775        | 1.0   | 31   | 0.3746          | 0.6199    | 0.6839 | 0.6503 | 0.8978   |
-| 0.1886        | 2.0   | 62   | 0.0734          | 0.9590    | 0.9301 | 0.9444 | 0.9875   |
-| 0.0821        | 3.0   | 93   | 0.0413          | 0.9697    | 0.9651 | 0.9674 | 0.9928   |
-| 0.0427        | 4.0   | 124  | 0.0400          | 0.9491    | 0.9635 | 0.9562 | 0.9911   |
-| 0.0352        | 5.0   | 155  | 0.0397          | 0.9421    | 0.9571 | 0.9496 | 0.9899   |
-| 0.0315        | 6.0   | 186  | 0.0410          | 0.9371    | 0.9579 | 0.9474 | 0.9895   |
-| 0.0344        | 7.0   | 217  | 0.0386          | 0.9612    | 0.9643 | 0.9627 | 0.9922   |
-| 0.0292        | 8.0   | 248  | 0.0383          | 0.9574    | 0.9651 | 0.9612 | 0.9921   |
-| 0.0286        | 9.0   | 279  | 0.0387          | 0.9543    | 0.9619 | 0.9581 | 0.9913   |
-| 0.0259        | 10.0  | 310  | 0.0415          | 0.9430    | 0.9595 | 0.9512 | 0.9901   |
 ### Framework versions

 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1313
+- Precision: 0.0
+- Recall: 0.0
+- F1: 0.0
+- Accuracy: 0.9315
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 4
 - eval_batch_size: 32
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 10
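
The new hyperparameters are consistent with each other: 4 examples per step times 2 accumulation steps gives the reported total_train_batch_size of 8, assuming a single device. The fractional Epoch values in the new training results table also fall out if one assumes 11 micro-batches per epoch. That count is inferred here from the Epoch and Step columns and is not stated anywhere in the commit. A minimal sketch of the arithmetic, with hypothetical helper names:

```python
from fractions import Fraction

TRAIN_BATCH_SIZE = 4           # from the model card
GRAD_ACCUM_STEPS = 2           # from the model card
MICRO_BATCHES_PER_EPOCH = 11   # assumption: inferred from the Epoch/Step columns


def total_train_batch_size(per_device: int, accum: int, devices: int = 1) -> int:
    """Examples consumed per optimizer step (single device assumed by default)."""
    return per_device * accum * devices


def epoch_at_step(step: int) -> float:
    """Fractional epoch reached after `step` optimizer steps, rounded like the table."""
    micro_batches_seen = step * GRAD_ACCUM_STEPS
    return round(float(Fraction(micro_batches_seen, MICRO_BATCHES_PER_EPOCH)), 4)


if __name__ == "__main__":
    print(total_train_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS))
    for step in (5, 11, 16, 22, 49, 50):
        print(step, epoch_at_step(step))
```

Under that assumption an epoch is 5.5 optimizer steps, which is why evaluations land alternately on whole and fractional epochs.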
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 0.9091 | 5    | 0.7631          | 0.0       | 0.0    | 0.0    | 0.9368   |
+| No log        | 2.0    | 11   | 0.2917          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| No log        | 2.9091 | 16   | 0.2146          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| 0.5371        | 4.0    | 22   | 0.1465          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| 0.5371        | 4.9091 | 27   | 0.1146          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| 0.5371        | 6.0    | 33   | 0.0938          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| 0.5371        | 6.9091 | 38   | 0.0865          | 0.0       | 0.0    | 0.0    | 0.9430   |
+| 0.1459        | 8.0    | 44   | 0.0824          | 0.0       | 0.0    | 0.0    | 0.9456   |
+| 0.1459        | 8.9091 | 49   | 0.0797          | 0.0833    | 0.0714 | 0.0769 | 0.9514   |
+| 0.1459        | 9.0909 | 50   | 0.0797          | 0.0833    | 0.0714 | 0.0769 | 0.9514   |
 ### Framework versions
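
The new results table reports zero precision, recall and F1 for most evaluations while accuracy stays near 0.94. One reading that fits those numbers, though the commit does not confirm it, is that the model predicts O for almost every token: token-level accuracy then equals the share of O tokens, while span-level scores find no correct entities. The sketch below reproduces that effect on toy tags (not data from this run) with a hand-rolled span matcher. The metric implementation actually used for the card is not shown in the diff, and the function names here are hypothetical.

```python
def extract_spans(tags):
    """Return the set of (start, end) spans in a BIO tag sequence.

    Entity type is ignored because the new config has a single type (HL).
    """
    spans, start = set(), None
    for i, tag in enumerate(list(tags) + ["O"]):  # sentinel closes a trailing span
        if start is not None and not tag.startswith("I-"):
            spans.add((start, i))
            start = None
        if tag.startswith("B-"):
            start = i
    return spans


def scores(gold, pred):
    """Span-level precision/recall/F1 plus token-level accuracy."""
    gold_spans, pred_spans = extract_spans(gold), extract_spans(pred)
    true_pos = len(gold_spans & pred_spans)
    precision = true_pos / len(pred_spans) if pred_spans else 0.0
    recall = true_pos / len(gold_spans) if gold_spans else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    accuracy = sum(g == p for g, p in zip(gold, pred)) / len(gold)
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


if __name__ == "__main__":
    gold = ["O"] * 18 + ["B-HL", "I-HL"]  # toy example: one entity in 20 tokens
    pred = ["O"] * 20                     # a model that never predicts an entity
    print(scores(gold, pred))
```

On the toy input, accuracy is 0.9 while the three span-level scores are 0.0, the same pattern as the first eight rows of the table.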

config.json CHANGED

@@ -12,34 +12,14 @@
   "hidden_size": 768,
   "id2label": {
     "0": "O",
-    "1": "B-NAME",
-    "2": "I-NAME",
-    "3": "B-DATE",
-    "4": "I-DATE",
-    "5": "B-UNI",
-    "6": "I-UNI",
-    "7": "B-MAJ",
-    "8": "I-MAJ",
-    "9": "B-MAIL",
-    "10": "I-MAIL",
-    "11": "B-UNI_ABRREV",
-    "12": "I-PHONE"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "B-DATE": 3,
-    "B-MAIL": 9,
-    "B-MAJ": 7,
-    "B-NAME": 1,
-    "B-UNI": 5,
-    "B-UNI_ABBREV": 11,
-    "I-DATE": 4,
-    "I-MAIL": 10,
-    "I-MAJ": 8,
-    "I-NAME": 2,
-    "I-PHONE": 12,
-    "I-UNI": 6,
     "O": 0
   },
   "layer_norm_eps": 1e-05,

   "hidden_size": 768,
   "id2label": {
     "0": "O",
+    "1": "B-HL",
+    "2": "I-HL"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "B-HL": 1,
+    "I-HL": 2,
     "O": 0
   },
   "layer_norm_eps": 1e-05,

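The removed mapping shows why the two label maps are worth checking against each other: id2label spells index 11 as B-UNI_ABRREV, while label2id uses B-UNI_ABBREV. A minimal sketch of deriving both maps from one label list, so they cannot drift apart, together with a consistency check; the helper names are hypothetical and not part of the commit.

```python
import json

LABELS = ["O", "B-HL", "I-HL"]  # the three labels in the new config.json


def build_label_maps(labels):
    """Derive id2label (string keys, as in config.json) and label2id from one list."""
    id2label = {str(i): label for i, label in enumerate(labels)}
    label2id = {label: i for i, label in enumerate(labels)}
    return id2label, label2id


def maps_are_consistent(id2label, label2id):
    """True when both maps have the same size and every entry round-trips."""
    return len(id2label) == len(label2id) and all(
        label2id.get(label) == int(index) for index, label in id2label.items()
    )


if __name__ == "__main__":
    id2label, label2id = build_label_maps(LABELS)
    print(json.dumps({"id2label": id2label, "label2id": label2id}, indent=2))
    print(maps_are_consistent(id2label, label2id))
```

Run against the removed entries for index 11, the check returns False because the two spellings do not match.
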
model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:13cecb2a6666b1ff8f1229bb2448e808e6c397b4542736c221a9a928e4522565
-size 1109876260

 version https://git-lfs.github.com/spec/v1
+oid sha256:6d29f391ac21a39b563e2b5c53fe34320274d71ca105ba529db68ad9360ff6d9
+size 1109845500
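
The file shrinks by 30,760 bytes, which matches what the config.json change would predict if the only tensor that changed shape is a linear classification head going from 13 labels to 3. The sketch below checks that arithmetic; it assumes float32 weights and a head with hidden_size weights plus one bias per label, neither of which is stated in the diff.

```python
HIDDEN_SIZE = 768      # from config.json
BYTES_PER_PARAM = 4    # assumption: weights stored as float32
OLD_NUM_LABELS = 13    # "O" plus the 12 entity tags in the removed id2label
NEW_NUM_LABELS = 3     # "O", "B-HL", "I-HL"

OLD_FILE_SIZE = 1109876260  # bytes, from the removed LFS pointer
NEW_FILE_SIZE = 1109845500  # bytes, from the added LFS pointer


def classifier_head_bytes(num_labels: int) -> int:
    """Bytes for a linear head: one weight row and one bias per label (assumed layout)."""
    return num_labels * (HIDDEN_SIZE + 1) * BYTES_PER_PARAM


expected_delta = classifier_head_bytes(OLD_NUM_LABELS) - classifier_head_bytes(NEW_NUM_LABELS)
observed_delta = OLD_FILE_SIZE - NEW_FILE_SIZE

if __name__ == "__main__":
    print(expected_delta, observed_delta)
```

Both values come out to 30760, so the size change is consistent with the rest of the checkpoint keeping its shape.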

runs/Jul03_10-14-35_31ca19578870/events.out.tfevents.1720001694.31ca19578870.2037.0 ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d125fd78128d70e32b3dafba1d93c84ecda49814e0c08632352fd34ea5e5466
+size 30433

runs/Jul03_10-14-35_31ca19578870/events.out.tfevents.1720001954.31ca19578870.2037.1 ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e32983657565700edcd28262c519a94f46eb511395d371eb2147a61dc806536
+size 1014

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:527e9c3d048bd7bc6abe2e63ff7b8e37110134364244cee2292407ff334ca3b9
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:d919edb84eb0ae0199530f1de1ae262e407d2db1b5b7b24781cdc88eebeba8ee
 size 5176