stulcrad
/

fine_tuned_XLMROBERTA_cs_wikann

@@ -15,11 +15,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [xlm-roberta-large](https://huggingface.co/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1855
-- Overall Precision: 0.9261
-- Overall Recall: 0.9412
-- Overall F1: 0.9336
-- Overall Accuracy: 0.9748
 ## Model description
@@ -39,26 +39,24 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 64
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|:--------------:|:----------:|:----------------:|
-| 0.2249        | 1.07  | 500  | 0.1466          | 0.8452            | 0.8867         | 0.8655     | 0.9603           |
-| 0.0822        | 2.13  | 1000 | 0.1265          | 0.8953            | 0.9256         | 0.9102     | 0.9704           |
-| 0.049         | 3.2   | 1500 | 0.1349          | 0.9081            | 0.9279         | 0.9179     | 0.9722           |
-| 0.0315        | 4.26  | 2000 | 0.1511          | 0.9098            | 0.9295         | 0.9195     | 0.9715           |
-| 0.021         | 5.33  | 2500 | 0.1421          | 0.9200            | 0.9394         | 0.9296     | 0.9745           |
-| 0.0126        | 6.4   | 3000 | 0.1604          | 0.9239            | 0.9380         | 0.9309     | 0.9751           |
-| 0.0092        | 7.46  | 3500 | 0.1727          | 0.9200            | 0.9378         | 0.9288     | 0.9743           |
-| 0.0058        | 8.53  | 4000 | 0.1843          | 0.9208            | 0.9384         | 0.9295     | 0.9738           |
-| 0.0041        | 9.59  | 4500 | 0.1855          | 0.9261            | 0.9412         | 0.9336     | 0.9748           |
 ### Framework versions

 This model is a fine-tuned version of [xlm-roberta-large](https://huggingface.co/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1203
+- Overall Precision: 0.9078
+- Overall Recall: 0.9326
+- Overall F1: 0.9200
+- Overall Accuracy: 0.9712
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|:--------------:|:----------:|:----------------:|
+| 0.3409        | 0.4   | 500  | 0.1931          | 0.7764            | 0.8465         | 0.8100     | 0.9495           |
+| 0.1816        | 0.8   | 1000 | 0.1427          | 0.8405            | 0.8793         | 0.8595     | 0.9576           |
+| 0.1401        | 1.2   | 1500 | 0.1273          | 0.8758            | 0.9068         | 0.8910     | 0.9651           |
+| 0.1088        | 1.6   | 2000 | 0.1392          | 0.8868            | 0.9139         | 0.9001     | 0.9662           |
+| 0.1027        | 2.0   | 2500 | 0.1096          | 0.8929            | 0.9233         | 0.9078     | 0.9699           |
+| 0.0667        | 2.4   | 3000 | 0.1267          | 0.9030            | 0.9268         | 0.9148     | 0.9699           |
+| 0.0601        | 2.8   | 3500 | 0.1203          | 0.9078            | 0.9326         | 0.9200     | 0.9712           |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6a614875e9993dcde72688e038b9d0df90969abe3a67be6d02d813bbd7be6112
 size 2235440556

 version https://git-lfs.github.com/spec/v1
+oid sha256:ed64719257e7e84b14e26c4feaf8a5728cf42b0b65c3d44c860ca8e993d9be4d
 size 2235440556

runs/Feb05_18-45-49_n28/events.out.tfevents.1707155152.n28.3840034.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eaa1442ad87b8b15e227b080b111df508b1b93ea3b0c4edcab7ef668f83ddb86
-size 9294

 version https://git-lfs.github.com/spec/v1
+oid sha256:0aeebaec8b05d1e8de74dffd7d577b3afacc7040a609f359e5c87a4d2d4850c5
+size 9648