mhr2004/roberta-large-nsp-1000-1e-06-8

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-large](https://huggingface.co/roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5494
 ## Model description
@@ -40,37 +40,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 25
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 32   | 0.6936          |
-| No log        | 2.0   | 64   | 0.6895          |
-| No log        | 3.0   | 96   | 0.6860          |
-| No log        | 4.0   | 128  | 0.6809          |
-| No log        | 5.0   | 160  | 0.6744          |
-| No log        | 6.0   | 192  | 0.6670          |
-| 0.6925        | 7.0   | 224  | 0.6560          |
-| 0.6925        | 8.0   | 256  | 0.6460          |
-| 0.6925        | 9.0   | 288  | 0.6242          |
-| 0.6925        | 10.0  | 320  | 0.6054          |
-| 0.6925        | 11.0  | 352  | 0.6036          |
-| 0.6925        | 12.0  | 384  | 0.5899          |
-| 0.6429        | 13.0  | 416  | 0.5801          |
-| 0.6429        | 14.0  | 448  | 0.5833          |
-| 0.6429        | 15.0  | 480  | 0.5716          |
-| 0.6429        | 16.0  | 512  | 0.5684          |
-| 0.6429        | 17.0  | 544  | 0.5719          |
-| 0.6429        | 18.0  | 576  | 0.5615          |
-| 0.5906        | 19.0  | 608  | 0.5588          |
-| 0.5906        | 20.0  | 640  | 0.5575          |
-| 0.5906        | 21.0  | 672  | 0.5561          |
-| 0.5906        | 22.0  | 704  | 0.5516          |
-| 0.5906        | 23.0  | 736  | 0.5499          |
-| 0.5906        | 24.0  | 768  | 0.5499          |
-| 0.5684        | 25.0  | 800  | 0.5494          |
 ### Framework versions

 This model is a fine-tuned version of [roberta-large](https://huggingface.co/roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5884
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 32   | 0.6935          |
+| No log        | 2.0   | 64   | 0.6888          |
+| No log        | 3.0   | 96   | 0.6834          |
+| No log        | 4.0   | 128  | 0.6600          |
+| No log        | 5.0   | 160  | 0.6272          |
+| No log        | 6.0   | 192  | 0.6098          |
+| 0.6812        | 7.0   | 224  | 0.5968          |
+| 0.6812        | 8.0   | 256  | 0.5925          |
+| 0.6812        | 9.0   | 288  | 0.5899          |
+| 0.6812        | 10.0  | 320  | 0.5873          |
+| 0.6812        | 11.0  | 352  | 0.5866          |
+| 0.6812        | 12.0  | 384  | 0.5870          |
+| 0.6056        | 13.0  | 416  | 0.5884          |
+| 0.6056        | 14.0  | 448  | 0.5889          |
+| 0.6056        | 15.0  | 480  | 0.5887          |
+| 0.6056        | 16.0  | 512  | 0.5884          |
 ### Framework versions

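The updated table ends at epoch 16 even though `num_epochs` is now 30, and the card does not say why. As a rough sketch for reading the new numbers, the snippet below copies the validation losses from the added rows and locates the lowest one. The variable names are illustrative and not part of the training code.

```python
# Validation loss per epoch, copied from the added ("+") rows of the table above.
val_loss = [
    0.6935, 0.6888, 0.6834, 0.6600, 0.6272, 0.6098, 0.5968, 0.5925,
    0.5899, 0.5873, 0.5866, 0.5870, 0.5884, 0.5889, 0.5887, 0.5884,
]

# The Step column grows by 32 per epoch in the table.
steps_per_epoch = 32
steps = [steps_per_epoch * (i + 1) for i in range(len(val_loss))]

# Epoch with the lowest validation loss (epochs are 1-indexed in the table).
best_idx = min(range(len(val_loss)), key=val_loss.__getitem__)
best_epoch = best_idx + 1
best_loss = val_loss[best_idx]

# Last logged epoch and its loss.
final_epoch = len(val_loss)
final_loss = val_loss[-1]

print(f"best:  epoch {best_epoch}, step {steps[best_idx]}, loss {best_loss}")
print(f"final: epoch {final_epoch}, step {steps[-1]}, loss {final_loss}")
```

On these rows the reported `Loss: 0.5884` matches the final logged epoch rather than the lowest value in the table.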
model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:50ce2ee029aa7664ad32c8e6ae0faf8ffaf1bb9f2aaa236cbd9b4bbd2095b594
 size 1421495416

 version https://git-lfs.github.com/spec/v1
+oid sha256:9192657617bec4a3ae0ad4a4c78020b199424b8660f2b5e010860416de49dd37
 size 1421495416

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:74d95e5e57bd7abc11671822c63c442c534c3772aef5058dcb5045f56047f5fd
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:3e818a9cfa8bc0cc7a7cc4e5630814b24f90a71df6271d47052fc93a4b33b145
 size 4984