ygory
/

roberta-base-on-cuad-finetuned-squad

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Rakib/roberta-base-on-cuad](https://huggingface.co/Rakib/roberta-base-on-cuad) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1095
 ## Model description
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -46,32 +46,32 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.5857        | 0.1142 | 50   | 0.1486          |
-| 0.6676        | 0.2283 | 100  | 0.1320          |
-| 0.7345        | 0.3425 | 150  | 4.5831          |
-| 1.3841        | 0.4566 | 200  | 0.1313          |
-| 0.1602        | 0.5708 | 250  | 0.1352          |
-| 0.1043        | 0.6849 | 300  | 0.1465          |
-| 0.1937        | 0.7991 | 350  | 0.1348          |
-| 0.1971        | 0.9132 | 400  | 0.1370          |
-| 0.1321        | 1.0274 | 450  | 0.1337          |
-| 0.2461        | 1.1416 | 500  | 0.1327          |
-| 0.2117        | 1.2557 | 550  | 0.1305          |
-| 0.1829        | 1.3699 | 600  | 0.1306          |
-| 0.1973        | 1.4840 | 650  | 0.1304          |
-| 0.1966        | 1.5982 | 700  | 0.1367          |
-| 0.1698        | 1.7123 | 750  | 0.1335          |
-| 0.1037        | 1.8265 | 800  | 0.1379          |
-| 0.0902        | 1.9406 | 850  | 0.1397          |
-| 0.2125        | 2.0548 | 900  | 0.1311          |
-| 0.0898        | 2.1689 | 950  | 0.1369          |
-| 0.1593        | 2.2831 | 1000 | 0.1322          |
-| 0.2223        | 2.3973 | 1050 | 0.1300          |
-| 0.2266        | 2.5114 | 1100 | 0.1211          |
-| 0.1182        | 2.6256 | 1150 | 0.1325          |
-| 0.1429        | 2.7397 | 1200 | 0.1280          |
-| 0.1822        | 2.8539 | 1250 | 0.1179          |
-| 0.1932        | 2.9680 | 1300 | 0.1095          |
 ### Framework versions

 This model is a fine-tuned version of [Rakib/roberta-base-on-cuad](https://huggingface.co/Rakib/roberta-base-on-cuad) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0793
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.1503        | 0.1142 | 50   | 0.1216          |
+| 0.1305        | 0.2283 | 100  | 0.1138          |
+| 0.1693        | 0.3425 | 150  | 0.1135          |
+| 0.1986        | 0.4566 | 200  | 0.1063          |
+| 0.1089        | 0.5708 | 250  | 0.0963          |
+| 0.0799        | 0.6849 | 300  | 0.1018          |
+| 0.1527        | 0.7991 | 350  | 0.0986          |
+| 0.1387        | 0.9132 | 400  | 0.1064          |
+| 0.0938        | 1.0274 | 450  | 0.0951          |
+| 0.1533        | 1.1416 | 500  | 0.0805          |
+| 0.1329        | 1.2557 | 550  | 0.0800          |
+| 0.1254        | 1.3699 | 600  | 0.0763          |
+| 0.1247        | 1.4840 | 650  | 0.0789          |
+| 0.1185        | 1.5982 | 700  | 0.0817          |
+| 0.0808        | 1.7123 | 750  | 0.0835          |
+| 0.0622        | 1.8265 | 800  | 0.0815          |
+| 0.0455        | 1.9406 | 850  | 0.0809          |
+| 0.0846        | 2.0548 | 900  | 0.0851          |
+| 0.0453        | 2.1689 | 950  | 0.0832          |
+| 0.0808        | 2.2831 | 1000 | 0.0789          |
+| 0.0902        | 2.3973 | 1050 | 0.0793          |
+| 0.0974        | 2.5114 | 1100 | 0.0787          |
+| 0.0508        | 2.6256 | 1150 | 0.0802          |
+| 0.0535        | 2.7397 | 1200 | 0.0835          |
+| 0.0956        | 2.8539 | 1250 | 0.0815          |
+| 0.1126        | 2.9680 | 1300 | 0.0793          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:04f74cd08e66663fe46220637e4c1cfdf733cccfcd1bea1e54ea3de8fafd86ed
 size 496250232

 version https://git-lfs.github.com/spec/v1
+oid sha256:8fd104494a75ea5187d97ac7a7fb15463e7f8776fb16a168a8617127f420ee91
 size 496250232

tokenizer.json CHANGED Viewed

@@ -4,9 +4,18 @@
     "direction": "Right",
     "max_length": 384,
     "strategy": "OnlySecond",
-    "stride": 0
   },
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

     "direction": "Right",
     "max_length": 384,
     "strategy": "OnlySecond",
+    "stride": 128
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 384
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 1,
+    "pad_type_id": 0,
+    "pad_token": "<pad>"
   },
   "added_tokens": [
     {
       "id": 0,