End of training

Browse files

Files changed (5) hide show

README.md +42 -32
model.safetensors +1 -1
runs/Mar01_00-22-32_f85640113c2d/events.out.tfevents.1709252552.f85640113c2d.5243.0 +3 -0
tokenizer.json +21 -21
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3578
 ## Model description
@@ -38,42 +38,52 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.9444        | 1.0   | 6    | 2.3826          |
-| 2.1543        | 2.0   | 12   | 1.8663          |
-| 1.7072        | 3.0   | 18   | 1.5640          |
-| 1.4261        | 4.0   | 24   | 1.3127          |
-| 1.2227        | 5.0   | 30   | 1.1949          |
-| 1.061         | 6.0   | 36   | 1.0322          |
-| 0.9789        | 7.0   | 42   | 0.9130          |
-| 0.8812        | 8.0   | 48   | 0.8633          |
-| 0.8289        | 9.0   | 54   | 0.7872          |
-| 0.693         | 10.0  | 60   | 0.7300          |
-| 0.7416        | 11.0  | 66   | 0.7454          |
-| 0.6965        | 12.0  | 72   | 0.6641          |
-| 0.6592        | 13.0  | 78   | 0.6362          |
-| 0.6411        | 14.0  | 84   | 0.5824          |
-| 0.5677        | 15.0  | 90   | 0.5541          |
-| 0.5575        | 16.0  | 96   | 0.5299          |
-| 0.5305        | 17.0  | 102  | 0.5435          |
-| 0.5371        | 18.0  | 108  | 0.4937          |
-| 0.4795        | 19.0  | 114  | 0.4800          |
-| 0.4693        | 20.0  | 120  | 0.4486          |
-| 0.4836        | 21.0  | 126  | 0.4478          |
-| 0.4351        | 22.0  | 132  | 0.4323          |
-| 0.47          | 23.0  | 138  | 0.4131          |
-| 0.414         | 24.0  | 144  | 0.4023          |
-| 0.4396        | 25.0  | 150  | 0.3961          |
-| 0.4079        | 26.0  | 156  | 0.3870          |
-| 0.4052        | 27.0  | 162  | 0.3846          |
-| 0.3914        | 28.0  | 168  | 0.3676          |
-| 0.4287        | 29.0  | 174  | 0.3593          |
-| 0.3583        | 30.0  | 180  | 0.3578          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0868
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 40
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.9639        | 1.0   | 6    | 2.2327          |
+| 2.0112        | 2.0   | 12   | 1.7195          |
+| 1.5448        | 3.0   | 18   | 1.3346          |
+| 1.2344        | 4.0   | 24   | 1.1502          |
+| 1.0961        | 5.0   | 30   | 1.0083          |
+| 0.9938        | 6.0   | 36   | 0.9712          |
+| 0.9205        | 7.0   | 42   | 0.8846          |
+| 0.8293        | 8.0   | 48   | 0.7529          |
+| 0.7735        | 9.0   | 54   | 0.7236          |
+| 0.7284        | 10.0  | 60   | 0.7006          |
+| 0.673         | 11.0  | 66   | 0.6580          |
+| 0.6238        | 12.0  | 72   | 0.5931          |
+| 0.5871        | 13.0  | 78   | 0.5475          |
+| 0.548         | 14.0  | 84   | 0.4944          |
+| 0.5           | 15.0  | 90   | 0.4888          |
+| 0.4772        | 16.0  | 96   | 0.4259          |
+| 0.4605        | 17.0  | 102  | 0.4471          |
+| 0.4191        | 18.0  | 108  | 0.3692          |
+| 0.3724        | 19.0  | 114  | 0.3329          |
+| 0.3483        | 20.0  | 120  | 0.3270          |
+| 0.3268        | 21.0  | 126  | 0.2739          |
+| 0.2884        | 22.0  | 132  | 0.2396          |
+| 0.2567        | 23.0  | 138  | 0.2038          |
+| 0.2415        | 24.0  | 144  | 0.2121          |
+| 0.2322        | 25.0  | 150  | 0.1778          |
+| 0.1971        | 26.0  | 156  | 0.1631          |
+| 0.2065        | 27.0  | 162  | 0.1592          |
+| 0.1918        | 28.0  | 168  | 0.1422          |
+| 0.1854        | 29.0  | 174  | 0.1359          |
+| 0.1691        | 30.0  | 180  | 0.1291          |
+| 0.1645        | 31.0  | 186  | 0.1201          |
+| 0.1614        | 32.0  | 192  | 0.1138          |
+| 0.1435        | 33.0  | 198  | 0.1082          |
+| 0.1354        | 34.0  | 204  | 0.1014          |
+| 0.129         | 35.0  | 210  | 0.0956          |
+| 0.1298        | 36.0  | 216  | 0.0971          |
+| 0.1266        | 37.0  | 222  | 0.0916          |
+| 0.1374        | 38.0  | 228  | 0.0919          |
+| 0.1217        | 39.0  | 234  | 0.0882          |
+| 0.1341        | 40.0  | 240  | 0.0868          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:167f72c4e239a9640669bd8575ebb02c93b5e1a3e22a64d06dd428bbe1e8cd7b
 size 31207604

 version https://git-lfs.github.com/spec/v1
+oid sha256:4cfbcc16fa186e80c1f30f878cd3491ee3c5e85053511e9589b369a081ced43d
 size 31207604

runs/Mar01_00-22-32_f85640113c2d/events.out.tfevents.1709252552.f85640113c2d.5243.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:32de26b502cc67715925c95d6453fc9e13031593cc338abb7bdf3a03e8ac6d5d
+size 27997

tokenizer.json CHANGED Viewed

@@ -119,32 +119,32 @@
       "11": 20,
       "97": 21,
       "12": 22,
-      "13": 23,
-      "96": 24,
       "95": 25,
       "14": 26,
-      "94": 27,
-      "15": 28,
       "93": 29,
       "16": 30,
-      "17": 31,
-      "92": 32,
       "18": 33,
       "91": 34,
       "90": 35,
       "19": 36,
-      "20": 37,
-      "89": 38,
       "21": 39,
       "88": 40,
-      "22": 41,
-      "87": 42,
-      "86": 43,
-      "23": 44,
       "24": 45,
       "85": 46,
-      "84": 47,
-      "25": 48,
       "26": 49,
       "83": 50,
       "27": 51,
@@ -157,32 +157,32 @@
       "1 1",
       "9 7",
       "1 2",
-      "1 3",
       "9 6",
       "9 5",
       "1 4",
-      "9 4",
       "1 5",
       "9 3",
       "1 6",
-      "1 7",
       "9 2",
       "1 8",
       "9 1",
       "9 0",
       "1 9",
-      "2 0",
       "8 9",
       "2 1",
       "8 8",
-      "2 2",
       "8 7",
-      "8 6",
       "2 3",
       "2 4",
       "8 5",
-      "8 4",
       "2 5",
       "2 6",
       "8 3",
       "2 7",

       "11": 20,
       "97": 21,
       "12": 22,
+      "96": 23,
+      "13": 24,
       "95": 25,
       "14": 26,
+      "15": 27,
+      "94": 28,
       "93": 29,
       "16": 30,
+      "92": 31,
+      "17": 32,
       "18": 33,
       "91": 34,
       "90": 35,
       "19": 36,
+      "89": 37,
+      "20": 38,
       "21": 39,
       "88": 40,
+      "87": 41,
+      "22": 42,
+      "23": 43,
+      "86": 44,
       "24": 45,
       "85": 46,
+      "25": 47,
+      "84": 48,
       "26": 49,
       "83": 50,
       "27": 51,
       "1 1",
       "9 7",
       "1 2",
       "9 6",
+      "1 3",
       "9 5",
       "1 4",
       "1 5",
+      "9 4",
       "9 3",
       "1 6",
       "9 2",
+      "1 7",
       "1 8",
       "9 1",
       "9 0",
       "1 9",
       "8 9",
+      "2 0",
       "2 1",
       "8 8",
       "8 7",
+      "2 2",
       "2 3",
+      "8 6",
       "2 4",
       "8 5",
       "2 5",
+      "8 4",
       "2 6",
       "8 3",
       "2 7",

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9b65476e169e5915d0d98245574cb840abef742cd27134fa31a767c9d1641c6e
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:d8d78e70a07f179c1153d41b5f956272edf46055c23441c484f26204c26026f5
 size 5112