ninagroot/GPT2-705Mtest

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.5515
 ## Model description
@@ -48,24 +48,24 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.86  | 3    | 8.6420          |
-| No log        | 2.0   | 7    | 8.3017          |
-| No log        | 2.86  | 10   | 7.4703          |
-| No log        | 4.0   | 14   | 7.3033          |
-| No log        | 4.86  | 17   | 6.6681          |
-| 7.0997        | 6.0   | 21   | 6.2953          |
-| 7.0997        | 6.86  | 24   | 5.9852          |
-| 7.0997        | 8.0   | 28   | 5.7430          |
-| 7.0997        | 8.86  | 31   | 5.5474          |
-| 7.0997        | 10.0  | 35   | 5.5484          |
-| 7.0997        | 10.86 | 38   | 5.4335          |
-| 4.4904        | 12.0  | 42   | 5.4799          |
-| 4.4904        | 12.86 | 45   | 5.4311          |
-| 4.4904        | 14.0  | 49   | 5.7129          |
-| 4.4904        | 14.86 | 52   | 5.5459          |
-| 4.4904        | 16.0  | 56   | 5.5459          |
-| 4.4904        | 16.86 | 59   | 5.5522          |
-| 3.0663        | 17.14 | 60   | 5.5515          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.5908
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.86  | 3    | 8.2842          |
+| No log        | 2.0   | 7    | 8.0518          |
+| No log        | 2.86  | 10   | 8.2219          |
+| No log        | 4.0   | 14   | 7.1930          |
+| No log        | 4.86  | 17   | 6.8112          |
+| 7.2488        | 6.0   | 21   | 6.4425          |
+| 7.2488        | 6.86  | 24   | 6.2270          |
+| 7.2488        | 8.0   | 28   | 5.9215          |
+| 7.2488        | 8.86  | 31   | 5.8162          |
+| 7.2488        | 10.0  | 35   | 5.5886          |
+| 7.2488        | 10.86 | 38   | 5.4977          |
+| 4.6863        | 12.0  | 42   | 5.4080          |
+| 4.6863        | 12.86 | 45   | 5.4362          |
+| 4.6863        | 14.0  | 49   | 5.4796          |
+| 4.6863        | 14.86 | 52   | 5.5755          |
+| 4.6863        | 16.0  | 56   | 5.6143          |
+| 4.6863        | 16.86 | 59   | 5.5946          |
+| 3.1785        | 17.14 | 60   | 5.5908          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ee4a9c5cdd3799705c0296fcf0aec464ca69228d3b8120854fc6e5eb7f36ae30
 size 2796386080

 version https://git-lfs.github.com/spec/v1
+oid sha256:699c4eaa84193daab650a623aea5e5bfcff7d0981ab3f798f813b84ebb774cfa
 size 2796386080

runs/Apr16_11-14-38_gcn12.local.snellius.surf.nl/events.out.tfevents.1713258888.gcn12.local.snellius.surf.nl.1149499.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5cc88625145be1ea0dea170ee3d3ec0dd83d249b8a2a2778796e5a78231f91a5
+size 10435

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d76a6045c2ee2ccfa0812342fad889ec172e533d684e35763532afaa5c0e77d
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:0c0e2f18cffae8777c4c4445c40f56237acf34179f571f5675aa4fca79e492ed
 size 4984