arminmehrabian
/

distilgpt2-finetuned-wikitext2-agu

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

arminmehrabian commited on Sep 8, 2022

Commit

4089af7

·

1 Parent(s): 8de1d81

add model

Files changed (1) hide show

README.md +13 -3

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1929
 ## Model description
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 40
 ### Training results
@@ -89,7 +89,17 @@ The following hyperparameters were used during training:
 | 3.1278        | 37.0  | 1010433 | 3.1940          |
 | 3.1186        | 38.0  | 1037742 | 3.1934          |
 | 3.1136        | 39.0  | 1065051 | 3.1932          |
-| 3.12          | 40.0  | 1092360 | 3.1929          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.1869
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 50
 ### Training results
 | 3.1278        | 37.0  | 1010433 | 3.1940          |
 | 3.1186        | 38.0  | 1037742 | 3.1934          |
 | 3.1136        | 39.0  | 1065051 | 3.1932          |
+| 3.12          | 40.0  | 1092360 | 3.1931          |
+| 3.12          | 41.0  | 1119669 | 3.1930          |
+| 3.1165        | 42.0  | 1146978 | 3.1914          |
+| 3.1166        | 43.0  | 1174287 | 3.1900          |
+| 3.1139        | 44.0  | 1201596 | 3.1892          |
+| 3.1135        | 45.0  | 1228905 | 3.1885          |
+| 3.1077        | 46.0  | 1256214 | 3.1881          |
+| 3.1097        | 47.0  | 1283523 | 3.1873          |
+| 3.1076        | 48.0  | 1310832 | 3.1872          |
+| 3.102         | 49.0  | 1338141 | 3.1870          |
+| 3.1086        | 50.0  | 1365450 | 3.1869          |
 ### Framework versions