arminmehrabian commited on
Commit
4089af7
1 Parent(s): 8de1d81
Files changed (1) hide show
  1. README.md +13 -3
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 3.1929
18
 
19
  ## Model description
20
 
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
39
  - seed: 42
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
- - num_epochs: 40
43
 
44
  ### Training results
45
 
@@ -89,7 +89,17 @@ The following hyperparameters were used during training:
89
  | 3.1278 | 37.0 | 1010433 | 3.1940 |
90
  | 3.1186 | 38.0 | 1037742 | 3.1934 |
91
  | 3.1136 | 39.0 | 1065051 | 3.1932 |
92
- | 3.12 | 40.0 | 1092360 | 3.1929 |
 
 
 
 
 
 
 
 
 
 
93
 
94
 
95
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 3.1869
18
 
19
  ## Model description
20
 
 
39
  - seed: 42
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
+ - num_epochs: 50
43
 
44
  ### Training results
45
 
 
89
  | 3.1278 | 37.0 | 1010433 | 3.1940 |
90
  | 3.1186 | 38.0 | 1037742 | 3.1934 |
91
  | 3.1136 | 39.0 | 1065051 | 3.1932 |
92
+ | 3.12 | 40.0 | 1092360 | 3.1931 |
93
+ | 3.12 | 41.0 | 1119669 | 3.1930 |
94
+ | 3.1165 | 42.0 | 1146978 | 3.1914 |
95
+ | 3.1166 | 43.0 | 1174287 | 3.1900 |
96
+ | 3.1139 | 44.0 | 1201596 | 3.1892 |
97
+ | 3.1135 | 45.0 | 1228905 | 3.1885 |
98
+ | 3.1077 | 46.0 | 1256214 | 3.1881 |
99
+ | 3.1097 | 47.0 | 1283523 | 3.1873 |
100
+ | 3.1076 | 48.0 | 1310832 | 3.1872 |
101
+ | 3.102 | 49.0 | 1338141 | 3.1870 |
102
+ | 3.1086 | 50.0 | 1365450 | 3.1869 |
103
 
104
 
105
  ### Framework versions