z5ying commited on
Commit
5959cee
1 Parent(s): a632076

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -7
README.md CHANGED
@@ -12,9 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # distilgpt2-finetuned-wikitext2
14
 
15
- This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
16
- It achieves the following results on the evaluation set:
17
- - Loss: 3.1851
18
 
19
  ## Model description
20
 
@@ -39,15 +37,13 @@ The following hyperparameters were used during training:
39
  - seed: 42
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
- - num_epochs: 3.0
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
- | No log | 1.0 | 118 | 3.3003 |
49
- | No log | 2.0 | 236 | 3.2017 |
50
- | No log | 3.0 | 354 | 3.1851 |
51
 
52
 
53
  ### Framework versions
 
12
 
13
  # distilgpt2-finetuned-wikitext2
14
 
15
+ This model is a fine-tuned version of [z5ying/distilgpt2-finetuned-wikitext2](https://huggingface.co/z5ying/distilgpt2-finetuned-wikitext2) on the None dataset.
 
 
16
 
17
  ## Model description
18
 
 
37
  - seed: 42
38
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
  - lr_scheduler_type: linear
40
+ - num_epochs: 1
41
 
42
  ### Training results
43
 
44
  | Training Loss | Epoch | Step | Validation Loss |
45
  |:-------------:|:-----:|:----:|:---------------:|
46
+ | No log | 1.0 | 118 | 3.0306 |
 
 
47
 
48
 
49
  ### Framework versions