pritoms
/

distilgpt2-finetuned-wikitext2

Text Generation

generated_from_trainer

Inference Endpoints

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

pritoms commited on Oct 21, 2021

Commit

5fa0131

•

1 Parent(s): bfb380f

update model card README.md

Files changed (1) hide show

README.md +8 -7

README.md CHANGED Viewed

@@ -12,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
 # distilgpt2-finetuned-wikitext2
-This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3918
 ## Model description
@@ -45,13 +45,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 349  | 3.4119          |
-| 3.4442        | 2.0   | 698  | 3.3960          |
-| 3.3853        | 3.0   | 1047 | 3.3918          |
 ### Framework versions
-- Transformers 4.11.2
-- Pytorch 1.9.0+cu102
 - Tokenizers 0.10.3

 # distilgpt2-finetuned-wikitext2
+This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.0540
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 130  | 3.1733          |
+| No log        | 2.0   | 260  | 3.0756          |
+| No log        | 3.0   | 390  | 3.0540          |
 ### Framework versions
+- Transformers 4.11.3
+- Pytorch 1.9.0+cu111
+- Datasets 1.14.0
 - Tokenizers 0.10.3