ArmanAsq
/

distilgpt2-CLM-DSM

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ArmanAsq commited on Jun 9

Commit

0cc17a4

•

1 Parent(s): 88ad281

End of training

Files changed (1) hide show

README.md +12 -5

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5603
 ## Model description
@@ -40,16 +40,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 91   | 2.6410          |
-| No log        | 2.0   | 182  | 2.5742          |
-| No log        | 3.0   | 273  | 2.5603          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4049
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 91   | 2.6265          |
+| No log        | 2.0   | 182  | 2.5381          |
+| No log        | 3.0   | 273  | 2.4948          |
+| No log        | 4.0   | 364  | 2.4634          |
+| No log        | 5.0   | 455  | 2.4436          |
+| 2.6065        | 6.0   | 546  | 2.4276          |
+| 2.6065        | 7.0   | 637  | 2.4176          |
+| 2.6065        | 8.0   | 728  | 2.4103          |
+| 2.6065        | 9.0   | 819  | 2.4061          |
+| 2.6065        | 10.0  | 910  | 2.4049          |
 ### Framework versions