blatrie/my_awesome_eli5_clm-model

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

blatrie committed on Jun 10

Commit 65a4575 • 1 Parent(s): 74560a8

End of training

Files changed (2)

README.md +4 -4
model.safetensors +1 -1

README.md CHANGED

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8579
+- Loss: 3.9540
 ## Model description
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 276  | 3.8577          |
-| 3.9823        | 2.0   | 552  | 3.8534          |
-| 3.9823        | 3.0   | 828  | 3.8579          |
+| No log        | 1.0   | 273  | 3.9545          |
+| 3.9544        | 2.0   | 546  | 3.9522          |
+| 3.9544        | 3.0   | 819  | 3.9540          |
 ### Framework versions
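
The evaluation loss in the README diff can be easier to compare as perplexity. The sketch below assumes the reported loss is a mean per-token cross-entropy in natural-log units, which is the usual convention for causal language modeling but is not stated in this commit. The helper name `perplexity` is hypothetical.

```python
import math


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy_nats)


# Final evaluation losses taken from the README diff in this commit.
old_loss = 3.8579  # removed line
new_loss = 3.9540  # added line

# Under the stated assumption these come out at roughly 47 and 52.
print(f"old perplexity: {perplexity(old_loss):.2f}")
print(f"new perplexity: {perplexity(new_loss):.2f}")
```

The step counts per epoch also differ between the two runs (276 vs 273), so the two losses may not come from identical data splits and a direct comparison should be read with care.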

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09245bf9fcac3a89b653bc5850ce9f27f7fb7c136f11849a6ab1a052f5047e05
+oid sha256:391352758d59408a46ca91bb6a023761a3a11f715fd44474defa4b610b4b23eb
 size 327657928
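
The model.safetensors change is a Git LFS pointer: the `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. A minimal sketch of checking a downloaded file against such a pointer follows. The function names `parse_lfs_pointer` and `matches_pointer` and the local path in the usage note are hypothetical, not part of any Hugging Face or Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> tuple[str, str, int]:
    """Return (hash_algorithm, hex_digest, size_in_bytes) from pointer text."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return algo, digest, int(fields["size"])


def matches_pointer(path: str, pointer_text: str, chunk_size: int = 1 << 20) -> bool:
    """Check that the file at `path` has the size and SHA-256 the pointer records."""
    algo, digest, size = parse_lfs_pointer(pointer_text)
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    file_path = Path(path)
    # Cheap size check first, before hashing a large file.
    if file_path.stat().st_size != size:
        return False
    hasher = hashlib.sha256()
    with file_path.open("rb") as handle:
        # Hash in chunks so a ~328 MB file is never fully loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest
```

For example, `matches_pointer("model.safetensors", pointer_text)` with the new pointer above as `pointer_text` would return `True` only for a file of 327657928 bytes whose digest is the `391352...` value. Both revisions have the same size, so only the digest distinguishes them.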