itsankitkp
/

my_awesome_eli5_clm-model

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7719
 ## Model description
@@ -46,11 +46,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 1    | 3.7715          |
-| No log        | 2.0   | 2    | 3.7715          |
-| No log        | 3.0   | 3    | 3.7719          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.7541
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 3.8938        | 1.0   | 5635  | 3.7768          |
+| 3.8308        | 2.0   | 11270 | 3.7594          |
+| 3.8022        | 3.0   | 16905 | 3.7541          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:36ed28945d9686f4b199ef72b565f46199163a27625a44e9a49e061b66ae3532
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:63b8b1149030b1eec919212ca3fe0423801e2b33a6464e38c65b7cc53af931d4
 size 327657928

runs/May21_16-08-39_8c33b8e42b47/events.out.tfevents.1716307720.8c33b8e42b47.642.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c693a6b5451d61c514bc6a2a4f18aacf3baa8d735ea0286ecdb1f9b31a9ed422
-size 12691

 version https://git-lfs.github.com/spec/v1
+oid sha256:821ae32ad4e8b74009389b5ccb15749f1ea810a79f719f40a9c258bbac9579a5
+size 13327