Maximofn/GPT2-small-finetuned-Maximofn-short-jokes-dataset-casualLM

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.2082
+- Loss: 3.2013
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
+- train_batch_size: 28
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 3.381         | 1.0   | 6516  | 3.2650          |
-| 3.2617        | 2.0   | 13032 | 3.2063          |
-| 3.2142        | 3.0   | 19548 | 3.1986          |
+| 3.3866        | 1.0   | 7447  | 3.2590          |
+| 3.2599        | 2.0   | 14894 | 3.1997          |
+| 3.2126        | 3.0   | 22341 | 3.1920          |
 ### Framework versions
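
Review note: the change in steps per epoch (6516 to 7447) looks consistent with the change in `train_batch_size` (32 to 28) on an unchanged training set. The sketch below checks this under assumptions the model card does not state: a single device, no gradient accumulation, and the last partial batch kept, so that steps per epoch equals `ceil(N / batch_size)`. The helper `sample_count_range` is a hypothetical name introduced only for this illustration.

```python
# Values copied from the model card diff.
OLD = {"batch_size": 32, "steps_per_epoch": 6516}
NEW = {"batch_size": 28, "steps_per_epoch": 7447}


def sample_count_range(batch_size, steps_per_epoch):
    """Inclusive range of dataset sizes N with ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and that the final partial batch is kept.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


old_low, old_high = sample_count_range(**OLD)
new_low, new_high = sample_count_range(**NEW)

# Dataset sizes compatible with both runs at the same time.
common = (max(old_low, new_low), min(old_high, new_high))
print(common)  # (208489, 208512)

# The final "Step" values in each table are three epochs of the per-epoch count.
print(3 * OLD["steps_per_epoch"], 3 * NEW["steps_per_epoch"])  # 19548 22341
```

If those assumptions hold, both runs are compatible with a training set of roughly 208.5k examples, so the small differences in loss would come from the batch size rather than from different data.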

runs/Jul13_10-22-19_8de3af1b431d/events.out.tfevents.1720875425.8de3af1b431d.6946.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:da3103d50b57c7d4f1fd2a7c96cfbc729e27b09961d79af745275eb492659b77
+size 364
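
Review note: the three added lines are a Git LFS pointer, not the TensorBoard event file itself. The pointer records the spec version, the SHA-256 object ID, and the size in bytes of the real file. A minimal sketch of reading those fields follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse Git LFS pointer text ("key value" lines) into a dict.

    Hypothetical helper for illustration; it handles only the three
    keys that appear in this diff.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the diff, without the leading "+" markers.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:da3103d50b57c7d4f1fd2a7c96cfbc729e27b09961d79af745275eb492659b77
size 364
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])  # sha256 364
```

The 364-byte size suggests the event file holds very little data, though the diff alone does not show what it contains.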