osanseviero
/

sft_cml4

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

osanseviero commited on Jan 21

Commit

729c4da

•

1 Parent(s): bc9d0c3

End of training

Files changed (2) hide show

README.md +7 -4
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the ag_news dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.5432
 ## Model description
@@ -48,9 +48,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.52          | 0.53  | 200  | 3.6018          |
-| 2.8991        | 1.07  | 400  | 3.5992          |
-| 1.8674        | 1.6   | 600  | 3.5432          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the ag_news dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.3980
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.7271        | 0.32  | 200  | 3.6065          |
+| 3.346         | 0.64  | 400  | 3.4732          |
+| 3.0685        | 0.96  | 600  | 3.3985          |
+| 2.1435        | 1.28  | 800  | 3.4433          |
+| 1.9834        | 1.6   | 1000 | 3.4203          |
+| 1.8937        | 1.92  | 1200 | 3.3980          |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:94285f9a017e49da30ec21445159914a77ce15579335265211ddc6d79f107bf9
 size 497807197

 version https://git-lfs.github.com/spec/v1
+oid sha256:ebad7d4e7fd700be342c7db7e91c93b7e35af759d45a4945f85fab4206a1055a
 size 497807197