thrunlab/pretraining_test


lukeleeai committed on Feb 13

Commit 1d72d97 • 1 Parent(s): acd8c6b

End of training

Files changed (2)

README.md +3 -3
model.safetensors +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the openwebtext dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.3901
+- Loss: 10.3700
 ## Model description
@@ -51,8 +51,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 10.3874       | 25.0  | 50   | 10.3906         |
-| 10.3862       | 50.0  | 100  | 10.3901         |
+| 10.368        | 25.0  | 50   | 10.3705         |
+| 10.3672       | 50.0  | 100  | 10.3700         |
 ### Framework versions
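
To put the reported loss values in context, the sketch below converts them to perplexity. It assumes the evaluation loss is a mean per-token cross-entropy in nats, which is the usual convention for a causal language model but is not stated in the README. The function name `perplexity` and the dictionary keys are illustrative only.

```python
import math

def perplexity(mean_cross_entropy_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy_nats)

# Final evaluation losses taken from the README diff above.
# Assumption: both are mean per-token cross-entropy values in nats.
losses = {"parent acd8c6b": 10.3901, "commit 1d72d97": 10.3700}

for label, loss in losses.items():
    print(f"{label}: loss={loss:.4f}, perplexity={perplexity(loss):,.0f}")
```

Under that assumption, a lower loss gives a lower perplexity, so the two commits can be compared on either scale.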

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3f5732eee22e1b9720b548245467e7d9e2c0996e648689264a3e55bb154b99e1
+oid sha256:79a4ce9173dec4d43dfc73aefccec8439d22275a80d72a5420f0646852273f35
 size 16567728
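
The `model.safetensors` entry is a Git LFS pointer: `oid` is the SHA-256 digest of the real file's contents and `size` is its length in bytes. A minimal sketch of checking a downloaded file against the new pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` and the local path in the usage note are hypothetical, not part of any library.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:79a4ce9173dec4d43dfc73aefccec8439d22275a80d72a5420f0646852273f35
size 16567728
"""

def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Hash in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("model.safetensors", parse_lfs_pointer(NEW_POINTER))` would confirm that a local copy (the path here is an assumption) corresponds to commit 1d72d97 rather than its parent. Because `size` is the same in both pointers, only the digest distinguishes the two versions.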