bigmorning
/

try-m

Text Generation

generated_from_keras_callback

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

bigmorning commited on Mar 24, 2022

Commit

b36e2f7

•

1 Parent(s): 1d58b15

add model

Files changed (3) hide show

README.md +3 -3
config.json +1 -1
tf_model.h5 +2 -2

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.2158
 - Epoch: 1
 ## Model description
@@ -41,8 +41,8 @@ The following hyperparameters were used during training:
 | Train Loss | Epoch |
 |:----------:|:-----:|
-| 0.5434     | 0     |
-| 0.2158     | 1     |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 5.5751
 - Epoch: 1
 ## Model description
 | Train Loss | Epoch |
 |:----------:|:-----:|
+| 6.0979     | 0     |
+| 5.5751     | 1     |
 ### Framework versions

config.json CHANGED Viewed

@@ -41,5 +41,5 @@
   },
   "transformers_version": "4.17.0",
   "use_cache": false,
-  "vocab_size": 5998
 }

   },
   "transformers_version": "4.17.0",
   "use_cache": false,
+  "vocab_size": 50257
 }

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:884d2b7ede8ea69141e3f9486f0cc1d2ab1ba9eff4650d9a2a585a267c338d31
-size 210211336

 version https://git-lfs.github.com/spec/v1
+oid sha256:aedebfa3bfb10faca2e81c7e205208ab122a761d79024cc8b52e755236057d83
+size 327745496