Training in progress epoch 0

Files changed (5) hide show

README.md CHANGED Viewed

@@ -3,20 +3,20 @@ license: mit
 tags:
 - generated_from_keras_callback
 model-index:
-- name: textgenerator
   results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
-# textgenerator
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 8.2564
-- Validation Loss: 7.8994
-- Epoch: 2
 ## Model description
@@ -42,14 +42,12 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 10.2783    | 9.6364          | 0     |
-| 9.2278     | 8.8362          | 1     |
-| 8.2564     | 7.8994          | 2     |
 ### Framework versions
-- Transformers 4.20.1
 - TensorFlow 2.8.2
 - Datasets 2.4.0
 - Tokenizers 0.12.1

 tags:
 - generated_from_keras_callback
 model-index:
+- name: srivatsavaasista/textgenerator
   results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
+# srivatsavaasista/textgenerator
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 10.2937
+- Validation Loss: 9.6742
+- Epoch: 0
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 10.2937    | 9.6742          | 0     |
 ### Framework versions
+- Transformers 4.21.0
 - TensorFlow 2.8.2
 - Datasets 2.4.0
 - Tokenizers 0.12.1

config.json CHANGED Viewed

@@ -32,7 +32,7 @@
       "max_length": 50
     }
   },
-  "transformers_version": "4.20.1",
   "use_cache": true,
   "vocab_size": 52000
 }

       "max_length": 50
     }
   },
+  "transformers_version": "4.21.0",
   "use_cache": true,
   "vocab_size": 52000
 }

special_tokens_map.json CHANGED Viewed

@@ -1,5 +1,6 @@
 {
   "bos_token": "<|endoftext|>",
   "eos_token": "<|endoftext|>",
   "unk_token": "<|endoftext|>"
 }

 {
   "bos_token": "<|endoftext|>",
   "eos_token": "<|endoftext|>",
+  "pad_token": "<|endoftext|>",
   "unk_token": "<|endoftext|>"
 }

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:06638ad1317f3ba8d89a0eed45896deefbe175d2ee1e7f61ea63d7f4b6ab1be0
 size 503289960

 version https://git-lfs.github.com/spec/v1
+oid sha256:acaf5a369affcf3a7511b84c08305b00c94a2c1e1203d0d5726938c36eafccc6
 size 503289960

tokenizer.json CHANGED Viewed

@@ -1,6 +1,11 @@
 {
   "version": "1.0",
-  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 40,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
   "padding": null,
   "added_tokens": [
     {