egosumkira
/

gpt2-fantasy

Text Generation

Inference Endpoints

Model card Files Files and versions Community

egosumkira commited on Sep 14, 2023

Commit

b150e52

•

1 Parent(s): b1b4fde

Update README.md

Files changed (1) hide show

README.md +14 -15

README.md CHANGED Viewed

@@ -1,11 +1,14 @@
 ---
 license: mit
-tags:
-- generated_from_keras_callback
 base_model: gpt2
 model-index:
 - name: gpt2-fantasy
   results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
@@ -13,29 +16,25 @@ probably proofread and complete it, then remove this comment. -->
 # gpt2-fantasy
-This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
-It achieves the following results on the evaluation set:
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: None
-- training_precision: float32
 ### Training results
@@ -45,4 +44,4 @@ The following hyperparameters were used during training:
 - Transformers 4.29.2
 - TensorFlow 2.12.0
-- Tokenizers 0.13.3

 ---
 license: mit
 base_model: gpt2
 model-index:
 - name: gpt2-fantasy
   results: []
+language:
+- en
+metrics:
+- accuracy
+library_name: transformers
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 # gpt2-fantasy
+This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on IMDB fantasy synopsis dataset.
 ## Model description
+This model was fine-tuned with intention of generating short fantasy stories based on given keywords.
+## Training data
+Training data was parsed from IMDB website and consists of keywords-synopsis pairs. Method of encoding data was inspired from [this repo](https://github.com/minimaxir/gpt-2-keyword-generation)
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: Adam
+- dropout: 0.2
+- learning schedule: exponential decay
+- epochs: 4
 ### Training results
 - Transformers 4.29.2
 - TensorFlow 2.12.0
+- Tokenizers 0.13.3