morenolq
/

distilgpt2-fables-demo

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Moreno La Quatra commited on Sep 18, 2022

Commit

94d162b

·

1 Parent(s): 16fd9d9

Update README.md

Files changed (1) hide show

README.md +13 -6

README.md CHANGED Viewed

@@ -2,9 +2,18 @@
 license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
 - name: distilgpt2-fables-demo
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -12,23 +21,21 @@ should probably proofread and complete it, then remove this comment. -->
 # distilgpt2-fables-demo
-This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 3.2165
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 license: apache-2.0
 tags:
 - generated_from_trainer
+- distilgpt2
+- text-generation
+- english
 model-index:
 - name: distilgpt2-fables-demo
   results: []
+pipeline:
+- text-generation
+widget:
+- text: Once upon a time,
+- text: There was a time when
+- text: Long time ago
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # distilgpt2-fables-demo
+This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on [demelin/understanding_fables](https://huggingface.co/datasets/demelin/understanding_fables) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 3.2165
 ## Model description
+The model is a demo for the fine-tuning of decoder-only models using `transformers` library.
 ## Intended uses & limitations
+It can be used mainly for prototyping and educational purposes.
 ## Training and evaluation data
+The [demelin/understanding_fables](https://huggingface.co/datasets/demelin/understanding_fables) dataset has been split into train/test/validation using an 80/10/10 random split (`random_seed = 42`). Google Colab has been used for model fine-tuning.
 ### Training hyperparameters