Ransaka
/

sinhala-gpt2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Ransaka commited on Mar 26, 2023

Commit

25ab3aa

•

1 Parent(s): 48c869a

Update README.md

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -31,7 +31,7 @@ Even though this version of GPT-2 has been finely tuned and is quite simple, it
 ⚠️ Since the dataset used for this model is mostly composed of news articles, it is heavily biased towards generating news content. This bias may become apparent during the generation process.
 ## Training procedure
-The model was trained for approximately 12+ hours on Kaggle GPUs.
 ## Usage Details
@@ -63,9 +63,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 2.3015        | 1.0   | 15323 | 2.3498             |
-| 1.8582        | 2.0   | 30646 | 1.9921             |
-| 1.5491        | 3.0   | 45969 | 1.9376             |
 ### Framework versions

 ⚠️ Since the dataset used for this model is mostly composed of news articles, it is heavily biased towards generating news content. This bias may become apparent during the generation process.
 ## Training procedure
+The model was trained for 12+ hours on Kaggle GPUs.
 ## Usage Details
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 2.0233        | 1.0   | 15323 | 2.3348             |
+| 1.6938        | 2.0   | 30646 | 1.8377             |
+| 1.4938        | 3.0   | 45969 | 1.6498             |
 ### Framework versions