BenasSabalys committed
Commit ba2084d • Parent(s): b0e2b86
Update README.md

README.md CHANGED

## Model description

This is a GPT-2 model trained on 142,612 Lithuanian Wikipedia articles plus 11,405 articles taken from the delfi.lt, ve.lt, and www.respublika.lt news portals.
## Intended uses & limitations

This is a model I trained while writing my bachelor's thesis. You can use it anywhere you want.
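
For a quick test, the model can also be wrapped in a `transformers` text-generation pipeline. This is a minimal sketch added for illustration; the model id is left as a placeholder for this repository's id, which the card itself does not spell out:

```python
from transformers import pipeline

# Placeholder: replace '...' with this repository's model id (or a local path)
generator = pipeline('text-generation', model='...', framework='tf')

print(generator("Siekdamas", max_length=50)[0]['generated_text'])
```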

## Training procedure

Will be updated.

### Training hyperparameters

The following hyperparameters were used during training:

- optimizer: None
### Training results

The model reached 36.83% accuracy on the training data and 37.02% on the validation data.
### Framework versions

- TensorFlow 2.4.1
- Tokenizers 0.12.1
- Torch 1.4.0

## How to use it

```python
from transformers import GPT2Tokenizer, TFGPT2LMHeadModel

# Path to a local checkpoint, or the model id of this page on the Hub
output_dir = '...'

tokenizer = GPT2Tokenizer.from_pretrained(output_dir)
model = TFGPT2LMHeadModel.from_pretrained(output_dir)

text = "Siekdamas"

# Encode the input text as TensorFlow tensors
input_ids = tokenizer.encode(text, return_tensors='tf')

# Generate continuations with beam search
beam_outputs = model.generate(
    input_ids,
    max_length=150,
    num_beams=5,
    temperature=0.7,
    no_repeat_ngram_size=2,
    num_return_sequences=5
)

# Print the highest-scoring beam
print(tokenizer.decode(beam_outputs[0]))
```
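
With `num_return_sequences=5`, `generate` returns five beams, but only the first is printed above. As a small addition beyond the original card, the sketch below decodes every beam and also shows a sampling variant; note that `temperature` only takes effect when `do_sample=True`, so it has no effect on the pure beam search above.

```python
# Decode and print all five returned beams
for i, beam in enumerate(beam_outputs):
    print(f"{i}: {tokenizer.decode(beam, skip_special_tokens=True)}")

# Sampling variant: temperature (and top-k/top-p) apply only when do_sample=True
sample_outputs = model.generate(
    input_ids,
    do_sample=True,
    max_length=150,
    top_k=50,
    top_p=0.95,
    temperature=0.7
)
print(tokenizer.decode(sample_outputs[0], skip_special_tokens=True))
```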