tartuNLP
/

gpt-for-est-large

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mphi commited on Dec 10, 2021

Commit

0c5560b

•

1 Parent(s): a9da74e

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -10,6 +10,8 @@ model-index:
 A GPT model for Estonian (large-size), trained from scratch on 2.2 billion words (Estonian National Corpus + News Crawl + Common Crawl). Currently trained for 1 epoch (but already better than gpt-4-est-base :-) to be updated)
 ### Format
 For training data was prepended with a text domain tag, and it should be added as prefix when using the model: >general<, >web<, >news<, >doaj< and >wiki< (standing for general texts, web crawled texts, news, article abstracts and wikipedia texts). Use the prefixes like this, e.g: ">web< Kas tead, et".

 A GPT model for Estonian (large-size), trained from scratch on 2.2 billion words (Estonian National Corpus + News Crawl + Common Crawl). Currently trained for 1 epoch (but already better than gpt-4-est-base :-) to be updated)
+[Colab demo](https://colab.research.google.com/drive/1Bp7mGEQ1vmyqXPyXHV1yj68cRZEi2mq4?usp=sharing)
 ### Format
 For training data was prepended with a text domain tag, and it should be added as prefix when using the model: >general<, >web<, >news<, >doaj< and >wiki< (standing for general texts, web crawled texts, news, article abstracts and wikipedia texts). Use the prefixes like this, e.g: ">web< Kas tead, et".