Sasha commited on
Commit
5e9f2ac
2 Parent(s): 68b81ec a120b1d

Merge branch 'main' of https://huggingface.co/kravchenko/uk-t5-base

Browse files
Files changed (1) hide show
  1. README.md +3 -5
README.md CHANGED
@@ -5,13 +5,11 @@ tags:
5
  - t5
6
  ---
7
 
8
- The aim is to compress the mT5-base model to leave only the Ukrainian language.
9
 
10
- Reproduced the similar result but with other language from [this](https://towardsdatascience.com/how-to-adapt-a-multilingual-t5-model-for-a-single-language-b9f94f3d9c90) medium article.
11
 
12
  Results:
13
  - 582M params -> 244M params
14
  - 250K tokens -> 30K tokens
15
- - 2.2GB size model -> 0.95GB size model
16
-
17
- The vocabulary consists of 20K Ukrainian tokens and around 10K of English + most used + special tokens the T5 model uses.
 
5
  - t5
6
  ---
7
 
8
+ The aim is to compress the mT5-base model to leave only the Ukrainian language and some basic English.
9
 
10
+ Reproduced the similar result (but with another language) from [this](https://towardsdatascience.com/how-to-adapt-a-multilingual-t5-model-for-a-single-language-b9f94f3d9c90) medium article.
11
 
12
  Results:
13
  - 582M params -> 244M params
14
  - 250K tokens -> 30K tokens
15
+ - 2.2GB size model -> 0.95GB size model