Merge branch 'main' of https://huggingface.co/kravchenko/uk-t5-base
Browse files
README.md
CHANGED
@@ -5,13 +5,11 @@ tags:
|
|
5 |
- t5
|
6 |
---
|
7 |
|
8 |
-
The aim is to compress the mT5-base model to leave only the Ukrainian language.
|
9 |
|
10 |
-
Reproduced the similar result but with
|
11 |
|
12 |
Results:
|
13 |
- 582M params -> 244M params
|
14 |
- 250K tokens -> 30K tokens
|
15 |
-
- 2.2GB size model -> 0.95GB size model
|
16 |
-
|
17 |
-
The vocabulary consists of 20K Ukrainian tokens and around 10K of English + most used + special tokens the T5 model uses.
|
|
|
5 |
- t5
|
6 |
---
|
7 |
|
8 |
+
The aim is to compress the mT5-base model to leave only the Ukrainian language and some basic English.
|
9 |
|
10 |
+
Reproduced the similar result (but with another language) from [this](https://towardsdatascience.com/how-to-adapt-a-multilingual-t5-model-for-a-single-language-b9f94f3d9c90) medium article.
|
11 |
|
12 |
Results:
|
13 |
- 582M params -> 244M params
|
14 |
- 250K tokens -> 30K tokens
|
15 |
+
- 2.2GB size model -> 0.95GB size model
|
|
|
|