Update README.md
README.md
@@ -14,7 +14,7 @@ widget:
 </p>
 
 ## Pre-training corpora
-The bert-base-cypriot-uncased-v1 pre-training corpora consists of 133 documents sourced from Cypriot TV scripts and writings by Cypriot authors (0.
+The bert-base-cypriot-uncased-v1 pre-training corpora consists of 133 documents sourced from Cypriot TV scripts and writings by Cypriot authors (7MB or 0.07GB of data in total).
 
 ## Pre-training details
 * We trained BERT using our own established framework