Update README.md
Browse files
README.md
CHANGED
@@ -58,7 +58,8 @@ This will return a list of recognized tokens marked with label 'INSTRUCTION'.
|
|
58 |
|
59 |
## Training
|
60 |
|
61 |
-
|
|
|
62 |
|
63 |
## Evaluation
|
64 |
|
|
|
58 |
|
59 |
## Training
|
60 |
|
61 |
+
It's based on the transformer architecture and specifically uses the [xlm-roberta-base-uk](https://huggingface.co/ukr-models/xlm-roberta-base-uk) model from `ukr-models`, fine-tuned for the token classification task. The training data was carefully chosen to include a balanced distribution of titles containing instructions and those not containing instructions.
|
62 |
+
The dataset contains newspaper titles (~3k titles), with tokens representing instructions manually labeled.
|
63 |
|
64 |
## Evaluation
|
65 |
|