Commit f99da79 by gonzalez-agirre
Parent: ae01ef0
Update README.md
README.md
CHANGED
@@ -84,7 +84,7 @@ widget:
 </details>
 
 ## Model description
-The **roberta-base-bne-capitel-ner-plus** is Named Entity Recognition (NER) model for the Spanish language fine-tuned from the [roberta-base-bne](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne) model, a [RoBERTa](https://arxiv.org/abs/1907.11692) base model pre-trained using the largest Spanish corpus known to date, with a total of 570GB of clean and deduplicated text, processed for this work, compiled from the web crawlings performed by the [National Library of Spain (Biblioteca Nacional de España)](http://www.bne.es/en/Inicio/index.html) from 2009 to 2019. This model is a more robust version of the [roberta-base-bne-capitel-ner](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne-capitel-ner) model that recognizes better lowercased Named Entities (NE).
+The **roberta-base-bne-capitel-ner-plus** is a Named Entity Recognition (NER) model for the Spanish language fine-tuned from the [roberta-base-bne](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne) model, a [RoBERTa](https://arxiv.org/abs/1907.11692) base model pre-trained using the largest Spanish corpus known to date, with a total of 570GB of clean and deduplicated text, processed for this work, compiled from the web crawlings performed by the [National Library of Spain (Biblioteca Nacional de España)](http://www.bne.es/en/Inicio/index.html) from 2009 to 2019. This model is a more robust version of the [roberta-base-bne-capitel-ner](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne-capitel-ner) model that recognizes better lowercased Named Entities (NE).
 
 ## Intended uses and limitations
 
@@ -120,10 +120,10 @@ The model was trained with a batch size of 16 and a learning rate of 5e-5 for 5
 This model was finetuned maximizing F1 score.
 
 ## Evaluation results
-We evaluated the
+We evaluated the **roberta-base-bne-capitel-ner-plus** on the CAPITEL-NERC test set against standard multilingual and monolingual baselines:
 
 
-| Model |
+| Model | CAPITEL-NERC (F1) |
 | ------------|:----|
 | roberta-large-bne-capitel-ner | **90.51** |
 | roberta-base-bne-capitel-ner | 89.60|