PlanTL-GOB-ES
/

roberta-base-bne-capitel-ner-plus

Token Classification

national library of spain

Inference Endpoints

Model card Files Files and versions Community

gonzalez-agirre commited on Nov 30, 2022

Commit

f99da79

·

1 Parent(s): ae01ef0

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -84,7 +84,7 @@ widget:
 </details>
 ## Model description
-The **roberta-base-bne-capitel-ner-plus** is Named Entity Recognition (NER) model for the Spanish language fine-tuned from the [roberta-base-bne](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne) model, a [RoBERTa](https://arxiv.org/abs/1907.11692) base model pre-trained using the largest Spanish corpus known to date, with a total of 570GB of clean and deduplicated text, processed for this work, compiled from the web crawlings performed by the [National Library of Spain (Biblioteca Nacional de España)](http://www.bne.es/en/Inicio/index.html) from 2009 to 2019. This model is a more robust version of the [roberta-base-bne-capitel-ner](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne-capitel-ner) model that recognizes better lowercased Named Entities (NE).
 ## Intended uses and limitations
@@ -120,10 +120,10 @@ The model was trained with a batch size of 16 and a learning rate of 5e-5 for 5
 This model was finetuned maximizing F1 score.
 ## Evaluation results
-We evaluated the *roberta-base-bne-capitel-ner-plus** on the CAPITEL-NERC test set against standard multilingual and monolingual baselines:
-| Model        | XNLI (Accuracy) |
 | ------------|:----|
 | roberta-large-bne-capitel-ner | **90.51** |
 | roberta-base-bne-capitel-ner | 89.60|

 </details>
 ## Model description
+The **roberta-base-bne-capitel-ner-plus** is a Named Entity Recognition (NER) model for the Spanish language fine-tuned from the [roberta-base-bne](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne) model, a [RoBERTa](https://arxiv.org/abs/1907.11692) base model pre-trained using the largest Spanish corpus known to date, with a total of 570GB of clean and deduplicated text, processed for this work, compiled from the web crawlings performed by the [National Library of Spain (Biblioteca Nacional de España)](http://www.bne.es/en/Inicio/index.html) from 2009 to 2019. This model is a more robust version of the [roberta-base-bne-capitel-ner](https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne-capitel-ner) model that recognizes better lowercased Named Entities (NE).
 ## Intended uses and limitations
 This model was finetuned maximizing F1 score.
 ## Evaluation results
+We evaluated the **roberta-base-bne-capitel-ner-plus** on the CAPITEL-NERC test set against standard multilingual and monolingual baselines:
+| Model        | CAPITEL-NERC (F1) |
 | ------------|:----|
 | roberta-large-bne-capitel-ner | **90.51** |
 | roberta-base-bne-capitel-ner | 89.60|