marcosgg commited on
Commit
bb6db5a
1 Parent(s): 3b2108f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -0
README.md CHANGED
@@ -1,3 +1,23 @@
1
  ---
2
  license: gpl-3.0
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: gpl-3.0
3
+
4
+ language:
5
+ - pt
6
+ - gl
7
+
8
+ widget:
9
+
10
+ - text: "A minha amiga Rosa, de São Paulo, estudou en Montreal. Agora trabalha em Santiago de Compostela com o Mário."
11
+
12
  ---
13
+
14
+ # Named Entity Recognition (NER) model for Portuguese
15
+
16
+ This is a NER model for Portuguese which uses the standard `enamex' classes: LOC (geographical locations); PER (people); ORG (organizations); MISC (other entities).
17
+
18
+ The model is based on [BERTimbau Base](https://huggingface.co/neuralmind/bert-base-portuguese-cased), which has been fine-tuned using a combination of available corpus (see [1] for details).
19
+
20
+ It was trained with a batch size of 8 and a learning rate of 2e-5 during 3 epochs. It achieved the following results on the test set (Precision/Recall/F1): 0.913/0.918/0.915.
21
+
22
+ [1] Pablo Gamallo, Marcos Garcia & Patricia Martín-Rodilla, 2019. [NER and open information extraction for Portuguese notebook for IberLEF 2019 Portuguese named entity recognition and relation extraction tasks](https://ceur-ws.org/Vol-2421/NER_Portuguese_paper_6.pdf). In _Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2019)
23
+ co-located with 35th Conference of the Spanish Society for Natural Language Processing (SEPLN 2019)_: 457-467.