StivenLancheros
commited on
Commit
•
d7dcf9c
1
Parent(s):
36748bf
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
# biobert-base-cased-v1.2-finetuned-ner-CRAFT_Augmented_ES
|
18 |
|
19 |
-
This model is a fine-tuned version of [dmis-lab/biobert-base-cased-v1.2](https://huggingface.co/dmis-lab/biobert-base-cased-v1.2) on the
|
20 |
It achieves the following results on the evaluation set:
|
21 |
- Loss: 0.2251
|
22 |
- Precision: 0.8276
|
@@ -26,7 +26,10 @@ It achieves the following results on the evaluation set:
|
|
26 |
|
27 |
## Model description
|
28 |
|
29 |
-
|
|
|
|
|
|
|
30 |
|
31 |
## Intended uses & limitations
|
32 |
|
|
|
16 |
|
17 |
# biobert-base-cased-v1.2-finetuned-ner-CRAFT_Augmented_ES
|
18 |
|
19 |
+
This model is a fine-tuned version of [dmis-lab/biobert-base-cased-v1.2](https://huggingface.co/dmis-lab/biobert-base-cased-v1.2) on the CRAFT dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
- Loss: 0.2251
|
22 |
- Precision: 0.8276
|
|
|
26 |
|
27 |
## Model description
|
28 |
|
29 |
+
This model performs Named Entity Recognition for 6 entity tags: Sequence, Cell, Protein, Gene, Taxon, and Chemical from the CRAFT(Colorado Richly Annotated Full Text) Corpus in Spanish (MT translated) and English. Entity tags have been normalized and replaced from the original three letter code to a full name e.g. B-Protein, I-Chemical.
|
30 |
+
|
31 |
+
This model is trained on augmented data created using Entity Replacement. 20% of the entities were replaced using a list of entities for each entity tag obtained from the official ontologies for each entity class. Three datasets (original, augmented, MT translated CRAFT) were concatenated.
|
32 |
+
|
33 |
|
34 |
## Intended uses & limitations
|
35 |
|