richardjonker2000 commited on
Commit
f1ce66f
1 Parent(s): ff08be7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -1
README.md CHANGED
@@ -6,4 +6,69 @@ language:
6
  - en
7
  metrics:
8
  - f1
9
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  - en
7
  metrics:
8
  - f1
9
+ ---
10
+
11
+
12
+ # Model Card for BioNExt
13
+
14
+ BioNExt, is an end-to-end Biomedical Relation Extraction and Classifcation system. The work utilized three modules, a Tagger (Named Entity Recognition), Linker (Entity Linking) and an Extractor (Relation Extraction and Classification).
15
+
16
+ This repositories contains two models:
17
+
18
+ 1. Tagger: Named Entity Recognition module, which performs 6 class biomedical NER: **Genes, Diseases, Chemicals, Variants (mutations), Species, and Cell Lines**.
19
+ 2. Extractor: Performs Relation Extraction and classification. The classes for the relation Extraction are: **Positive Correlation, Negative Correlation, Association, Binding, Drug Interaction, Cotreatment, Comparison, and Conversion.**
20
+
21
+ For a full description on how to utilize our end-to-end pipeline we point you towards our [GitHub](https://github.com/ieeta-pt/BioNExt) repository.
22
+
23
+ ## Model Details
24
+
25
+ ### Model Description
26
+
27
+ - **Developed by:** IEETA
28
+ - **Model type:** BERT Base
29
+ - **Language(s) (NLP):** English
30
+ - **License:** MIT
31
+ - **Finetuned from model:** BioLinkBERT-Large
32
+
33
+ ### Model Sources
34
+
35
+ - **Repository:** [IEETA BioNExt GitHub](https://github.com/ieeta-pt/BioNExt)
36
+ - **Paper:** Towards Discovery: An End-to-End System for Uncovering Novel Biomedical Relations [Awaiting Publication]
37
+
38
+ **Authors:**
39
+ - Tiago Almeida ([ORCID: 0000-0002-4258-3350](https://orcid.org/0000-0002-4258-3350))
40
+ - Richard A A Jonker ([ORCID: 0000-0002-3806-6940](https://orcid.org/0000-0002-3806-6940))
41
+ - Rui Antunes ([ORCID: 0000-0003-3533-8872](https://orcid.org/0000-0003-3533-8872))
42
+ - João R Almeida ([ORCID: 0000-0003-0729-2264](https://orcid.org/0000-0003-0729-2264))
43
+ - Sérgio Matos ([ORCID: 0000-0003-1941-3983](https://orcid.org/0000-0003-1941-3983))
44
+
45
+
46
+ ## Uses
47
+
48
+ Note we do not take any liability for the use of the model in any professional/medical domain. The model is intended for academic purposes only.
49
+
50
+ ## How to Get Started with the Model
51
+
52
+ Please refer to our GitHub repository for more information on our end-to-end inference pipeline: [IEETA BioNExt GitHub](https://github.com/ieeta-pt/BioNExt)
53
+
54
+ ## Training Details
55
+
56
+ ### Training Data
57
+
58
+ The training data utilized was the BioRED corpus, wihtin the scope of the BioCreative-VIII challenge.
59
+
60
+ Ling Luo, Po-Ting Lai, Chih-Hsuan Wei, Cecilia N Arighi, Zhiyong Lu, BioRED: a rich biomedical relation extraction dataset, Briefings in Bioinformatics, Volume 23, Issue 5, September 2022, bbac282, https://doi.org/10.1093/bib/bbac282
61
+
62
+
63
+ ### Results
64
+
65
+ As evaluated as an end to end system, our results are as follows:
66
+ - **Tagger**: 43.10
67
+ - **Linker**: 32.46
68
+ - **Extractor**: 24.59
69
+
70
+ ## Citation
71
+
72
+ **BibTeX:**
73
+
74
+ [Awaiting Publication]