mariananeves committed on
Commit
d759f6b
1 Parent(s): 184bb9b

Update README.md

  1. README.md +11 -108
README.md CHANGED
@@ -8,37 +8,23 @@
8
 
9
  Model for text classification based on the [GoldHamster corpus](https://doi.org/10.5281/zenodo.7152295). [Source code]() is available.
10
 
11
-
12
- <!-- Provide a quick summary of what the model is/does. -->
13
-
14
- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
15
-
16
- ## Model Details
17
-
18
-
19
-
20
  ### Model Description
21
 
22
- <!-- Provide a longer summary of what this model is. -->
23
-
24
-
25
 
26
- - **Developed by:** [More Information Needed]
27
- - **Shared by [optional]:** [More Information Needed]
28
- - **Model type:** [More Information Needed]
29
  - **Language(s) (NLP):** English
30
- - **License:** [More Information Needed]
31
- - **Finetuned from model [optional]:** [More Information Needed]
32
 
33
  ### Model Sources [optional]
34
 
35
  - **Repository:** https://github.com/mariananeves/goldhamster
36
- - **Paper [optional]:** https://europepmc.org/article/ppr/ppr479254
37
-
38
 
39
  ## Uses
40
 
41
- Model for detecting our eight-label schema
 
42
 
43
  ### Direct Use
44
 
@@ -46,104 +32,20 @@ Model for detecting our eight-label schema
46
 
47
  [More Information Needed]
48
 
49
- ### Downstream Use [optional]
50
-
51
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
52
-
53
- [More Information Needed]
54
-
55
- ### Out-of-Scope Use
56
-
57
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
58
-
59
- [More Information Needed]
60
-
61
- ## Bias, Risks, and Limitations
62
-
63
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
64
-
65
- [More Information Needed]
66
-
67
- ### Recommendations
68
-
69
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
70
-
71
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
72
-
73
- ## How to Get Started with the Model
74
-
75
- Use the code below to get started with the model.
76
-
77
- [More Information Needed]
78
-
79
- ## Training Details
80
-
81
- ### Training Data
82
-
83
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
84
-
85
- [More Information Needed]
86
-
87
- ### Training Procedure
88
-
89
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
90
-
91
- #### Preprocessing [optional]
92
-
93
- [More Information Needed]
94
-
95
-
96
- #### Training Hyperparameters
97
-
98
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
99
-
100
- #### Speeds, Sizes, Times [optional]
101
-
102
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
103
-
104
- [More Information Needed]
105
-
106
- ## Evaluation
107
-
108
- <!-- This section describes the evaluation protocols and provides the results. -->
109
-
110
- ### Testing Data, Factors & Metrics
111
-
112
- #### Testing Data
113
-
114
- <!-- This should link to a Data Card if possible. -->
115
-
116
- [More Information Needed]
117
-
118
- #### Factors
119
-
120
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
121
-
122
- [More Information Needed]
123
-
124
- #### Metrics
125
-
126
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
127
-
128
- [More Information Needed]
129
 
130
  ### Results
131
 
 
 
132
 
133
  | invertebrates | in vivo | human | organs | primary cell lines | immortal cell lines | in silico | others |
134
  | ------------- | --------- | --------- | --------- | ------------------ | ------------------- | ----------- | --------- |
135
  | 0.95 | 0.88 | 0.86 | 0.82 | 0.75 | 0.83 | 0.75 | 0.78 |
136
 
137
 
 
138
 
139
-
140
- \others & 0.70 & \bf 0.78 & 0.67 & 0.76 \\
141
- \midrule
142
- All (average) & 0.79 & \bf 0.83 & 0.80 & 0.80 \\
143
-
144
-
145
- ## Citation [optional]
146
-
147
  @misc {PPR:PPR479254,
148
  Title = {Automatic classification of experimental models in biomedical literature to support searching for alternative methods to animal experiments},
149
  Author = {Neves, Mariana and Klippert, Antonina and Knöspel, Fanny and Rudeck, Juliane and Stolz, Ailine and Ban, Zsofia and Becker, Markus and Diederich, Kai and Grune, Barbara and Kahnau, Pia and Ohnesorge, Nils and Pucher, Johannes and Schönfelder, Gilbert and Bert, Bettina and Butzke, Daniel},
@@ -153,6 +55,7 @@ All (average) & 0.79 & \bf 0.83 & 0.80 & 0.80 \\
153
  Year = {2022},
154
  URL = {https://doi.org/10.21203/rs.3.rs-1526055/v1},
155
  }
 
156
 
157
  ## Contact
158
 
 
8
 
9
  Model for text classification based on the [GoldHamster corpus](https://doi.org/10.5281/zenodo.7152295). [Source code]() is available.
10
 
11
  ### Model Description
12
 
13
+ Model based on the pre-trained [PubMedBERT](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) and fine-tuned on the GoldHamster corpus.
 
 
14
 
 
 
 
15
  - **Language(s) (NLP):** English
16
+ - **License:** [CC BY 3.0 DE](https://creativecommons.org/licenses/by/3.0/de/deed.en)
17
+ - **Finetuned from model:** https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract
18
 
19
  ### Model Sources [optional]
20
 
21
  - **Repository:** https://github.com/mariananeves/goldhamster
22
+ - **Paper:** https://europepmc.org/article/ppr/ppr479254
 
23
 
24
  ## Uses
25
 
26
+ Model for classifying documents according to our eight-label schema: invertebrates, in vivo, human, organs, primary cell lines, immortal cell lines, in silico, others.
27
+ Predictions are on the document level.
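As a minimal sketch of what document-level prediction over the eight labels could look like, the Python snippet below turns one document's raw scores into label names. It assumes a multi-label setup with one sigmoid per label and a 0.5 threshold; the label order, the `decode_labels` helper, and the example scores are hypothetical and not taken from the released model.

```python
import math

# Label names come from the model card's eight-label schema.
# ASSUMPTION: this index order is illustrative; the released model's
# actual index-to-label mapping may differ.
LABELS = [
    "invertebrates", "in vivo", "human", "organs",
    "primary cell lines", "immortal cell lines", "in silico", "others",
]


def sigmoid(x: float) -> float:
    """Map a raw score (logit) to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_labels(logits, threshold=0.5):
    """Hypothetical helper: keep every label whose sigmoid score reaches the threshold.

    ASSUMPTION: labels are scored independently (multi-label), so a
    document can receive zero, one, or several labels.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} scores, got {len(logits)}")
    return [label for label, logit in zip(LABELS, logits) if sigmoid(logit) >= threshold]


# Made-up scores for a single document (one value per label).
example_logits = [-3.1, 2.4, -0.7, -2.2, -1.5, 1.1, -4.0, -2.8]
print(decode_labels(example_logits))  # ['in vivo', 'immortal cell lines']
```

In practice the scores would come from the fine-tuned classifier applied to a whole document; the threshold is a tunable choice, not a value stated in the card.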
28
 
29
  ### Direct Use
30
 
 
32
 
33
  [More Information Needed]
34
 
35
 
36
  ### Results
37
 
38
+ Results are reported as F-scores per label.
39
+
40
 
41
  | invertebrates | in vivo | human | organs | primary cell lines | immortal cell lines | in silico | others |
42
  | ------------- | --------- | --------- | --------- | ------------------ | ------------------- | ----------- | --------- |
43
  | 0.95 | 0.88 | 0.86 | 0.82 | 0.75 | 0.83 | 0.75 | 0.78 |
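For reference, the sketch below shows how an F-score can be computed from counts and how the per-label values in the table average out. It assumes the balanced F1 variant (the card only says f-score) and a plain unweighted mean over labels; the counts passed to `f_score` are made up for illustration.

```python
def f_score(tp: int, fp: int, fn: int) -> float:
    """Balanced F-score (F1) from true positives, false positives, false negatives.

    ASSUMPTION: the card does not say which F-score variant was used;
    F1 is shown here as the most common choice.
    """
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Per-label scores copied from the table above.
REPORTED = {
    "invertebrates": 0.95,
    "in vivo": 0.88,
    "human": 0.86,
    "organs": 0.82,
    "primary cell lines": 0.75,
    "immortal cell lines": 0.83,
    "in silico": 0.75,
    "others": 0.78,
}

# Unweighted (macro) mean over the eight labels.
macro_average = sum(REPORTED.values()) / len(REPORTED)

print(f"{f_score(tp=8, fp=2, fn=2):.2f}")  # hypothetical counts -> 0.80
print(f"{macro_average:.2f}")              # 0.83
```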
44
 
45
 
46
+ ## Citation
47
 
48
+ ```bibtex
 
 
 
 
 
 
 
49
  @misc {PPR:PPR479254,
50
  Title = {Automatic classification of experimental models in biomedical literature to support searching for alternative methods to animal experiments},
51
  Author = {Neves, Mariana and Klippert, Antonina and Knöspel, Fanny and Rudeck, Juliane and Stolz, Ailine and Ban, Zsofia and Becker, Markus and Diederich, Kai and Grune, Barbara and Kahnau, Pia and Ohnesorge, Nils and Pucher, Johannes and Schönfelder, Gilbert and Bert, Bettina and Butzke, Daniel},
 
55
  Year = {2022},
56
  URL = {https://doi.org/10.21203/rs.3.rs-1526055/v1},
57
  }
58
+ ```
59
 
60
  ## Contact
61