oracat/bert-paper-classifier

Text Classification

Generated from Trainer


oracat committed on Apr 17, 2023

Commit 5eed682 • 1 Parent(s): d59d395

Update README.md

Files changed (1)

README.md +5 -25

README.md CHANGED

@@ -9,27 +9,15 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # bert-paper-classifier
-This model is a fine-tuned version of [microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.1069
-- Accuracy: 0.6475
-## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -37,25 +25,17 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 256
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 1.2101        | 1.0   | 704  | 1.1445          | 0.6359   |
-| 1.01          | 2.0   | 1408 | 1.1027          | 0.6472   |
-| 0.8619        | 3.0   | 2112 | 1.1069          | 0.6475   |
 ### Framework versions
 - Transformers 4.28.1
 - Pytorch 2.0.0+cu117
 - Datasets 2.11.0
-- Tokenizers 0.13.3

   results: []
 ---
 # bert-paper-classifier
+This model is a fine-tuned version of [microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) on the dataset from [González-Márquez et al., 2023](https://www.biorxiv.org/content/10.1101/2023.04.10.536208v1).
 ## Intended uses & limitations
+This model is intended to predict the category of a biomedical paper given its title (and optionally its abstract). For example, it is likely to predict `virology` as the category for a paper whose title contains `COVID-19`.
+So far, only a subset of the PubMed dataset has been used for training. Future improvements could come from fine-tuning on the full dataset with a combination of titles and abstracts, and from extending the training set to preprints from bioRxiv and/or arXiv.
 ## Training procedure
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 128
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Framework versions
 - Transformers 4.28.1
 - Pytorch 2.0.0+cu117
 - Datasets 2.11.0
+- Tokenizers 0.13.3
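
The intended use described in the updated README (predicting a category from a paper title, optionally with its abstract) might look roughly like the sketch below. It is not part of the commit. It assumes the model loads through the standard `transformers` text-classification pipeline under the repository ID `oracat/bert-paper-classifier`. The helper names `build_input` and `classify` are hypothetical, and joining title and abstract with a single space is an assumption, since the README does not specify the input format.

```python
from typing import Optional

# Repository ID as shown on this commit page.
MODEL_ID = "oracat/bert-paper-classifier"


def build_input(title: str, abstract: Optional[str] = None) -> str:
    """Combine a title and an optional abstract into one input string.

    Assumption: the two parts are joined with a single space. The README
    does not state how the model expects them to be combined.
    """
    title = title.strip()
    if abstract and abstract.strip():
        return f"{title} {abstract.strip()}"
    return title


def classify(title: str, abstract: Optional[str] = None, top_k: int = 3):
    """Return the top_k predicted categories with their scores.

    Downloads the model on first use, so it needs network access.
    """
    # Imported lazily so build_input can be used without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=MODEL_ID)
    # truncation=True guards against inputs longer than the model's limit.
    return classifier(build_input(title, abstract), top_k=top_k, truncation=True)
```

Calling `classify("Transmission dynamics of COVID-19 in households")` would return a list of label and score dictionaries. The README suggests a label such as `virology` is likely for a title like this, but the actual label set and scores depend on the published checkpoint.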