This is CovidBERT, the model trained by DeepSet on AllenAI's CORD-19 dataset of scientific articles about coronaviruses.
The model uses the original BERT wordpiece vocabulary and was subsequently fine-tuned on the SNLI and MultiNLI datasets using the
sentence-transformers library to produce universal sentence embeddings, with average pooling and a softmax loss.
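To make the average pooling step concrete, here is a minimal sketch in plain Python. It is not the sentence-transformers implementation: the function name `mean_pool` and the toy 2-dimensional token vectors are assumptions made for illustration. It only shows the idea of averaging token embeddings while skipping padding positions.

```python
from typing import List


def mean_pool(token_embeddings: List[List[float]], attention_mask: List[int]) -> List[float]:
    """Average the token vectors whose mask entry is 1, ignoring padding."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    if count == 0:
        raise ValueError("attention mask selects no tokens")
    return [value / count for value in total]


# Toy input (hypothetical values): three real tokens followed by one padding position.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [9.0, 9.0]]
mask = [1, 1, 1, 0]

sentence_embedding = mean_pool(tokens, mask)
print(sentence_embedding)  # [3.0, 4.0]
```

The padding vector `[9.0, 9.0]` does not affect the result because its mask entry is 0. In a real model the token vectors would come from the last BERT layer and have several hundred dimensions.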
Parameter details for the original training on CORD-19 are available on DeepSet's MLflow.
deepset/covid_bert_base from HuggingFace's
Training time: ~6 hours on the NVIDIA Tesla P100 GPU provided in Kaggle Notebooks.
Max. Seq. Length: 128
Performance: The model was evaluated on the test portion of the STS dataset using Spearman rank correlation, and its score was compared against similar models trained with the same procedure.
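As an illustration of the metric, the sketch below computes Spearman rank correlation between gold similarity scores and model-predicted similarities. The scores are made-up toy values, not results from this model, and the helper names (`average_ranks`, `pearson`, `spearman`) are hypothetical. In practice a library routine such as `scipy.stats.spearmanr` would normally be used instead.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Assign 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of 1-based positions start+1..end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation is the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data (hypothetical): gold STS-style scores and predicted cosine similarities.
gold = [0.0, 1.5, 3.0, 4.2, 5.0]
predicted = [0.10, 0.35, 0.30, 0.80, 0.95]

rho = spearman(gold, predicted)
print(round(rho, 4))  # 0.9
```

Only the ordering of the predictions matters here: swapping one adjacent pair out of five lowers the correlation from 1.0 to 0.9, regardless of the actual similarity values.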
An example usage for similarity-based scientific paper retrieval is provided in the Covid-19 Semantic Browser repository.
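The general idea behind similarity-based retrieval can be sketched as follows. This is not the code from that repository: the paper identifiers, the 3-dimensional vectors, and the function names `cosine` and `rank_papers` are all assumptions for illustration. With the real model, the vectors would be sentence embeddings of the query and of each paper's title or abstract.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_papers(
    query_vec: Sequence[float],
    paper_vecs: Dict[str, Sequence[float]],
    top_k: int,
) -> List[Tuple[str, float]]:
    """Return the top_k papers sorted by descending cosine similarity to the query."""
    scored = [(title, cosine(query_vec, vec)) for title, vec in paper_vecs.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy embeddings (hypothetical): real ones would come from the sentence encoder.
papers = {
    "paper_a": [0.9, 0.1, 0.0],
    "paper_b": [0.1, 0.9, 0.2],
    "paper_c": [0.7, 0.3, 0.1],
}
query = [1.0, 0.0, 0.0]

results = rank_papers(query, papers, top_k=2)
for title, score in results:
    print(f"{title}: {score:.3f}")
```

For a large corpus, the paper embeddings would typically be computed once and stored, so that only the query needs to be encoded at search time.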
N. Reimers and I. Gurevych, Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks