Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (arXiv:1810.04805)
This is a 22.7M-parameter BERT encoder-only model trained on Wikipedia articles labeled as sports-related.
This domain-specialized small model often performs as well as models 10-100x larger. It demonstrates that a model narrowed to a small domain needs fewer parameters overall than a model generalized for all problems.
sportsbert-small can be loaded using Hugging Face Transformers as follows.
from transformers import AutoModel
model = AutoModel.from_pretrained("neuml/sportsbert-small")
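Once loaded, the encoder returns one hidden vector per token. The sketch below shows one way to turn those into a single vector per text by mean pooling over non-padding tokens. The helper names `mean_pool` and `embed` are hypothetical, and mean pooling is an assumption made for illustration, not something the model card prescribes. Since the model is meant to be fine-tuned, vectors from the raw checkpoint should be treated as a starting point only.

```python
import torch


def mean_pool(last_hidden_state: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average token vectors per sequence, ignoring padding positions."""
    # (batch, seq_len) -> (batch, seq_len, 1) so it broadcasts over the hidden dimension
    mask = attention_mask.unsqueeze(-1).to(last_hidden_state.dtype)
    summed = (last_hidden_state * mask).sum(dim=1)
    counts = mask.sum(dim=1).clamp(min=1e-9)  # avoid division by zero
    return summed / counts


def embed(texts, model_name="neuml/sportsbert-small"):
    """Hypothetical helper: tokenize texts, run the encoder, return pooled vectors."""
    # Imported lazily so mean_pool can be used without transformers installed
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    return mean_pool(outputs.last_hidden_state, inputs["attention_mask"])
```

Calling `embed(["The striker scored twice in the final."])` downloads the checkpoint on first use and returns a tensor with one row per input text.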
The model is intended to be fine-tuned further for a specific task, such as text classification, entity extraction, or sentence embeddings.
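As a rough illustration of the fine-tuning path, the sketch below attaches a classification head with `AutoModelForSequenceClassification` and runs a single gradient step. The helper names `load_classifier` and `training_step`, the number of labels, and the optimizer settings are all assumptions; a real setup would add a dataset, batching, evaluation, and more than one step.

```python
import torch


def load_classifier(num_labels, model_name="neuml/sportsbert-small"):
    """Hypothetical helper: load the encoder with a new, untrained classification head."""
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name, num_labels=num_labels
    )
    return tokenizer, model


def training_step(model, tokenizer, texts, labels, optimizer):
    """Run one gradient step on a small batch and return the loss as a float."""
    model.train()
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    # Passing labels makes the model compute the loss itself
    outputs = model(**batch, labels=torch.tensor(labels))
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return outputs.loss.item()
```

A possible usage, with made-up labels (0 = football, 1 = baseball): load with `tokenizer, model = load_classifier(num_labels=2)`, create `torch.optim.AdamW(model.parameters(), lr=5e-5)`, then call `training_step(model, tokenizer, ["The striker scored twice.", "The pitcher threw a no-hitter."], [0, 1], optimizer)`.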
Read more about the model in this article.