Update README.md
README.md
---
language: "en"
tags:
- twitter
- masked-token-prediction
- election2020
- politics
license: "gpl-3.0"
---

# Pre-trained BERT on Twitter US Political Election 2020

Pre-trained weights for [PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter](XXX), LREC 2022.

The model is initialized with the weights of [BERTweet](https://huggingface.co/vinai/bertweet-base) (`vinai/bertweet-base`).
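
As a quick-start sketch (not part of the original card), the weights can be loaded through the standard Hugging Face `transformers` Auto classes; the model id below is a placeholder for this repository's id, not a published name.

```python
# Minimal loading sketch (assumption: standard `transformers` Auto classes;
# MODEL_ID is a placeholder for this repository's model id).
from transformers import AutoModelForMaskedLM, AutoTokenizer

MODEL_ID = "<this-repo-model-id>"  # placeholder, replace with this repo's id

# The checkpoint ships its own BERTweet-style tokenizer, so loading both
# pieces from the same repo keeps vocabulary and normalization consistent.
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForMaskedLM.from_pretrained(MODEL_ID)
```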

# Training Data

This model is pre-trained on over 83 million English tweets about the 2020 US Presidential Election.

# Training Objective

This model is initialized with BERTweet and further pre-trained with a masked language modeling (MLM) objective.
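
Because the checkpoint is trained for masked-token prediction, the `fill-mask` pipeline is a natural way to probe it. The snippet below is a sketch under assumptions: the model id is again a placeholder and the example tweet is invented, not taken from the training data.

```python
# Masked-token prediction sketch (assumption: `fill-mask` pipeline from
# `transformers`; the model id and the example tweet are placeholders).
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="<this-repo-model-id>")

# Build the input with the tokenizer's own mask token
# (BERTweet-style checkpoints use "<mask>").
masked_tweet = f"Go vote for {fill_mask.tokenizer.mask_token} in the 2020 election !"

# Print the top predicted tokens and their scores.
for prediction in fill_mask(masked_tweet):
    print(prediction["token_str"], round(prediction["score"], 4))
```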

# Reference

- [PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter](XXX), LREC 2022.

# Citation

```bibtex
@inproceedings{kawintiranon2022polibertweet,
  title     = {PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter},
  author    = {Kawintiranon, Kornraphop and Singh, Lisa},
  booktitle = {Proceedings of the Language Resources and Evaluation Conference},
  year      = {2022},
  publisher = {European Language Resources Association}
}
```