vinai/bertweet-large

dqnguyen committed on Apr 26, 2022

Commit dc5f5fd • 1 Parent(s): c4b2482

Update README.md

Files changed (1)

README.md +2 -1

@@ -67,7 +67,8 @@ Before applying BPE to the pre-training corpus of English Tweets, we tokenized t
 For `vinai/bertweet-large`, given the raw input Tweets, to obtain the same pre-processing output, users could employ our  [TweetNormalizer](https://github.com/VinAIResearch/BERTweet/blob/master/TweetNormalizer.py) module.
-- Installation: `pip3 install nltk emoji`
+- Installation: `pip3 install nltk emoji==0.6.0`
+- The `emoji` version must be either 0.5.4 or 0.6.0. Newer `emoji` versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing our pre-training Tweet corpus.
 ```python
 import torch
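
The version constraint added in this change could be checked before running the normalizer. The following is a minimal sketch, not part of the BERTweet repository: the names `SUPPORTED_EMOJI_VERSIONS`, `is_supported_emoji_version`, and `installed_emoji_version` are hypothetical, and it assumes Python 3.8 or later so that `importlib.metadata` is available.

```python
from importlib import metadata

# Versions the README change names as consistent with the pre-processing
# of the pre-training Tweet corpus.
SUPPORTED_EMOJI_VERSIONS = ("0.5.4", "0.6.0")


def is_supported_emoji_version(version):
    """Return True if `version` is one of the supported `emoji` releases."""
    return version.strip() in SUPPORTED_EMOJI_VERSIONS


def installed_emoji_version():
    """Return the installed `emoji` version string, or None if it is not installed."""
    try:
        return metadata.version("emoji")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_emoji_version()
    if found is None:
        print("emoji is not installed; try: pip3 install nltk emoji==0.6.0")
    elif not is_supported_emoji_version(found):
        print(f"emoji {found} found; expected one of {SUPPORTED_EMOJI_VERSIONS}")
    else:
        print(f"emoji {found} is compatible")
```

Running a check like this once at start-up may help catch an environment where a newer `emoji` release was pulled in by another dependency.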