Is this model's accuracy being compared against a smaller model?

by ultraleow - opened

DistilBERT aims to make BERT smaller and faster while retaining as much of its performance as possible. Specifically, DistilBERT is 40% smaller than the original BERT-base model, runs 60% faster, and retains 97% of its language-understanding performance.

So, is this comparison appropriate?

When we released the model, DistilBERT-SST-2 was the standard sentiment model on Hugging Face. Our benchmark shows that our (larger) model is indeed more accurate, even if inference is somewhat slower. So when choosing a model, consider the trade-off between our more accurate but slower model and the faster but less accurate DistilBERT model, depending on your use case. Since our model is already trained, the higher computational cost of training is no longer relevant to your decision. Hope this helps!
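If you want to check this trade-off on your own data, a minimal sketch with the `transformers` pipeline API is below. The DistilBERT checkpoint ID (`distilbert-base-uncased-finetuned-sst-2-english`) is the real Hub model referenced above; `this-repo/larger-sentiment-model` is a placeholder you would swap for this repo's actual model ID.

```python
from time import perf_counter
from transformers import pipeline

# Real Hub checkpoint for the DistilBERT SST-2 sentiment model mentioned above.
distilbert = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

# Hypothetical placeholder ID -- replace with this repo's actual model ID.
larger = pipeline("sentiment-analysis", model="this-repo/larger-sentiment-model")

texts = ["I absolutely loved this movie!", "Not great, honestly."]

# Compare predictions and rough wall-clock inference time for each model.
for name, clf in [("distilbert", distilbert), ("larger", larger)]:
    start = perf_counter()
    preds = clf(texts)
    print(f"{name}: {preds} ({perf_counter() - start:.3f}s)")
```

For a fair speed comparison you would want a larger batch of texts and a warm-up run, but even this rough timing makes the accuracy-versus-latency trade-off concrete for your hardware.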

Thanks, I'm now able to understand the reasoning behind it.

ultraleow changed discussion status to closed
