File size: 492 Bytes
c2b379e
53b02f1
c2b379e
 
 
 
 
 
 
 
7dce53a
c2b379e
 
3ba308b
 
c2b379e
3ba308b
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
---
pipeline_tag: fill-mask
tags:
- online social networks
- twitter
- spanish
language: es
license: apache-2.0
widget:
- text: "Las <mask> causan hipoxia."
  example_title: "Mask filling"
---

Model BERTuit as presented in the [BERTuit: Understanding Spanish language in Twitter through a native transformer](https://arxiv.org/abs/2204.03465) article.

Before tokenization replace user tags and urls with "\<usr\>" and "\<url\>" respectively.

Tokenize text with base class RoBERTaTokenizer.