roberta-base-bne / README.md
asier-gutierrez's picture
Update README.md
eea2436
|
raw
history blame
1.12 kB
metadata
language:
  - es
license: cc-by-4.0
tags:
  - national library of spain
  - spanish
  - bne
datasets:
  - bne
metrics:
  - ppl
widget:
  - text: Este año las campanadas de La Sexta las presentará <mask>.
  - text: David Broncano es un presentador de La <mask>.
  - text: >-
      Gracias a los datos de la BNE se ha podido <mask> este modelo del
      lenguaje.
  - text: Hay base legal dentro del marco <mask> actual.

RoBERTa base trained with data from National Library of Spain (BNE)

Citing

Check out our paper for all the details: https://arxiv.org/abs/2107.07253

@misc{gutierrezfandino2021spanish,
      title={Spanish Language Models}, 
      author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Marc Pàmies and Joan Llop-Palao and Joaquín Silveira-Ocampo and Casimiro Pio Carrino and Aitor Gonzalez-Agirre and Carme Armentano-Oller and Carlos Rodriguez-Penagos and Marta Villegas},
      year={2021},
      eprint={2107.07253},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

For more information visit our GitHub repository