--- language: - es license: {cc4_0} tags: - {language model} # Example: audio - {national library of spain} # Example: automatic-speech-recognition - {spanish} # Example: speech datasets: - {bne} # Example: common_voice metrics: - {ppl} # Example: wer --- # RoBERTa base trained with data from National Library of Spain (BNE) ## Citing ``` @misc{gutierrezfandino2021spanish, title={Spanish Language Models}, author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Marc Pàmies and Joan Llop-Palao and Joaquín Silveira-Ocampo and Casimiro Pio Carrino and Aitor Gonzalez-Agirre and Carme Armentano-Oller and Carlos Rodriguez-Penagos and Marta Villegas}, year={2021}, eprint={2107.07253}, archivePrefix={arXiv}, primaryClass={cs.CL} } ``` For more information visit our [GitHub repository](https://github.com/PlanTL-SANIDAD/lm-spanish)