InkubaLM: A small language model for low-resource African languages Paper • 2408.17024 • Published Aug 30, 2024
PuoBERTa: Training and evaluation of a curated language model for Setswana Paper • 2310.09141 • Published Oct 13, 2023
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition Paper • 2210.12391 • Published Oct 22, 2022
Preparing the Vuk'uzenzele and ZA-gov-multilingual South African multilingual corpora Paper • 2303.03750 • Published Mar 7, 2023
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation Paper • 2112.02721 • Published Dec 6, 2021
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages Paper • 2305.13989 • Published May 23, 2023
Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati Paper • 2306.07426 • Published Jun 12, 2023