arxiv:2411.19638
Taja Kuzman
TajaKuzman
AI & ML interests
Automatic genre identification, web corpora creation and curation, text categorization, machine translation and other language technologies topics.
Recent Activity
updated
a model
15 days ago
classla/multilingual-IPTC-news-topic-classifier
authored
a paper
19 days ago
Do Language Models Care About Text Quality? Evaluating Web-Crawled
Corpora Across 11 Languages
Organizations
Papers
3
models
None public yet