Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 2 days ago • 46
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 4 days ago • 89
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs Paper • 2502.17424 • Published 18 days ago • 2
SEA-HELM: Southeast Asian Holistic Evaluation of Language Models Paper • 2502.14301 • Published 23 days ago • 1
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published Jan 14 • 6
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published Jan 8 • 11
U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 4 items • Updated Jan 14 • 15
Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published Dec 5, 2024 • 13
Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers Paper • 2409.08916 • Published Sep 13, 2024 • 4
Plant foundation models Collection A collection of pre-trained DNA models for plant genomes. • 19 items • Updated Oct 23, 2024 • 5
Malaysian synthetic dataset Collection Use LLM to generate Malaysian context synthetic dataset. • 33 items • Updated Dec 23, 2024 • 1
RedCode: Risky Code Execution and Generation Benchmark for Code Agents Paper • 2411.07781 • Published Nov 12, 2024 • 1
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models Paper • 2410.20771 • Published Oct 28, 2024 • 3