Running on CPU Upgrade 80 80 Open Japanese LLM Leaderboard 🌸 Explore and compare LLM models through interactive leaderboards and submissions
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published Feb 11 • 53
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated Feb 17
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated Feb 17
multilingual_benchmark Collection For evaluating multilingual ability of LLMs • 1 item • Updated Feb 13