Taylor658
posted an update May 29
Cohere for AI, Argilla, and Hugging Face are collaborating on an Open Science Project to improve multilingual model evaluations. The project focuses on the widely used MMLU dataset, which spans 57 subjects such as mathematics, computer science, and law. However, existing translations often miss linguistic and cultural nuances, which embeds biases in the evaluations. 🤔

To address this, they have annotated a subset of the MMLU test set and are inviting people from around the world to review the prompts, highlighting cultural specifics and required knowledge. They say these insights will help shape future multilingual model evaluations, making them more inclusive and accurate. 🗺️ 🙌

โ–ถ๏ธ To get started go to: CohereForAI/MMLU-evaluation

๐ŸŒ They also have an Aya Discord server for collaboration with other participants: https://discord.gg/9gVhdfnQMN