Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Ali-C137ย 
posted an update Feb 14
Post
The Aya project ( CohereForAI/aya_dataset, CohereForAI/aya_collection and CohereForAI/aya_evaluation_suite) by CohereForAI got released yesterday ! And today I'am excited to introduce Arabic Aya (2A) ๐ŸŒŸ

Arabic Aya is a carefully curated dataset, derived from the vast Aya collection by CohereForAI, tailored specifically for Arabic language processing. It consolidates texts across Modern Standard Arabic (MSA) and other dialects, simplifying access to high-quality data for researchers, developers, and linguists.

๐Ÿ” Why Arabic Aya?
- Time-saving : Jump straight into your projects with pre-filtered Arabic texts.
- Diverse applications : Perfect for language modeling, sentiment analysis, dialect identification, and more.
- Community-driven : Your contributions and feedback can help enrich this resource further.

๐Ÿ“š Utilize Arabic Aya for your next NLP/LLM projects and be part of advancing Arabic language technologies. Letโ€™s collaborate to make Arabic AI research more accessible and robust!

Check it out here: 2A2I/Arabic_Aya
In this post