abdeljalilELmajjodi

AI & ML interests

None yet


Organizations

Arabic Machine Learning, Tamazight NLP, AtlasIA, Moroccan Data Scientists, ThinkAI, Sawalni AI, Tamazight Open Dataset

abdeljalilELmajjodi's activity

upvoted an article about 17 hours ago

Introducing EuroBERT: A High-Performance Multilingual Encoder Model
By EuroBERT and 3 others • 98
reacted to alielfilali01's post with 👍 about 20 hours ago
The 3C3H AraGen Leaderboard today welcomes deepseek-ai/DeepSeek-V3 and 12 other models (including the late gpt-3.5 💀) to the ranking of the best LLMs in Arabic!


Observations:
- DeepSeek-V3 ranked 3rd and is the only open model among the top 5!

- A 14B open model (Qwen/Qwen2.5-14B-Instruct) outperforms gpt-3.5-turbo-0125 (from last year). This shows how far we have come in advancing and supporting Arabic presence within the LLM ecosystem!

- Contrary to what is observed on likelihood-accuracy leaderboards (like OALL/Open-Arabic-LLM-Leaderboard), further fine-tuned models like maldv/Qwentile2.5-32B-Instruct actually decreased performance compared to the original model Qwen/Qwen2.5-32B-Instruct.
It's worth noting that the decrease is statistically insignificant, which implies that, at best, out-of-domain fine-tuning does not really hurt the capabilities the model acquired during pretraining (see the sketch after this list).
Previous work has addressed this question (fine-tuning vs. pretraining), but more investigation is required (any PhDs here? This could be your research question...).
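One common way to check whether such a score drop is statistically significant is a paired bootstrap test over per-prompt scores. Below is a minimal sketch, assuming per-example 3C3H-style scores for both models on the same prompts are available as arrays; the function name and the synthetic data are hypothetical illustrations, not part of the AraGen evaluation code.

```python
import numpy as np

def paired_bootstrap_pvalue(scores_a, scores_b, n_boot=10_000, seed=0):
    """Two-sided paired bootstrap p-value for the mean score difference
    between two models evaluated on the same set of prompts."""
    rng = np.random.default_rng(seed)
    diffs = np.asarray(scores_a, dtype=float) - np.asarray(scores_b, dtype=float)
    observed = diffs.mean()
    # Center the differences so the null hypothesis (no difference) holds,
    # then resample prompts with replacement to build the null distribution.
    null = rng.choice(diffs - observed, size=(n_boot, diffs.size)).mean(axis=1)
    return float((np.abs(null) >= abs(observed)).mean())

# Hypothetical usage with synthetic per-prompt scores in [0, 1]:
rng = np.random.default_rng(42)
base = rng.uniform(0.4, 0.9, size=500)                 # base model scores
tuned = np.clip(base + rng.normal(-0.005, 0.05, 500), 0, 1)  # slightly lower on average
print(paired_bootstrap_pvalue(tuned, base))  # large p-value -> drop not significant
```

A large p-value here would support the observation above: the fine-tuned model's lower average score is within what random variation over prompts could produce.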


Check out the latest rankings: inceptionai/AraGen-Leaderboard