Ali Elfilali

Ali-C137

AI & ML interests

NLP (mainly for Arabic), Reinforcement Learning and Cognitive science

Organizations

Posts 10

view post
Post
264
Is it just me or is it real that whenever APPLE releases an open model, they accompany it with a library !? First was MLX, about a month ago AXLEARN and now CORENET ! Could it be just coincidences or does Apple playing some game ? if yes then what is it ... ? What do you think ? maybe i'm just hallucinating now 😅
view post
Post
2031
Honestly i don't understand how come we as the open source community haven't surpassed GPT-4 yet ? Like for me it looks like everything is out there just need to be exploited! Clearly specialized small models outperforms gpt4 on downstream tasks ! So why haven't we just trained a 1B-2B really strong general model and then continue pertained and/or finetuned it on datasets for downstream tasks like math, code...well structured as Textbooks format or other datasets formats that have been proven to be really efficient and good! Ounce you have 100 finetuned model, just wrap them all into a FrankenMoE and Voila ✨
And that's just what a NOOB like myself had in mind, I'm sure there is better, more efficient ways to do it ! So the question again, why we haven't yet ? I feel I'm missing something... Right?