2 5 25

Mert Ege

mertege

mertege

AI & ML interests

None yet

Recent Activity

liked a Space 2 months ago

nanotron/ultrascale-playbook

liked a model 2 months ago

ALLaM-AI/ALLaM-7B-Instruct-preview

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

mertege's activity

liked a Space 2 months ago

2.53k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 2 months ago

ALLaM-AI/ALLaM-7B-Instruct-preview

Text Generation • Updated Mar 12 • 8.69k • 110

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 390

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated Feb 24 • 2.22M • • 1.35k

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 1.74M • • 12k

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

liked a dataset 7 months ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 170 • 5

liked a dataset 8 months ago

BAAI/Infinity-Instruct

Viewer • Updated Feb 25 • 20.4M • 4.7k • 618

liked a model 8 months ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • Updated Jun 25, 2024 • 1.8k • 56

liked a dataset 8 months ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 186 • 21

liked 3 models 8 months ago

liked a Space 8 months ago

147

Open Arabic LLM Leaderboard

🏆

Track, rank and evaluate open Arabic LLMs and chatbots

upvoted an article 8 months ago

Article

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Jan 19, 2021

• 4

liked a model 9 months ago

haoranxu/ALMA-13B-Pretrain

Text Generation • Updated Oct 5, 2024 • 2.27k • 9

liked a dataset 10 months ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 813k • 217

upvoted a paper 10 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 96

liked a Space 10 months ago

Magpie

🐦

Generate and rate instruction-response pairs

liked a Space 11 months ago

924

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training