OpenLearnLM

community

AI & ML interests

None defined yet.

Recent Activity

Unggi updated a collection about 2 months ago

Unggi updated a model about 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2

Unggi published a model about 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2

View all activity

Collections 2

models 9

OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2

Text Generation • 8B • Updated May 4 • 6

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward

Text Generation • 8B • Updated Apr 17 • 8

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward

Text Generation • 8B • Updated Apr 7 • 3

OpenLearnLM/qwen2.5_7b_nothink_noreward_grpo_step_300

8B • Updated Jan 13 • 2

OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300

8B • Updated Jul 9, 2025 • 3

OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300

8B • Updated Jul 9, 2025 • 1

OpenLearnLM/deepseek_qwen3_8b_nothink_grpo_step_300

8B • Updated Jul 9, 2025 • 2

OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300

8B • Updated Jul 9, 2025 • 2

OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300

8B • Updated Jul 9, 2025 • 2

datasets 0

None public yet