OpenLearnLM
community
AI & ML interests
None defined yet.
models
5
OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300
8B
•
Updated
•
419
OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300
8B
•
Updated
•
64
OpenLearnLM/deepseek_qwen3_8b_nothink_grpo_step_300
8B
•
Updated
•
62
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300
8B
•
Updated
•
159
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300
8B
•
Updated
•
62
datasets
0
None public yet