OREAL - an internlm Collection

OREAL

updated Feb 11

internlm/OREAL-32B

Text Generation • Updated 22 days ago • 1.38k • 21
internlm/OREAL-7B

Text Generation • Updated 22 days ago • 522 • 20
internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 22 days ago • 1.43k • 8
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 60
internlm/OREAL-32B-SFT

Question Answering • Updated 22 days ago • 1.86k • 5
internlm/OREAL-7B-SFT

Text Generation • Updated 22 days ago • 141 • 1
internlm/OREAL-RL-Prompts

Viewer • Updated 29 days ago • 4.21k • 444 • 10