saurabh5/MATH_3000_Filtered_olmo_completions_new_template_filtered • Viewer • Updated 4 days ago • 2.93k • 19
saurabh5/DAPO-Math-17k-Processed_filtered_olmo_completions_new_template_filtered • Viewer • Updated 4 days ago • 10.4k • 23
saurabh5/DAPO-Math-17k-Processed_filtered_olmo_completions_new_template • Viewer • Updated 4 days ago • 12.6k • 13
saurabh5/IF_multi_constraints_upto5_filtered_olmo_completions_filtered • Viewer • Updated 4 days ago • 88.6k • 39
saurabh5/rlvr_acecoder_filtered_filtered_olmo_completions_filtered • Viewer • Updated 4 days ago • 62.5k • 37
saurabh5/synthetic2-rlvr-code-compressed_filtered_olmo_completions_filtered • Viewer • Updated 4 days ago • 10.9k • 26
saurabh5/DAPO-Math-17k-Processed_filtered_olmo_completions_filtered • Viewer • Updated 4 days ago • 12.6k • 22
saurabh5/synthetic2-rlvr-code-compressed_filtered_olmo_completions • Viewer • Updated 4 days ago • 11k • 34
saurabh5/IF_multi_constraints_upto5_filtered_olmo_completions • Viewer • Updated 4 days ago • 95.3k • 46
saurabh5/rlvr-code-view-tool-new-first-turn-only-user-with-repo-name • Viewer • Updated 5 days ago • 13.3k • 27
saurabh5/olmo-3-preference-mix-deltas_reasoning-yolo_even_split-DECON-no-chinese • Viewer • Updated 11 days ago • 526k • 68
saurabh5/rlvr-prompts_responses-mixin_it_up-v2-filtered-no-chinese • Viewer • Updated 12 days ago • 131k • 111
saurabh5/rlvr_mixin_it_up_prompts-qwen25-r1-distill-32b-1_5B-thoughts-x16-filtered-no-chinese • Viewer • Updated 13 days ago • 97.6k • 168
saurabh5/rlvr_mixin_it_up_prompts-qwen25-r1-distill-32b-1_5B-thoughts-x16 • Viewer • Updated 13 days ago • 95k • 271
saurabh5/rlvr_mixin_it_up_prompts-qwen3-32b-06B-thoughts-x8-filtered-no-chinese • Viewer • Updated 16 days ago • 87k • 190
saurabh5/rlvr_mixin_it_up_prompts-qwen3-32b-06B-thoughts-x8 • Viewer • Updated 16 days ago • 85.9k • 387
saurabh5/rlvr_mixin_it_up_prompts-qwen3-32b-06B-thoughts-x8-filtered • Viewer • Updated 18 days ago • 97.5k • 135