3 5 21

umangkaushik

ubermenchh

AI & ML interests

None yet

Recent Activity

liked a dataset 23 days ago

nl-to-logql/natural-logql

liked a dataset about 1 month ago

michaelmallari/rt-iot2022

liked a dataset about 1 month ago

GAIR/LIMO

View all activity

Organizations

ubermenchh's activity

liked a dataset 23 days ago

nl-to-logql/natural-logql

Viewer • Updated Dec 6, 2024 • 424 • 72 • 5

liked 2 datasets about 1 month ago

michaelmallari/rt-iot2022

Viewer • Updated Apr 18, 2024 • 123k • 49 • 1

GAIR/LIMO

Viewer • Updated Feb 10 • 817 • 4.9k • 147

New activity in ubermenchh/Qwen2.5-3B-open-r1-math about 1 month ago

Adding `safetensors` variant of this model

#1 opened about 1 month ago by

SFconvertbot

updated 2 models about 1 month ago

ubermenchh/Qwen2.5-3B-open-r1-math

Text Generation • Updated Feb 23 • 24

ubermenchh/Qwen2.5-3B-open-r1-math-lora

Updated Feb 23

published 2 models about 1 month ago

ubermenchh/Qwen2.5-3B-open-r1-math-lora

Updated Feb 23

ubermenchh/Qwen2.5-3B-open-r1-math

Text Generation • Updated Feb 23 • 24

updated a model about 1 month ago

ubermenchh/Qwen2.5-3B-openr1-math

Text Generation • Updated Feb 23 • 13

published a model about 1 month ago

ubermenchh/Qwen2.5-3B-openr1-math

Text Generation • Updated Feb 23 • 13

updated a model about 1 month ago

ubermenchh/Qwen2.5-0.5B-openr1-math

Updated Feb 21

published a model about 1 month ago

ubermenchh/Qwen2.5-0.5B-openr1-math

Updated Feb 21

upvoted a collection about 2 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 20 items • Updated 4 days ago • 122

updated a model about 2 months ago

ubermenchh/llama3.1-8B-gsm8k-grpo

Updated Feb 13 • 31

liked a dataset about 2 months ago

open-r1/OpenR1-Math-Raw

Viewer • Updated Feb 24 • 516k • 1.26k • 72

published a model about 2 months ago

ubermenchh/llama3.1-8B-gsm8k-grpo

Updated Feb 13 • 31

liked a dataset about 2 months ago

prithivMLmods/OpenWeb383K

Viewer • Updated Feb 6 • 383k • 111 • 4

updated a model about 2 months ago

ubermenchh/SmolLM2-SFT-sarvam-samvaad

Text Generation • Updated Feb 7 • 7

published a model about 2 months ago

ubermenchh/SmolLM2-SFT-sarvam-samvaad

Text Generation • Updated Feb 7 • 7

upvoted an article about 2 months ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

• 48