umangkaushik

ubermenchh

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago
ubermenchh/llama3.1-8B-gsm8k-grpo
liked a dataset 3 days ago
open-r1/OpenR1-Math-Raw
published a model 3 days ago
ubermenchh/llama3.1-8B-gsm8k-grpo
View all activity

Organizations

Social Post Explorers's profile picture

ubermenchh's activity

upvoted an article 9 days ago
view article
Article

The N Implementation Details of RLHF with PPO

37
New activity in ubermenchh/SmolLM2-DPO 15 days ago

details pls

1
#1 opened 15 days ago by
archit11