Hugging Face
umangkaushik
ubermenchh
9 followers · 61 following
ubermenchh_
ubermenchh
AI & ML interests
None yet
Recent Activity
Updated a model 3 days ago: ubermenchh/llama3.1-8B-gsm8k-grpo
Liked a dataset 3 days ago: open-r1/OpenR1-Math-Raw
Published a model 3 days ago: ubermenchh/llama3.1-8B-gsm8k-grpo
Organizations
ubermenchh's activity
Updated a model 3 days ago: ubermenchh/llama3.1-8B-gsm8k-grpo (Updated 3 days ago • 20)
Liked a dataset 3 days ago: open-r1/OpenR1-Math-Raw (Viewer • Updated 4 days ago • 516k • 61)
Published a model 3 days ago: ubermenchh/llama3.1-8B-gsm8k-grpo (Updated 3 days ago • 20)
Liked a dataset 8 days ago: prithivMLmods/OpenWeb383K (Viewer • Updated 10 days ago • 383k • 111 • 9)
Updated a model 9 days ago: ubermenchh/SmolLM2-SFT-sarvam-samvaad (Text Generation • Updated 9 days ago • 12)
Published a model 9 days ago: ubermenchh/SmolLM2-SFT-sarvam-samvaad (Text Generation • Updated 9 days ago • 12)
Upvoted an article 9 days ago: The N Implementation Details of RLHF with PPO (Article • Oct 24, 2023 • 37)
Updated a model 10 days ago: ubermenchh/SmolLM2-360M-r1-grpo-countdown (Updated 10 days ago)
Published a model 10 days ago: ubermenchh/SmolLM2-360M-r1-grpo-countdown (Updated 10 days ago)
Liked a dataset 13 days ago: open-r1/OpenThoughts-114k-math (Viewer • Updated 18 days ago • 89.1k • 2.01k • 61)
Updated a dataset 13 days ago: ubermenchh/information-theory-instruction-tuning (Viewer • Updated 13 days ago • 1 • 29)
Published a dataset 13 days ago: ubermenchh/information-theory-instruction-tuning (Viewer • Updated 13 days ago • 1 • 29)
Updated a model 14 days ago: ubermenchh/SmolLM2-DPO-ultrafeedback-binarized-preferences (Text Generation • Updated 14 days ago • 7)
Published a model 14 days ago: ubermenchh/SmolLM2-DPO-ultrafeedback-binarized-preferences (Text Generation • Updated 14 days ago • 7)
New activity in ubermenchh/SmolLM2-DPO 15 days ago: "details pls" (1 • #1 opened 15 days ago by archit11)
Updated a model 15 days ago: ubermenchh/SmolLM2-DPO (Text Generation • Updated 15 days ago • 5)
Published a model 15 days ago: ubermenchh/SmolLM2-DPO (Text Generation • Updated 15 days ago • 5)
Updated a model 16 days ago: ubermenchh/SmolLM2-FT-the-smol-stack (Text Generation • Updated 16 days ago • 16)
Published a model 16 days ago: ubermenchh/SmolLM2-FT-the-smol-stack (Text Generation • Updated 16 days ago • 16)
Updated a model 16 days ago: ubermenchh/SmolLM2-FT-smoltalk (Text Generation • Updated 16 days ago • 13)