Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ankner
's Collections
Base Models With Chat Templates
Hydra Decoding
Oracle 2 Proxy Models
Oracle 2 Proxy Data
Multi Judgement Oversight
Critique-out-Loud Reward Models
Oracle 2 Proxy Data
updated
11 days ago
Upvote
-
ankner/gsm8k-CoT
Viewer
•
Updated
15 days ago
•
8.78k
•
36
ankner/gsm8k-sft
Viewer
•
Updated
13 days ago
•
1.1k
•
68
ankner/gsm8k-rl
Viewer
•
Updated
13 days ago
•
7.68k
•
1.74k
ankner/apps-sft
Viewer
•
Updated
21 days ago
•
3.51k
•
112
ankner/apps-rl
Viewer
•
Updated
11 days ago
•
5.25k
•
162
ankner/apps-rl-deepseek-7b-inst-labeled
Viewer
•
Updated
20 days ago
•
5.25k
•
165
ankner/chat-pref
Viewer
•
Updated
16 days ago
•
39.7k
•
43
Upvote
-
Share collection
View history
Collection guide
Browse collections