Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
gyeongwk
's Collections
Compositionality
Compositionality
updated
13 days ago
Upvote
-
gyeongwk/stage1-rft
8B
•
Updated
Feb 24
•
175
gyeongwk/Teacher-SFT-filter-max-correct_0.8-k_3
8B
•
Updated
Feb 24
•
2
gyeongwk/Teacher-SFT
8B
•
Updated
Feb 24
•
4
gyeongwk/stage2-rl-level2
8B
•
Updated
Feb 24
•
1
gyeongwk/Teacher-SFT-filter-max-correct_1.1-k_1
8B
•
Updated
Feb 24
•
1
gyeongwk/stage2-rl-level2-step-750
8B
•
Updated
Feb 24
•
2
gyeongwk/stage2-rft-50-rl-50
8B
•
Updated
Feb 25
•
1
gyeongwk/stage2-rft-compute-50
8B
•
Updated
Feb 24
•
2
gyeongwk/Bootstrap-SFT
8B
•
Updated
Mar 10
•
1
gyeongwk/On-policy-GRPO
8B
•
Updated
May 4
•
8
gyeongwk/Bootstrap-GRPO
8B
•
Updated
10 days ago
•
44
gyeongwk/onpolicy-grpo-600
8B
•
Updated
15 days ago
•
17
gyeongwk/onpolicy-grpo-300
8B
•
Updated
15 days ago
•
13
gyeongwk/stage2-dpo
8B
•
Updated
Apr 8
•
2
gyeongwk/On-policy-SFT
8B
•
Updated
Apr 16
•
57
gyeongwk/bootstrap-dpo
2B
•
Updated
Apr 17
•
2
Upvote
-
Share collection
View history
Collection guide
Browse collections