Skywork-Reward-Data-Collection
Collection
Open-source preference datasets used to train the Skywork reward model series
•
17 items
•
Updated
•
16
models:
- model: ConvexAI/Luminex-34B-v0.2
- model: fblgit/UNA-34BeagleSimpleMath-32K-v1
merge_method: model_stock
base_model: abacusai/Smaug-34B-v0.1
dtype: bfloat16