9 3 9

Chris (Yuhao) Liu

chrisliu298

https://chrisliu298.ai/

AI & ML interests

Alignment

Recent Activity

liked a model 1 day ago

Skywork/Skywork-Reward-Llama-3.1-8B-v0.2

liked a model 1 day ago

Skywork/Skywork-Reward-Gemma-2-27B-v0.2

authored a paper 17 days ago

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization

View all activity

Organizations

chrisliu298's activity

liked 2 models 1 day ago

Skywork/Skywork-Reward-Llama-3.1-8B-v0.2

Text Classification • Updated Oct 25, 2024 • 15.7k • 32

Skywork/Skywork-Reward-Gemma-2-27B-v0.2

Text Classification • Updated Oct 25, 2024 • 3.39k • 29

authored 2 papers 17 days ago

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization

Paper • 2412.18279 • Published Dec 24, 2024

LLM Unlearning via Loss Adjustment with Only Forget Data

Paper • 2410.11143 • Published Oct 14, 2024

liked a dataset 3 months ago

argilla/magpie-ultra-v1.0

Viewer • Updated Nov 26, 2024 • 3.22M • 5.16k • 41

New activity in argilla/magpie-ultra-v1.0 3 months ago

Question About Dataset Content

#2 opened 3 months ago by

chrisliu298

New activity in Skywork/Skywork-Reward-Gemma-2-27B 3 months ago

Reward model returns 0 scores for all cases

#1 opened 5 months ago by

iseesaw

updated 2 models 3 months ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B

Text Classification • Updated 6 days ago • 7.04k • 47

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B

Text Classification • Updated 6 days ago • 629 • 27

New activity in Skywork/Skywork-Reward-Gemma-2-27B-v0.2 4 months ago

unexpected results

#1 opened 4 months ago by

ShikaiChen

authored a paper 4 months ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 16

updated a collection 4 months ago

Skywork-Reward-Model

Collection

Skywork reward model series • 6 items • Updated Nov 26, 2024 • 6

updated 2 datasets 4 months ago

Skywork/Skywork-Reward-Preference-80K-v0.1

Viewer • Updated Oct 25, 2024 • 82k • 60 • 42

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25, 2024 • 77k • 775 • 41

upvoted a paper 4 months ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 16

commented a paper 4 months ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 16 •

updated a collection 4 months ago

Skywork-Reward-Model

Collection

Skywork reward model series • 6 items • Updated Nov 26, 2024 • 6

updated 3 models 4 months ago