Xiusi Chen's picture

4 9

Xiusi Chen

XtremSup

·

https://xiusic.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 5 hours ago

ToolRL: Reward is All Tool Learning Needs

upvoted a paper about 12 hours ago

OTC: Optimal Tool Calls via Reinforcement Learning

upvoted a paper about 12 hours ago

ToolRL: Reward is All Tool Learning Needs

View all activity

Organizations

XtremSup's activity

authored a paper about 5 hours ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published 6 days ago • 25

upvoted 2 papers about 12 hours ago

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published 1 day ago • 20

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published 6 days ago • 25

liked 4 datasets about 1 month ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated 4 days ago • 1.79M • 4.62k • 64

argilla/distilabel-math-preference-dpo

Viewer • Updated Jul 16, 2024 • 2.42k • 391 • 86

Vezora/Code-Preference-Pairs

Viewer • Updated Jul 28, 2024 • 54k • 212 • 22

infly/INF-ORM-Preference-Magnitude-80K

Viewer • Updated Dec 5, 2024 • 76k • 139 • 7

liked a model about 1 month ago

Skywork/Skywork-Critic-Llama-3.1-8B

Text Generation • Updated Sep 29, 2024 • 187 • 11

liked a dataset about 1 month ago

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25, 2024 • 77k • 750 • 47

upvoted a paper 6 months ago

SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation

Paper • 2410.14745 • Published Oct 17, 2024 • 48

liked a dataset 6 months ago

McAuley-Lab/Amazon-C4

Viewer • Updated Apr 9, 2024 • 21.2k • 370 • 5

authored a paper 6 months ago

SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation

Paper • 2410.14745 • Published Oct 17, 2024 • 48

liked a dataset about 1 year ago

McAuley-Lab/Amazon-Reviews-2023

Updated Dec 8, 2024 • 33.8k • 142

liked a Space over 2 years ago

Stable Diffusion 2-1

Generate images from text descriptions