arxiv:2501.01264
Jiaheng Liu
CheeryLJH
AI & ML interests
None yet
Recent Activity
upvoted a paper about 22 hours ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
upvoted a paper 8 days ago
MiniMax-01: Scaling Foundation Models with Lightning Attention
upvoted a paper 15 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Organizations
Papers
40
models
None public yet
datasets
None public yet