Ziniu Li
znli
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method
for Aligning Large Language Models
commented on
a paper
2 months ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
liked
a dataset
3 months ago
allenai/olmo-mix-1124
Organizations
None yet
models
None public yet
datasets
None public yet