arxiv:2410.05255
Eric Lan
Eric-Lan
AI & ML interests
LLM Alignment, RLHF, Reinforcement Learning, Diffusion Model, Deep Learning, Federated Learning
Recent Activity
liked
a model
about 1 month ago
Eric-Lan/stack-llama-2
New activity
about 2 months ago
DwanZhang/SePPO:add the paper link
upvoted
a
paper
about 2 months ago
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Organizations
Papers
1
datasets
None public yet