ldwang
ldwang
AI & ML interests
LLM, MLLM, Infra
Recent Activity
upvoted
an
article
3 days ago
Open-R1: a fully open reproduction of DeepSeek-R1
liked
a dataset
3 days ago
dgslibisey/MuSiQue
commented on
a paper
3 days ago
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning
for Fast, Scalable LLM Post-Training