Lazy Beaver
Jayce-Ping
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper about 12 hours ago
Rethinking the Divergence Regularization in LLM RL updated a collection 2 days ago
Flow-DPPO: GenEval2 Reward LoRA Adapters