Yuheng Zhang's picture

1 2

Yuheng Zhang

MatouK98

·

AI & ML interests

None yet

Organizations

MatouK98's activity

upvoted 2 papers 9 months ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 101