-
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Paper • 2409.04109 • Published • 48 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
Reward-Robust RLHF in LLMs
Paper • 2409.15360 • Published • 6 -
EuroLLM: Multilingual Language Models for Europe
Paper • 2409.16235 • Published • 27
Haote Yang
Hoter
·
AI & ML interests
None yet
Organizations
Collections
1
Papers
1
models
None public yet