Panacea: Pareto Alignment via Preference Adaptation for LLMs Paper • 2402.02030 • Published Feb 3, 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents Paper • 2401.10568 • Published Jan 19, 2024