How to Synthesize Text Data without Model Collapse? Paper โข 2412.14689 โข Published 6 days ago โข 45
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment Paper โข 2412.13746 โข Published 7 days ago โข 8