TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 63
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published Oct 24, 2024 • 11
Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning Paper • 1908.05803 • Published Aug 16, 2019
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers Paper • 2105.03011 • Published May 7, 2021 • 1
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs Paper • 1903.00161 • Published Mar 1, 2019
TRAM: Bridging Trust Regions and Sharpness Aware Minimization Paper • 2310.03646 • Published Oct 5, 2023