🧠 Reasoning datasets (Collection): Datasets with reasoning traces for math and code released by the community • 21 items • Updated 7 days ago • 129
Article: Preference Tuning LLMs with Direct Preference Optimization Methods • Jan 18, 2024 • 56