sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted_2 Viewer • Updated 19 days ago • 2.61k • 26
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_judged Viewer • Updated 20 days ago • 2.61k • 47
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted Viewer • Updated 20 days ago • 2.61k • 37
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers Viewer • Updated 21 days ago • 5.22k • 30
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2 Viewer • Updated 22 days ago • 2.61k • 48
sumukshashidhar-testing/yourbench_y1_singleshot_answers_reformatted Viewer • Updated 22 days ago • 3.49k • 29
sumukshashidhar-testing/yourbench_y1_multihop_questions Viewer • Updated about 1 month ago • 473 • 54
sumukshashidhar-testing/yourbench_y1_single_shot_questions Viewer • Updated about 1 month ago • 2.93k • 50
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models Paper • 2310.07611 • Published Oct 11, 2023 • 2