view article Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others β’ 29 days ago β’ 58
view article Article π§ββοΈ "Replacing Judges with Juries" using distilabel By alvarobartt β’ May 3, 2024 β’ 17
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper β’ 2404.18796 β’ Published Apr 29, 2024 β’ 72
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper β’ 2405.01535 β’ Published May 2, 2024 β’ 124
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 65 items β’ Updated Mar 20 β’ 630