Running 2.45k 2.45k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
Atla Selene Mini: A General Purpose Evaluation Model Paper โข 2501.17195 โข Published Jan 27 โข 36
view article Article Judge Arena: Benchmarking LLMs as Evaluators By kaikaidai and 7 others โข Nov 19, 2024 โข 56