Running 1.78k 1.78k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) โข 13 items โข Updated Nov 18, 2024 โข 204
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper โข 2305.18290 โข Published May 29, 2023 โข 53