Generative Verifiers: Reward Modeling as Next-Token Prediction Paper • 2408.15240 • Published Aug 27, 2024 • 13
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic Text Generation • Updated Oct 19, 2024 • 231 • 14