REBEL Cornell-AGI/REBEL-OpenChat-3.5 Text Generation • Updated May 29 • 35 • 1 Cornell-AGI/REBEL-Llama-3 Text Generation • Updated May 29 • 31 • 1 Cornell-AGI/REBEL-Llama-3-epoch_2 Text Generation • Updated May 29 • 11 • 3 REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1
REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1