REBEL Cornell-AGI/REBEL-OpenChat-3.5 Text Generation • Updated May 29 • 110 • 1 Cornell-AGI/REBEL-Llama-3 Text Generation • Updated May 29 • 67 • 1 Cornell-AGI/REBEL-Llama-3-epoch_2 Text Generation • Updated May 29 • 22 • 3 REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1
REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1