Chain-of-Verification Reduces Hallucination in Large Language Models Paper • 2309.11495 • Published Sep 20, 2023 • 38
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model Paper • 2310.09520 • Published Oct 14, 2023 • 10