Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation Paper • 2311.08877 • Published Nov 15, 2023 • 6
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5? Paper • 2311.07587 • Published Nov 8, 2023 • 3
PathFinder: Guided Search over Multi-Step Reasoning Paths Paper • 2312.05180 • Published Dec 8, 2023 • 9
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator Paper • 2312.04474 • Published Dec 7, 2023 • 29