Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning Paper • 2310.20587 • Published Oct 31, 2023 • 16
SELF: Language-Driven Self-Evolution for Large Language Model Paper • 2310.00533 • Published Oct 1, 2023 • 2
Bigger, Better, Faster: Human-level Atari with human-level efficiency Paper • 2305.19452 • Published May 30, 2023 • 4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search Paper • 2408.08152 • Published Aug 15 • 52