Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published 25 days ago • 40
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published 21 days ago • 55
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation Paper • 2409.06820 • Published Sep 10 • 62