SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? Paper • 2503.12349 • Published 3 days ago • 28
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? Paper • 2503.12349 • Published 3 days ago • 28 • 3
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training Paper • 2203.09313 • Published Mar 17, 2022
A Benchmark for Understanding and Generating Dialogue between Characters in Stories Paper • 2209.08524 • Published Sep 18, 2022
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? Paper • 2503.12349 • Published 3 days ago • 28