arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
authored a paper about 1 month ago
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
upvoted a paper 2 months ago
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
authored a paper 3 months ago
ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Organizations
None yet
Papers
3
Datasets
None public yet