- MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
  Paper • 2502.00698 • Published • 24
- DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
  Paper • 2502.01142 • Published • 24
- ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
  Paper • 2502.01100 • Published • 17
- The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles
  Paper • 2502.01081 • Published • 14
Zhitong Gao
ZhitongGao
AI & ML interests
None yet
Recent Activity
Updated a collection about 1 month ago: Vlm
Organizations
Collections
1
Datasets
None public yet