RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios Paper • 2412.08972 • Published 10 days ago • 9
VisionArena: 230K Real World User-VLM Conversations with Preference Labels Paper • 2412.08687 • Published 11 days ago • 11
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper • 2412.09605 • Published 10 days ago • 25
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 9 days ago • 130
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 7
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13 • 98
YiZhao Dataset Collection Data and filtering models of our financial open-source YiZhao Dataset. • 5 items • Updated 10 days ago • 1
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Paper • 2411.17188 • Published 26 days ago • 21
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published 26 days ago • 76
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Paper • 2411.18203 • Published 25 days ago • 30
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper • 2412.02592 • Published 19 days ago • 20
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published 19 days ago • 59
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement Paper • 2412.04003 • Published 17 days ago • 9
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 18 days ago • 43
MALT: Improving Reasoning with Multi-Agent LLM Training Paper • 2412.01928 • Published 20 days ago • 38
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation Paper • 2411.16657 • Published 27 days ago • 17
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published Oct 30 • 46