RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios Paper • 2412.08972 • Published 10 days ago • 9
VisionArena: 230K Real World User-VLM Conversations with Preference Labels Paper • 2412.08687 • Published 11 days ago • 11
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper • 2412.09605 • Published 10 days ago • 25
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 9 days ago • 130
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 7