AgentReview: Exploring Peer Review Dynamics with LLM Agents Paper • 2406.12708 • Published Jun 18 • 2
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Paper • 2410.07155 • Published Oct 9 • 11
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms Paper • 2402.14154 • Published Feb 21 • 2
MMSoc Benchmark Collection Benchmark datasets for the paper "MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms" • 7 items • Updated Aug 22 • 1
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries Paper • 2310.13132 • Published Oct 19, 2023 • 8
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models Paper • 2310.14566 • Published Oct 23, 2023 • 25