II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models (arXiv:2406.05862, published Jun 9, 2024)
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks (arXiv:2410.10563, published Oct 14, 2024)
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark (arXiv:2409.02813, published Sep 4, 2024)
MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation (arXiv:2406.15252, published Jun 21, 2024)
GenAI Arena: An Open Evaluation Platform for Generative Models (arXiv:2406.04485, published Jun 6, 2024)
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark (arXiv:2406.01574, published Jun 3, 2024)
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents (arXiv:2403.02502, published Mar 4, 2024)
A Comprehensive Study of Knowledge Editing for Large Language Models (arXiv:2401.01286, published Jan 2, 2024)
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI (arXiv:2311.16502, published Nov 27, 2023)