58 25 113

Bo Li

luodian

https://brianboli.com/

luodian

AI & ML interests

None yet

Recent Activity

new activity about 14 hours ago

virtuoussy/Multi-subject-RLVR:About subject information.

liked a dataset 9 days ago

lmms-lab/k12

updated a dataset 9 days ago

lmms-lab/k12

View all activity

Organizations

luodian's activity

upvoted a paper 28 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 29 days ago • 38

upvoted a collection about 1 month ago

EgoLife

Collection

CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 28 days ago • 16

upvoted 2 papers about 2 months ago

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published Feb 13 • 42

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 50

upvoted a paper 2 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 25

upvoted 3 papers 4 months ago

upvoted 7 papers 6 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2, 2024 • 41

Contrastive Localized Language-Image Pre-Training

Paper • 2410.02746 • Published Oct 3, 2024 • 35

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 36

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 53

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 35

upvoted a collection 6 months ago

LLaVA-OneVision

Collection

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 24

upvoted a paper 8 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 60

upvoted a paper 9 months ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 35

upvoted a collection 9 months ago

LLaVA-Next-Interleave

Collection

7 items • Updated Oct 4, 2024 • 16

upvoted a paper 9 months ago

Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24, 2024 • 33