Yuanxin Liu

lyx97

https://llyx97.github.io/

llyx97

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

lmms-lab/LLaVA-Video-7B-Qwen2

updated a dataset 8 days ago

lyx97/t3_probing_data

upvoted a paper 26 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

View all activity

Organizations

None yet

lyx97's activity

liked a model about 3 hours ago

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 65.3k • 59

updated a dataset 8 days ago

lyx97/t3_probing_data

Viewer • Updated 8 days ago • 25.9k • 1

upvoted a paper 26 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 124

updated a Space about 1 month ago

Running

🥇

TempCompass

liked 2 Spaces about 2 months ago

Running on CPU Upgrade

560

🌎

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Running

🌎

Open VLM Video Leaderboard

VLMEvalKit Eval Results in video understanding benchmark

liked a dataset 3 months ago

tobiaslee/text_temporal

Viewer • Updated Sep 27, 2024 • 12.5k • 82 • 2

upvoted 2 papers 3 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 63

authored 4 papers 3 months ago

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Paper • 2311.17404 • Published Nov 29, 2023

TempCompass: Do Video LLMs Really Understand Videos?

Paper • 2403.00476 • Published Mar 1, 2024

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Paper • 2210.15523 • Published Oct 27, 2022 • 1

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 12

upvoted 2 papers 3 months ago

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7, 2024 • 45

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 12

liked a model 4 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Dec 6, 2024 • 1.51M • • 1.03k

liked 2 datasets 5 months ago

lmms-lab/Video-MME

Viewer • Updated Jul 4, 2024 • 2.7k • 10.5k • 31

lmms-lab/TempCompass

Viewer • Updated Jun 10, 2024 • 7.54k • 290 • 5

upvoted a paper 6 months ago

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Paper • 2407.00468 • Published Jun 29, 2024 • 34

liked a dataset 7 months ago

MLVU/MVLU

Preview • Updated Sep 18, 2024 • 4.67k • 18