LongVa

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

PY007 authored a paper 3 days ago

EgoLife: Towards Egocentric Life Assistant

Jingkang authored a paper 3 days ago

EgoLife: Towards Egocentric Life Assistant

PY007 authored a paper 28 days ago

Fast Video Generation with Sliding Tile Attention

View all activity

LongVa's activity

PY007

authored a paper 3 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 5 days ago • 31

Jingkang

authored a paper 3 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 5 days ago • 31

PY007

authored a paper 28 days ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 49

pufanyi

authored a paper about 1 month ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 24

luodian

authored a paper 3 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 47

kcz358

authored a paper 3 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

luodian

authored a paper 3 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

Jingkang

authored a paper 4 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 23

luodian

authored 4 papers 5 months ago

kcz358

authored a paper 5 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

PY007

authored a paper 5 months ago

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 13

luodian

authored 6 papers 7 months ago

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Paper • 2310.08588 • Published Oct 12, 2023 • 36

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Paper • 2306.05425 • Published Jun 8, 2023 • 11

FunQA: Towards Surprising Video Comprehension

Paper • 2306.14899 • Published Jun 26, 2023 • 1

MMBench: Is Your Multi-modal Model an All-around Player?

Paper • 2307.06281 • Published Jul 12, 2023 • 5

Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24, 2024 • 33

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 34

AI & ML interests

Recent Activity

Team members 6

LongVa's activity