long-context-mllm - a xing0047 Collection

xing0047 's Collections

SAM

long-context-mllm

long-context-mllm

updated Oct 27, 2024

Visual Context Window Extension: A New Perspective for Long Video Understanding

Paper • 2409.20018 • Published Sep 30, 2024 • 9
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 55
Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24, 2024 • 32
lmms-lab/LongVA-7B-DPO

Text Generation • Updated Jun 26, 2024 • 926 • 7
lmms-lab/LongVA-7B

Text Generation • Updated Jun 26, 2024 • 944 • 15
FreedomIntelligence/LongLLaVA-9B

Image-Text-to-Text • Updated Oct 12, 2024 • 628 • 4
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

Paper • 2409.01071 • Published Sep 2, 2024 • 27
Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24, 2024 • 17
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 25