4 4 3

Xiaoqian Shen

shenxq

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

shenxq/VideoChat2:Update README.md

liked a dataset 3 months ago

lmms-lab/LLaVA-Video-178K

liked a dataset 3 months ago

LongVideos/LongVideoDB-373K-IterCap

View all activity

Organizations

None yet

shenxq's activity

New activity in shenxq/VideoChat2 about 2 months ago

Update README.md

#3 opened about 2 months ago by

Vision-CAIR

liked 2 datasets 3 months ago

lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11, 2024 • 1.63M • 15.4k • 137

LongVideos/LongVideoDB-373K-IterCap

Viewer • Updated Dec 22, 2024 • 250k • 98 • 2

New activity in Vision-CAIR/LongVU_Qwen2_7B 6 months ago

Update README.md

#5 opened 6 months ago by

shenxq

updated a model 6 months ago

Vision-CAIR/LongVU_Qwen2_7B

Video-Text-to-Text • Updated Feb 28 • 250 • 69

authored 7 papers 6 months ago

ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions

Paper • 2303.06594 • Published Mar 12, 2023

MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

Paper • 2304.10592 • Published Apr 20, 2023

StoryGPT-V: Large Language Models as Consistent Story Visualizers

Paper • 2312.02252 • Published Dec 4, 2023 • 1

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

Paper • 2308.16349 • Published Aug 30, 2023

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Paper • 2404.03413 • Published Apr 4, 2024 • 29

Goldfish: Vision-Language Understanding of Arbitrarily Long Videos

Paper • 2407.12679 • Published Jul 17, 2024 • 8

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 29

upvoted a paper 6 months ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 29

commented a paper 6 months ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 29 •

updated 2 datasets 6 months ago

shenxq/OneVision

Updated Oct 23, 2024 • 36

shenxq/VideoChat2

Viewer • Updated Feb 28 • 661k • 303 • 4

authored a paper 9 months ago

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Paper • 2408.03695 • Published Aug 7, 2024 • 13

updated 3 models about 1 year ago