Joya Chen

chenjoya

https://chenjoya.github.io/

AI & ML interests

Streaming Video LLM

chenjoya's activity

upvoted a paper 11 days ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published 12 days ago • 71

upvoted a paper 18 days ago

Impossible Videos

Paper • 2503.14378 • Published 19 days ago • 58

upvoted a paper 19 days ago

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Paper • 2503.13444 • Published 20 days ago • 15

New activity in chenjoya/videollm-online-8b-v1plus 19 days ago

About the inference efficiency

#3 opened 19 days ago by Abracdabra-H

upvoted a paper 24 days ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published 25 days ago • 44

upvoted a paper 26 days ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 27 days ago • 43

upvoted 3 papers about 1 month ago

DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles

Paper • 2503.03651 • Published Mar 5 • 16

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published Mar 3 • 42

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Paper • 2502.14397 • Published Feb 20 • 40

liked a dataset about 2 months ago

MCG-NJU/OVBench

Viewer • Updated 18 days ago • 1.46k • 436 • 4

upvoted 3 papers about 2 months ago

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published Feb 12 • 27

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11 • 43

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

Paper • 2502.01572 • Published Feb 3 • 20

liked a model 2 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.2M • 1.17k

upvoted 6 papers 3 months ago