2 15 1

Jiwen Yu

VictorYuki

AI & ML interests

None yet

Recent Activity

updated a dataset 4 days ago

VictorYuki/test

upvoted a paper 7 days ago

Personalized Text-to-Image Generation with Auto-Regressive Models

upvoted a paper 11 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

View all activity

Organizations

VictorYuki's activity

upvoted a paper 7 days ago

Personalized Text-to-Image Generation with Auto-Regressive Models

Paper • 2504.13162 • Published 7 days ago • 17

upvoted a paper 11 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 13 days ago • 47

upvoted a paper 14 days ago

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published 14 days ago • 28

upvoted 2 papers 23 days ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published 24 days ago • 75

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published 24 days ago • 38

upvoted 4 papers about 1 month ago

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21 • 62

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published Mar 20 • 35

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20 • 40

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

upvoted a paper 3 months ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 66

upvoted a collection 4 months ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 1 day ago • 283

upvoted a paper 6 months ago

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Paper • 2410.10816 • Published Oct 14, 2024 • 21

upvoted a paper 7 months ago

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 37

upvoted a paper about 1 year ago

GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

Paper • 2402.14810 • Published Feb 22, 2024 • 9

upvoted a paper over 1 year ago

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators

Paper • 2312.03793 • Published Dec 6, 2023 • 18