arxiv:2412.04445
Yixiao Ge
yxgeee
AI & ML interests
Computer Vision, Foundation Models
Recent Activity
authored
a paper
12 days ago
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation
authored
a paper
13 days ago
Moto: Latent Motion Token as the Bridging Language for Robot
Manipulation
Organizations
Papers
18
models
None public yet
datasets
None public yet