-
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Paper • 2402.03162 • Published • 17 -
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Paper • 2403.03853 • Published • 61 -
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
Paper • 2407.02371 • Published • 50
Pengxiang Li
pengxiang
AI & ML interests
Video generation, Image editing, AD
Recent Activity
upvoted
a
paper
5 days ago
Training Large Language Models to Reason in a Continuous Latent Space
upvoted
a
paper
20 days ago
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Organizations
None yet
Collections
1
Papers
1
models
6
pengxiang/TrackDiffusion_Pretrain
Updated
•
1
pengxiang/GLIGEN_1_4
Updated
•
4
pengxiang/TrackDiffusion_SVD_Stage1
Text-to-Video
•
Updated
pengxiang/TrackDiffusion_ModelScope
Text-to-Video
•
Updated
pengxiang/TrackDiffusion_SVD_Stage2
Text-to-Video
•
Updated
pengxiang/trackdiffusion_ytvis
Text-to-Video
•
Updated
•
2