HoloPart: Generative 3D Part Amodal Segmentation Paper • 2504.07943 • Published about 20 hours ago • 21
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 10 days ago • 23
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 4 items • Updated about 7 hours ago • 52
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 3 days ago • 121
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published 9 days ago • 30
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 3 days ago • 56
Science-T2I Collection Addressing Scientific Illusions in Image Synthesis • 9 items • Updated 7 days ago • 2
Scaling Language-Free Visual Representation Learning Paper • 2504.01017 • Published 10 days ago • 25
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 12 days ago • 116
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 15 days ago • 83
Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 21 days ago • 34
TULIP: Towards Unified Language-Image Pretraining Paper • 2503.15485 • Published 23 days ago • 44
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 24 days ago • 116
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 24 days ago • 137
Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption Paper • 2503.09279 • Published about 1 month ago • 5
Autoregressive Image Generation with Randomized Parallel Decoding Paper • 2503.10568 • Published 29 days ago • 8
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published Mar 7 • 35