Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper • 2503.20785 • Published 13 days ago • 20
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals Paper • 2503.19953 • Published 14 days ago • 3
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement Paper • 2503.04919 • Published Mar 6 • 7
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 13 days ago • 42
AMD-Hummingbird: Towards an Efficient Text-to-Video Model Paper • 2503.18559 • Published 15 days ago • 5
Cosmos Transfer1 Collection Multimodal Conditional World Generation for World2World Transfer • 5 items • Updated 4 days ago • 14
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 15 days ago • 17
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning Paper • 2503.18013 • Published 16 days ago • 18
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Paper • 2503.16422 • Published 19 days ago • 14
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper • 2503.16365 • Published 19 days ago • 38
Physical AI Collection Collection of commercial-grade datasets for physical AI developers • 10 items • Updated 4 days ago • 37