InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper • 2504.14239 • Published 3 days ago • 11
Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro-2.0 Text-to-Image • Updated about 12 hours ago • 11.9k • 188
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 21 days ago • 82
Running on Zero 183 183 CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner 🚀 Generate 3D models from images
Running on Zero 48 48 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate speech from text using reference audio