TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Paper • 2412.21037 • Published 3 days ago • 19
Efficiently Serving LLM Reasoning Programs with Certaindex Paper • 2412.20993 • Published 3 days ago • 26
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 14 days ago • 80
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 10 days ago • 38
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Paper • 2412.16112 • Published 13 days ago • 20
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper • 2412.15035 • Published 14 days ago • 4
IDOL: Instant Photorealistic 3D Human Creation from a Single Image Paper • 2412.14963 • Published 14 days ago • 5
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published 16 days ago • 13
CAD-Recode: Reverse Engineering CAD Code from Point Clouds Paper • 2412.14042 • Published 15 days ago • 5
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 15 days ago • 47
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework Paper • 2412.11713 • Published 17 days ago • 5
Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers Paper • 2412.12276 • Published 17 days ago • 15
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper • 2412.13171 • Published 16 days ago • 31
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 20 days ago • 80
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 21 days ago • 35