Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper β’ 2503.16219 β’ Published 15 days ago β’ 46
Running on Zero 60 60 Splatt3R - Zero-shot Gaussian Splatting from Uncalibarated Image Pairs β° Generate 3D scenes from one or two images
Running on L4 1.73k 1.73k MagicQuill πͺΆ Edit and enhance images with custom color and edge modifications
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper β’ 2503.07536 β’ Published 24 days ago β’ 83
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Paper β’ 2503.10437 β’ Published 22 days ago β’ 30
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper β’ 2502.15007 β’ Published Feb 20 β’ 169
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper β’ 2502.14499 β’ Published Feb 20 β’ 188
ReLearn: Unlearning via Learning for Large Language Models Paper β’ 2502.11190 β’ Published Feb 16 β’ 29
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper β’ 2502.12115 β’ Published Feb 17 β’ 43
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper β’ 2502.08910 β’ Published Feb 13 β’ 147