Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published 13 days ago • 206
EnvScaler Collection The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 8 items • Updated 11 days ago • 3
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 288
view article Article Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR 18 days ago • 67
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 3 days ago • 78
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 19 days ago • 26
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI 18 days ago • 59
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 29 days ago • 25
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 251
Optimal Sparsity Math Collection Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks • 67 items • Updated Aug 19, 2025 • 2
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Dec 17, 2025 • 45
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 115
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 4 days ago • 48