Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 7 days ago • 13
💥 Building a Vulnerable Bank MCP — Then Automating an Agent to Hack It By jdelavande and 2 others • 1 day ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 157
OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 30 days ago • 24
Introducing Cosmos Predict-2: A Foundation For Your Own World Model By nvidia and 2 others • 3 days ago • 5
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other • 16 days ago • 67
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions By nvidia and 1 other • 9 days ago • 12
Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 7 days ago • 13
💥 Building a Vulnerable Bank MCP — Then Automating an Agent to Hack It By jdelavande and 2 others • 1 day ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 157
OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 30 days ago • 24
Introducing Cosmos Predict-2: A Foundation For Your Own World Model By nvidia and 2 others • 3 days ago • 5
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other • 16 days ago • 67
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions By nvidia and 1 other • 9 days ago • 12