CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 3 days ago • 12
When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance By Nicolas-BZRD and 1 other • 5 days ago • 11
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 5 days ago • 10
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 2 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 73
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 227
CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 3 days ago • 12
When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance By Nicolas-BZRD and 1 other • 5 days ago • 11
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 5 days ago • 10
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 2 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 73
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 227