CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 6 days ago • 14
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 8 days ago • 16
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 5 days ago • 7
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 74
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 228
CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 6 days ago • 14
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 8 days ago • 16
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 5 days ago • 7
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 74
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 228