🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? By Kseniase • 22 days ago • 134
Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning By burtenshaw • 7 days ago • 18
Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers? By Kseniase and 1 other • 4 days ago • 13
Everyone Can Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener By PrunaAI and 2 others • 4 days ago • 10
Deepsite:HuggingFace Founder Introduces Free Web-Based 'Cursor' Alternative By LLMhacker • 7 days ago • 9
Training Large Language Models with Interpreter Feedback using WebAssembly By axolotl-ai-co and 1 other • 5 days ago • 8
Porting Pi0-FAST to LeRobot from JAX to PyTorch: Challenges, Fixes, and Open Questions By danaaubakirova and 3 others • 6 days ago • 7
Ghibli AI: The Ultimate Guide to Studio Ghibli-Style AI Image Generation By LLMhacker • 9 days ago • 10
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 95
🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? By Kseniase • 22 days ago • 134
Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning By burtenshaw • 7 days ago • 18
Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers? By Kseniase and 1 other • 4 days ago • 13
Everyone Can Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener By PrunaAI and 2 others • 4 days ago • 10
Deepsite:HuggingFace Founder Introduces Free Web-Based 'Cursor' Alternative By LLMhacker • 7 days ago • 9
Training Large Language Models with Interpreter Feedback using WebAssembly By axolotl-ai-co and 1 other • 5 days ago • 8
Porting Pi0-FAST to LeRobot from JAX to PyTorch: Challenges, Fixes, and Open Questions By danaaubakirova and 3 others • 6 days ago • 7
Ghibli AI: The Ultimate Guide to Studio Ghibli-Style AI Image Generation By LLMhacker • 9 days ago • 10
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 95