Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step
FINAL-Bench
• • 16
Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages
adalat-ai
• • 11
Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law
EMO: Pretraining mixture of experts for emergent modularity
allenai
• • 37
KV Caching Explained: Optimizing Transformer Inference Efficiency
not-lain
• • 332
How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance
jeffboudier
• • 5
Code a simple RAG from scratch
ngxson
• • 335
Uncensor any LLM with abliteration
mlabonne
• • 853
Small Language Models (SLM): A Comprehensive Overview
jjokah
• • 153
NEO-unify: Building Native Multimodal Unified Models End to End
Self Evolving is the Endgame or final destiny
rajkumarrawal
• • 3
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch
AviSoori1x
• • 121
Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚
Isayoften
• • 96
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
NormalUhr
• • 121
Common AI Model Formats
ngxson
• • 72
Norm-Preserving Biprojected Abliteration
grimjim
• • 80
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
karina-zadorozhny
• • 18
LLM Architectures Explained: What Powers Today’s Top Models
NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots
Introducing the agentic robotics appstore for 10,000 Reachy Minis