mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated-4bit Text Generation • Updated 4 days ago • 90 • 3
mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated Text Generation • Updated 4 days ago • 61 • 2
Running • 1.44k • The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters