mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated-4bit Text Generation • Updated 4 days ago • 67 • 3
mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated Text Generation • Updated 3 days ago • 51 • 2
Running 1.42k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters