unsloth/Nemotron-3-Nano-30B-A3B-GGUF Text Generation • 32B • Updated about 18 hours ago • 74.7k • 164
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 25 days ago • 62
Running on CPU Upgrade Featured 2.72k The Smol Training Playbook 📚 2.72k The secrets to building world-class LLMs