The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters • Running • 1.44k
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 • Text Generation • Updated Aug 7, 2024 • 4.63k • 22