Running 3.16k 3.16k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
gradientai/Llama-3-70B-Instruct-Gradient-1048k Text Generation • 71B • Updated Oct 28, 2024 • 22 • 122