view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • Aug 5 • 489
Running 3.16k 3.16k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods Paper • 2502.01618 • Published Feb 3 • 10
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 58