Benchmarking Optimizers for Large Language Model Pretraining • Paper • 2509.01440 • Published 5 days ago