BaCaDI: Bayesian Causal Discovery with Unknown Interventions Paper • 2206.01665 • Published Jun 3, 2022 • 2
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training Paper • 2501.18965 • Published Jan 31 • 7
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28, 2024 • 12