Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models • Paper • 2509.24510 • Published Sep 29