Benchmarking Optimizers for Large Language Model Pretraining • Paper • 2509.01440 • Published 5 days ago • 20
Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning • Paper • 2412.11689 • Published Dec 16, 2024 • 2
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning • Paper • 2404.03323 • Published Apr 4, 2024 • 3