Andrei Semenov

Andron00e

https://andron00e.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a collection 3 days ago

commented on a paper 3 days ago

Benchmarking Optimizers for Large Language Model Pretraining

authored a paper 3 days ago

Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed

Organizations

authored 2 papers 3 days ago

Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed

Paper • 2406.04443 • Published Jun 6, 2024

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published 5 days ago • 21

authored a paper over 1 year ago

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Paper • 2404.03323 • Published Apr 4, 2024 • 3