Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andrei Semenov's picture
9 20 1

Andrei Semenov

Andron00e
EffyOsvin's profile picture
·
https://andron00e.github.io/
  • AndreiSemenov17
  • Andron00e
  • semenov-andrei-v
  • andreisemenov.bsky.social

AI & ML interests

None yet

Recent Activity

upvoted a collection 3 days ago
Apertus LLM
commented on a paper 3 days ago
Benchmarking Optimizers for Large Language Model Pretraining
authored a paper 3 days ago
Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed
View all activity

Organizations

EPFL Machine Learning and Optimization Laboratory's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture opt-ssm's profile picture Intelligent-systems's profile picture BRAIn Lab's profile picture

authored 2 papers 3 days ago

Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed

Paper • 2406.04443 • Published Jun 6, 2024

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published 5 days ago • 21
authored a paper over 1 year ago

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Paper • 2404.03323 • Published Apr 4, 2024 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略