Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alexander Hägele's picture
2 1

Alexander Hägele

haeggee
aryopg's profile picture fabian-sp's profile picture
·
https://haeggee.github.io
  • haeggee
  • haeggee

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago
swiss-ai/Apertus-70B-Instruct-2509
upvoted a paper 3 days ago
Benchmarking Optimizers for Large Language Model Pretraining
upvoted a collection 4 days ago
Apertus LLM
View all activity

Organizations

EPFL Machine Learning and Optimization Laboratory's profile picture Inverse Scaling's profile picture

authored 2 papers about 2 months ago

BaCaDI: Bayesian Causal Discovery with Unknown Interventions

Paper • 2206.01665 • Published Jun 3, 2022 • 2

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27
authored a paper 7 months ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published Jan 31 • 7
authored a paper over 1 year ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略