The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters • Running • 1.36k
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14, 2025 • 273
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 55
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 330
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published Jan 2025 • 27
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 42
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper • 2412.00174 • Published Nov 29, 2024 • 23
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
GRAPE: Generalizing Robot Policy via Preference Alignment Paper • 2411.19309 • Published Nov 28, 2024 • 44
ResearchTown: Simulator of Human Research Community Paper • 2412.17767 • Published Dec 23, 2024 • 14