1 28 3

Jonathan LYS

jonathan-lys

jonathanlys01

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Distillation Scaling Laws

upvoted a paper 12 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

upvoted an article 25 days ago

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

View all activity

Organizations

jonathan-lys's activity

upvoted a paper 11 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 11 days ago • 43

upvoted a paper 12 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 16 days ago • 114

upvoted an article 25 days ago

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

•

25 days ago

• 16

upvoted a paper 2 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

upvoted 3 papers 3 months ago

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published Dec 3, 2024 • 109

TinyFusion: Diffusion Transformers Learned Shallow

Paper • 2412.01199 • Published Dec 2, 2024 • 14

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

upvoted 3 papers 4 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 78

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 64

upvoted a paper 5 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

upvoted an article 5 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

•

Sep 27, 2024

• 40

upvoted 2 papers 7 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 113

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 112

upvoted 3 papers 9 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 64

Phased Consistency Model

Paper • 2405.18407 • Published May 28, 2024 • 46

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

upvoted 3 papers about 1 year ago

Improving fine-grained understanding in image-text pre-training

Paper • 2401.09865 • Published Jan 18, 2024 • 17

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 60

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16, 2024 • 38