Ribbit Ribbit

ribbitribbit365

https://RibbitRibbit.co

AI & ML interests

None yet

Recent Activity

commented on a paper 9 days ago

Distillation Scaling Laws

upvoted a paper 9 days ago

Distillation Scaling Laws

commented on a paper 12 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Organizations

None yet

ribbitribbit365's activity

commented on a paper 9 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 11 days ago • 43

commented on a paper 12 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 13 days ago • 134

commented on a paper 13 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 16 days ago • 88

commented on a paper 15 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 18 days ago • 42

commented on a paper 20 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 23 days ago • 105

commented on a paper 22 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 26 days ago • 106

commented on a paper 27 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published about 1 month ago • 51

commented on a paper 30 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 83

commented on 6 papers about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 69

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 273

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 273

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 53

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 89