Ilya Berg's picture

5 2

Ilya Berg

Pitosha

·

AI & ML interests

NLP, RL, LLM, robotics

Organizations

upvoted an article 8 months ago

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

May 7, 2025

•

59

upvoted a collection 10 months ago

Gemma 3 Release

28 items • Updated Aug 11, 2025 • 577

upvoted a paper 10 months ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 232

upvoted 2 papers 11 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 193

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113