583 313

Kai Zuberbühler

kaizuberbuehler

k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

updated a collection 1 day ago

Vision Language Models

Organizations

None yet

kaizuberbuehler's activity

updated 3 collections 1 day ago

Benchmarks

80 items • Updated 1 day ago • 1

Vision Language Models

77 items • Updated 1 day ago • 5

Agents

94 items • Updated 1 day ago • 3

upvoted a paper 1 day ago

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Paper • 2502.14282 • Published 3 days ago • 14

updated 2 collections 1 day ago

Code Generation

22 items • Updated 1 day ago

Reasoning, Thinking, RL and Test-Time Scaling

98 items • Updated 1 day ago • 2

upvoted a paper 1 day ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 3 days ago • 49

updated 2 collections 1 day ago

Benchmarks

80 items • Updated 1 day ago • 1

Agents

94 items • Updated 1 day ago • 3

upvoted a paper 1 day ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 3 days ago • 147

updated a collection 1 day ago

Reasoning, Thinking, RL and Test-Time Scaling

98 items • Updated 1 day ago • 2

upvoted a paper 1 day ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 6 days ago • 24

updated a collection 1 day ago

Benchmarks

80 items • Updated 1 day ago • 1

upvoted a paper 1 day ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published 4 days ago • 26

updated 3 collections 1 day ago

Leaderboards

28 items • Updated 1 day ago • 2

Vision Language Models

77 items • Updated 1 day ago • 5

Benchmarks

80 items • Updated 1 day ago • 1

upvoted a paper 1 day ago

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published 10 days ago • 38

updated a collection 1 day ago

Agents

94 items • Updated 1 day ago • 3

upvoted a paper 1 day ago

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published 11 days ago • 53