ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models Paper • 2508.18773 • Published 12 days ago • 14 • 3
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 55 • 8
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 55 • 8
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization Paper • 2406.11431 • Published Jun 17, 2024 • 4 • 2