Thinking About Thinking: Evaluating Reasoning in Post-Trained Language Models Paper • 2510.16340 • Published Oct 18 • 8
IPO: Your Language Model is Secretly a Preference Classifier Paper • 2502.16182 • Published Feb 22 • 2