Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation? • Paper • 2508.19827 • Published 12 days ago • 30
HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants • Paper • 2405.09186 • Published May 15, 2024 • 22