Navigating the Alignment-Calibration Trade-off: A Pareto-Superior Frontier via Model Merging Paper • 2510.17426 • Published Oct 20 • 1
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors Paper • 2510.17516 • Published Oct 20 • 2