Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents Paper • 2602.07796 • Published 9 days ago • 7
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense Paper • 2510.07242 • Published Oct 8, 2025 • 30
Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models? Paper • 2508.17536 • Published Aug 24, 2025 • 1