SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning Paper • 2505.16186 • Published May 22 • 7
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models Paper • 2505.21523 • Published May 23 • 14
Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models Paper • 2506.00258 • Published May 30 • 3
"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models Paper • 2507.13428 • Published Jul 17 • 15
"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models Paper • 2507.13428 • Published Jul 17 • 15
Agents of Change: Self-Evolving LLM Agents for Strategic Planning Paper • 2506.04651 • Published Jun 5 • 8
Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models Paper • 2506.00258 • Published May 30 • 3
Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models Paper • 2506.00258 • Published May 30 • 3 • 1
Agents of Change: Self-Evolving LLM Agents for Strategic Planning Paper • 2506.04651 • Published Jun 5 • 8
Agents of Change: Self-Evolving LLM Agents for Strategic Planning Paper • 2506.04651 • Published Jun 5 • 8 • 2
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models Paper • 2505.21523 • Published May 23 • 14
VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation Paper • 2206.08522 • Published Jun 17, 2022
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published May 21 • 17
SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning Paper • 2505.16186 • Published May 22 • 7
SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning Paper • 2505.16186 • Published May 22 • 7 • 2
Multimodal Reasoning Collection A collection for Multimodal Reasoning Models and Benchmarks. • 5 items • Updated May 23