Self-Rewarding Vision-Language Model via Reasoning Decomposition Paper • 2508.19652 • Published 11 days ago • 79
Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation Paper • 2506.15068 • Published Jun 18 • 14
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos Paper • 2505.01481 • Published May 2 • 3