RotBench: Evaluating Multimodal Large Language Models on Identifying Image Rotation Paper • 2508.13968 • Published 1 day ago • 2
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence Paper • 2508.13992 • Published 1 day ago • 3 • 2
MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published 12 days ago • 15 • 2
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations Paper • 2508.09789 • Published 8 days ago • 4 • 4
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents Paper • 2508.13186 • Published 7 days ago • 10 • 4
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published 15 days ago • 76 • 6
Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts Paper • 2508.13320 • Published 2 days ago • 1 • 2
Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends Paper • 2508.11548 • Published 6 days ago • 5 • 2
A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models Paper • 2508.12903 • Published 3 days ago • 9 • 2
Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge Paper • 2508.08777 • Published 9 days ago • 10 • 2
Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation Paper • 2508.12040 • Published 5 days ago • 10 • 2
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents Paper • 2508.04038 • Published 15 days ago • 1 • 2
Leveraging Large Language Models for Predictive Analysis of Human Misery Paper • 2508.12669 • Published 3 days ago • 10 • 2
MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation Paper • 2508.11032 • Published 6 days ago • 2 • 2
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer Paper • 2508.09131 • Published 8 days ago • 10 • 2
Beyond Human Judgment: A Bayesian Evaluation of LLMs' Moral Values Understanding Paper • 2508.13804 • Published 1 day ago • 1 • 2
Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation Paper • 2508.13998 • Published 1 day ago • 11 • 2
Retrieval-augmented reasoning with lean language models Paper • 2508.11386 • Published 6 days ago • 2 • 2