Submitted by abhi1nandy2 50 YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models · 8 authors 9
Submitted by skrishna 24 Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation · 7 authors 3
Submitted by akhaliq 17 Portrait Video Editing Empowered by Multimodal Generative Priors · 6 authors 2
Submitted by sci-m-wang 11 Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts · 11 authors 2
Submitted by akhaliq 11 V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians · 8 authors 2
Submitted by marik0 8 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments · 3 authors 2
Submitted by amine-bh 4 LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents · 3 authors 2