Submitted by yichaodu · 53 · MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? · 19 authors · 5
Submitted by FeYuan · 35 · LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages · 5 authors · 2
Submitted by xhluca · 29 · Learning Action and Reasoning-Centric Image Editing from Videos and Simulations · 7 authors · 2
Submitted by ethanchern · 21 · ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation · 4 authors · 4
Submitted by fredsala · 16 · Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction · 4 authors · 1
Submitted by akhaliq · 13 · UltraEdit: Instruction-based Fine-Grained Image Editing at Scale · 10 authors · 1
Submitted by wyt2000 · 12 · InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct · 16 authors · 2
Submitted by myownskyW7 · 12 · Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images · 10 authors · 1
Submitted by ThomasFEL · 5 · Understanding Visual Feature Reliance through the Lens of Complexity · 5 authors · 1
Submitted by TranSirius · 2 · LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking · 8 authors · 1
Submitted by vanilla1116 · 1 · ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models · 6 authors · 3