Submitted by akhaliq 27 HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models · 7 authors 6
Submitted by akhaliq 14 DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design · 5 authors 2
Submitted by akhaliq 10 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling · 7 authors
Submitted by akhaliq 8 Branch-Solve-Merge Improves Large Language Model Evaluation and Generation · 6 authors
Submitted by akhaliq 7 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models · 5 authors 2
Submitted by akhaliq 7 Localizing and Editing Knowledge in Text-to-Image Generative Models · 5 authors 2
Submitted by akhaliq 5 Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs · 7 authors 2
Submitted by akhaliq 2 InstructExcel: A Benchmark for Natural Language Instruction in Excel · 10 authors 2