MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI Paper • 2311.16502 • Published Nov 27, 2023 • 35
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design Paper • 2312.04326 • Published Dec 7, 2023 • 3
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects Paper • 2312.05278 • Published Dec 8, 2023 • 3
Fostering Natural Conversation in Large Language Models with NICO: a Natural Interactive COnversation dataset Paper • 2408.09330 • Published Aug 18, 2024
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 20 days ago • 37
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 20 days ago • 37
Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support Paper • 2401.14688 • Published Jan 26, 2024 • 13