
MMIE/MMIE-Score
Image-Text-to-Text
•
4B
•
Updated
•
1
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
\n\n[📖 Project]\n[📄 Paper]\n[💻 Code]\n[📝 Dataset]\n[🤖 Evaluation Model]\n[🏆 Leaderboard]\n
\nMMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
\n\n[📖 Project]\n[📄 Paper]\n[💻 Code]\n[📝 Dataset]\n[🤖 Evaluation Model]\n[🏆 Leaderboard]\n
\nWe introduce MMIE, a robust, knowledge-intensive benchmark to evaluate interleaved multimodal comprehension and generation in LVLMs. With 20K+ examples covering 12 fields and 102 subfields, MMIE is definitely setting new standards for testing the depths of multimodal understanding.
\n