AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3, 2025
DASH: Detection and Assessment of Systematic Hallucinations of VLMs Paper • 2503.23573 • Published Mar 30, 2025
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025