Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Paper β’ 2412.04455 β’ Published Dec 5, 2024 β’ 38
VLSBench: Unveiling Visual Leakage in Multimodal Safety Paper β’ 2411.19939 β’ Published Nov 29, 2024 β’ 10
VLSBench: Unveiling Visual Leakage in Multimodal Safety Paper β’ 2411.19939 β’ Published Nov 29, 2024 β’ 10 β’ 2
stabilityai/stable-diffusion-3-medium-diffusers Text-to-Image β’ Updated Jun 19, 2024 β’ 102k β’ β’ 376
Running 542 542 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects