HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models Paper • 2512.09928 • Published 24 days ago • 11
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published Dec 2, 2025 • 40
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Paper • 2512.04356 • Published Dec 4, 2025 • 9
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models Paper • 2503.21781 • Published Mar 27, 2025 • 1
Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers Paper • 2311.17717 • Published Nov 29, 2023 • 2
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction Paper • 2506.12015 • Published Jun 13, 2025 • 4
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Paper • 2512.04356 • Published Dec 4, 2025 • 9 • 3
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Paper • 2512.04356 • Published Dec 4, 2025 • 9
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Paper • 2512.04356 • Published Dec 4, 2025 • 9 • 3