DeepGlint-AI/mlcd-vit-bigG-patch14-448 Image Feature Extraction • 2B • Updated May 13, 2025 • 727 • 4
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks Paper • 2502.17832 • Published Feb 25, 2025 • 6
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 104k • • 1.55k
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning Paper • 2509.25760 • Published Sep 30, 2025 • 55
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering Paper • 2509.17396 • Published Sep 22, 2025 • 19
UserBench: An Interactive Gym Environment for User-Centric Agents Paper • 2507.22034 • Published Jul 29, 2025 • 29
Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published Jul 8, 2025 • 47
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding Paper • 2506.15745 • Published Jun 18, 2025 • 13