Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE Paper • 2311.02684 • Published Nov 5, 2023
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception Paper • 2312.07472 • Published Dec 12, 2023 • 2
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy Paper • 2203.07845 • Published Mar 15, 2022
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control Paper • 2403.12037 • Published Mar 18, 2024 • 1
Assessment of Multimodal Large Language Models in Alignment with Human Values Paper • 2403.17830 • Published Mar 26, 2024
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model Paper • 2406.12030 • Published Jun 17, 2024
GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing Paper • 2407.00600 • Published Jun 30, 2024
Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation Paper • 2410.09403 • Published Oct 12, 2024
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens Paper • 2412.09919 • Published Dec 13, 2024 • 1
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems Paper • 2503.03686 • Published Mar 5, 2025 • 1
CompBench: Benchmarking Complex Instruction-guided Image Editing Paper • 2505.12200 • Published May 18, 2025
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems Paper • 2505.16988 • Published May 22, 2025
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Paper • 2506.09049 • Published Jun 10, 2025 • 36
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset Paper • 2507.03483 • Published Jul 4, 2025 • 23
OASIS: Open Agent Social Interaction Simulations with One Million Agents Paper • 2411.11581 • Published Nov 18, 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Paper • 2403.19622 • Published Mar 28, 2024
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists Paper • 2508.15126 • Published 18 days ago • 19
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published 5 days ago • 155
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published Mar 30, 2025 • 41