OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use Paper • 2508.04482 • Published 15 days ago • 9
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization Paper • 2508.05731 • Published 13 days ago • 25
HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization Paper • 2508.04010 • Published 15 days ago • 8
InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities Paper • 2508.05496 • Published 14 days ago • 9
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 27 days ago • 81
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper • 2504.14239 • Published Apr 19 • 14
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper • 2501.04575 • Published Jan 8 • 24