OpenCUA: Open Foundations for Computer-Use Agents Collection This is the official versions of OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 7 items • Updated 26 days ago • 19
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL Paper • 2505.15436 • Published May 21 • 1
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published Feb 17 • 9