Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Longinus02 's Collections
Agent Evaluation
Dataset
GUI Evaluation

GUI Evaluation

updated Jul 22
Upvote
-

  • GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies

    Paper • 2506.14477 • Published Jun 17

  • VideoGUI: A Benchmark for GUI Automation from Instructional Videos

    Paper • 2406.10227 • Published Jun 14, 2024 • 9

  • GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

    Paper • 2506.03143 • Published Jun 3 • 52

  • UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

    Paper • 2503.21620 • Published Mar 27 • 63

  • GTA1: GUI Test-time Scaling Agent

    Paper • 2507.05791 • Published Jul 8 • 25

  • CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning

    Paper • 2503.03743 • Published Mar 5

  • GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

    Paper • 2507.01006 • Published Jul 1 • 234
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略