metadata

library_name: transformers
license: apache-2.0
pipeline_tag: image-text-to-text
tags:
  - multimodal
  - gui

MobiMind-Grounder-3B Model

This is the Grounder Model of MobiAgent with 3B parameters, capable of low-level UI element grounding in GUI agent task execution, as presented in the paper MobiAgent: A Systematic Framework for Customizable Mobile Agents.

About MobiAgent

MobiAgent is a powerful mobile agent system including:

System Architecture:

## Usage

Deploy model inference service with vLLM:

vllm serve IPADS-SAI/MobiMind-Grounder-3B

For more usage details, e.g., execute GUI tasks with ADB or our Android App, please refer to our repo!