sjtuzc commited on
Commit
55cf220
·
verified ·
1 Parent(s): f3cb6a0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -4
README.md CHANGED
@@ -1,10 +1,47 @@
1
  ---
2
- license: mit
3
  pipeline_tag: image-text-to-text
4
  tags:
5
- - multimodal
6
- - gui
7
  library_name: transformers
8
  ---
9
 
10
- # MobiMind-Grounder-3B Model
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
  pipeline_tag: image-text-to-text
4
  tags:
5
+ - multimodal
6
+ - gui
7
  library_name: transformers
8
  ---
9
 
10
+ # MobiMind-Grounder-3B Model
11
+
12
+ This is the Grounder Model of [MobiAgent](https://github.com/IPADS-SAI/MobiAgent) with 3B parameters, capable of low-level UI element grounding in GUI agent task execution.
13
+
14
+ ## About MobiAgent
15
+
16
+ **MobiAgent** is a powerful mobile agent system including:
17
+
18
+ * **An agent model family**: MobiMind
19
+ * **An agent acceleration framework**: AgentRR
20
+ * **An agent benchmark**: MobiFlow
21
+
22
+ **System Architecture:**
23
+
24
+ <div align="center">
25
+ <p align="center">
26
+ <img src="https://raw.githubusercontent.com/IPADS-SAI/MobiAgent/main/assets/arch.png" width="100%"/>
27
+ </p>
28
+ </div>
29
+
30
+ ## Evaluation Results
31
+
32
+ <table>
33
+ <tr>
34
+ <td><img src="https://raw.githubusercontent.com/IPADS-SAI/MobiAgent/main/assets/result1.png" width="100%"/></td>
35
+ <td><img src="https://raw.githubusercontent.com/IPADS-SAI/MobiAgent/main/assets/result2.png" width="100%"/></td>
36
+ <td><img src="https://raw.githubusercontent.com/IPADS-SAI/MobiAgent/main/assets/result3.png" width="100%"/></td>
37
+ </tr>
38
+ </table>
39
+ ## Usage
40
+
41
+ Deploy model inference service with vLLM:
42
+
43
+ ```bash
44
+ vllm serve IPADS-SAI/MobiMind-Grounder-3B
45
+ ```
46
+
47
+ For more usage details, e.g., execute GUI tasks with ADB or our Android App, please refer to our [repo](https://github.com/IPADS-SAI/MobiAgent)!