Update README.md
Browse files
README.md
CHANGED
@@ -82,7 +82,27 @@ Welcome to try out our online demo [here](https://dr.miromind.ai/). In this demo
|
|
82 |
|
83 |
### More Benchmarks
|
84 |
|
85 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
86 |
|
87 |
## Quick Start
|
88 |
|
|
|
82 |
|
83 |
### More Benchmarks
|
84 |
|
85 |
+
| Method | HLE<br>Pass@1 | Frames<br>Pass@1 | BrowseComp<br>Pass@1 | <span style="white-space:nowrap;">BrowseComp-ZH</span><br>Pass@1 | WebWalkerQA<br>Pass@1 |
|
86 |
+
|-------------------------------------------------------------------|:-------------:|:----------------:|:--------------------:|:----------------------------------------------------------------:|:---------------------:|
|
87 |
+
| OpenAI Deep Research | 26.6 | - | 51.5 | 42.9 | - |
|
88 |
+
| Gemini Deep Research | 26.9 | - | - | - | - |
|
89 |
+
| Kimi-Researcher | 26.9 | 78.8 | - | - | - |
|
90 |
+
| | | | | | |
|
91 |
+
| WebDancer-7B | - | - | - | - | 36.0 |
|
92 |
+
| WebSailor-7B | - | - | 6.7 | 14.2 | - |
|
93 |
+
| MiroThinker-8B-SFT-v0.1 | - | 58.0 | 5.5 | 9.3 | 41.3 |
|
94 |
+
| MiroThinker-8B-DPO-v0.1 | - | 64.4 | 8.7 | 13.6 | 45.7 |
|
95 |
+
| | | | | | |
|
96 |
+
| WebThinker-32B-RL | - | - | - | - | 46.5 |
|
97 |
+
| WebDancer-QwQ-32B | - | - | 3.8 | 18.0 | 47.9 |
|
98 |
+
| WebSailor-32B | - | - | 10.5 | 25.5 | - |
|
99 |
+
| WebShaper-32B | - | - | - | - | 51.4 |
|
100 |
+
| MiroThinker-32B-SFT-v0.1 | 10.2 | 70.4 | 10.6 | 13.8 | 45.7 |
|
101 |
+
| <span style="white-space:nowrap;">MiroThinker-32B-DPO-v0.1</span> | 11.8 | 71.7 | 13.0 | 17.0 | 49.3 |
|
102 |
+
|
103 |
+
1. MiroThinker’s performance was tested with [this repository](https://github.com/MiroMindAI/MiroThinker) and open-source tools; other models’ results are from their papers and official sites.
|
104 |
+
|
105 |
+
2. As [MiroVerse-v0.1](https://huggingface.co/datasets/miromind-ai/MiroVerse-v0.1) mainly contains English data, the model’s Chinese capability is limited. We plan to add more Chinese data in the next version.
|
106 |
|
107 |
## Quick Start
|
108 |
|