Commit f8de16f (verified) by callanwu
1 parent: 7f9cc3f

Update README.md

Files changed (1)
  1. README.md +5 -1
README.md CHANGED
@@ -2,4 +2,8 @@
  license: mit
  base_model:
  - Qwen/QwQ-32B
- ---
+ ---
+ - A native agentic search and reasoning model that uses the ReAct framework, aimed at autonomous information-seeking agency and Deep Research-like models.
+ - We introduce a four-stage training paradigm comprising browsing data construction, trajectory sampling, supervised fine-tuning for an effective cold start, and reinforcement learning for improved generalization, enabling the agent to acquire autonomous search and reasoning skills.
+ - Our data-centric approach integrates trajectory-level supervised fine-tuning (SFT) and reinforcement learning (RL, using DAPO) to develop a scalable pipeline for training agentic systems via SFT or RL.
+ - WebDancer achieves a Pass@3 score of 61.1% on GAIA and 54.6% on WebWalkerQA.
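
The README reports Pass@3 scores but does not say how they are computed. The sketch below assumes the common reading of Pass@k: a question counts as solved if at least one of its first k attempts is judged correct. The function name `pass_at_k` and the toy outcomes are hypothetical and are not taken from the WebDancer code or from any benchmark run.

```python
from typing import Dict, List


def pass_at_k(attempts: Dict[str, List[bool]], k: int) -> float:
    """Fraction of questions with at least one correct answer among the first k attempts."""
    if not attempts:
        raise ValueError("no questions to score")
    # A question is solved if any of its first k outcomes is True.
    solved = sum(1 for outcomes in attempts.values() if any(outcomes[:k]))
    return solved / len(attempts)


# Hypothetical per-question outcomes (True = attempt judged correct).
# These are illustrative values only, not real GAIA or WebWalkerQA results.
toy_results = {
    "q1": [False, True, False],
    "q2": [False, False, False],
    "q3": [True, True, True],
    "q4": [False, False, True],
}

print(f"Pass@1 = {pass_at_k(toy_results, 1):.2f}")
print(f"Pass@3 = {pass_at_k(toy_results, 3):.2f}")
```

Under this assumed definition, Pass@3 can only be equal to or higher than Pass@1 on the same attempts, so the two numbers should not be compared directly across models.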