ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5
Safetensors
qwen2
ZhuofengLi committed on Jul 28
Commit 0ba4f74 · verified · 1 Parent(s): 244311b
Create README.md
Files changed (1)
README.md ADDED (+2 -0)
@@ -0,0 +1,2 @@
+Dataset: nq_search (octo-search prompt)
+Training curve: https://wandb.ai/1004271927-SHU/torl/runs/3gtqdke9?nw=nwuser1004271927