ZhuofengLi
2 followers · 6 following
https://github.com/Zhuofeng-Li
Zhuofeng-Li
zhuofeng-li-6a528626a
AI & ML interests
None yet
Recent Activity
authored a paper • 4 days ago • VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
upvoted a paper • 4 days ago • VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
liked a dataset • 18 days ago • PersonalAILab/AFM-WebAgent-RL-Dataset
Organizations
ZhuofengLi's models (12)
ZhuofengLi/octo-science-qwen2.5-7b-grpo-step-40-v2 • 2B • Updated Aug 3 • 12
ZhuofengLi/octo-search-qwen2.5-7b-grpo-155-step-v1 • 8B • Updated Jul 29 • 13
ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5 • 2B • Updated Jul 28 • 11
ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step • Text Generation • 8B • Updated Jul 14 • 10
ZhuofengLi/xlam-reason-lora-sft-1340-step • Text Generation • 3B • Updated Jul 13 • 9
ZhuofengLi/tool-n1-reason-lora-sft-800-step • Text Generation • 8B • Updated Jul 4 • 10
ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct • Text Generation • 8B • Updated Mar 30 • 6
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct • Text Generation • 2B • Updated Mar 30 • 6
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup • Text Generation • 2B • Updated Mar 28 • 11
ZhuofengLi/Qwen2.5-1.5B-Open-R1-GRPO • Updated Mar 26
ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct-wo-warmup • Text Generation • 8B • Updated Mar 25 • 6
ZhuofengLi/SciBART-original • Updated Jul 4, 2024 • 20