ZhuofengLi's picture

2 8 6

ZhuofengLi

ZhuofengLi

·

https://github.com/Zhuofeng-Li

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper 4 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

liked a dataset 18 days ago

PersonalAILab/AFM-WebAgent-RL-Dataset

View all activity

Organizations

ZhuofengLi 's models 12

ZhuofengLi/octo-science-qwen2.5-7b-grpo-step-40-v2

2B • Updated Aug 3 • 12

ZhuofengLi/octo-search-qwen2.5-7b-grpo-155-step-v1

8B • Updated Jul 29 • 13

ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5

2B • Updated Jul 28 • 11

ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step

Text Generation • 8B • Updated Jul 14 • 10

ZhuofengLi/xlam-reason-lora-sft-1340-step

Text Generation • 3B • Updated Jul 13 • 9

ZhuofengLi/tool-n1-reason-lora-sft-800-step

Text Generation • 8B • Updated Jul 4 • 10

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct

Text Generation • 8B • Updated Mar 30 • 6

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct

Text Generation • 2B • Updated Mar 30 • 6

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup

Text Generation • 2B • Updated Mar 28 • 11

ZhuofengLi/Qwen2.5-1.5B-Open-R1-GRPO

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct-wo-warmup

Text Generation • 8B • Updated Mar 25 • 6

ZhuofengLi/SciBART-original

Updated Jul 4, 2024 • 20