Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
3
Zhenghai Xue
ZhenghaiXue
Follow
21world's profile picture
JohnClema's profile picture
rodoxcasta's profile picture
4 followers
·
8 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
authored
a paper
about 16 hours ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
authored
a paper
about 23 hours ago
AgentStudio: A Toolkit for Building General Virtual Agents
upvoted
a
paper
3 days ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
View all activity
Organizations
ZhenghaiXue
's models
4
Sort: Recently updated
ZhenghaiXue/gigpo_qwen2.5_3b_sim0.3_step150
3B
•
Updated
Jul 30
•
9
ZhenghaiXue/gigpo_qwen2.5_3b_sim0.5_step150
3B
•
Updated
Jul 30
•
9
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
Reinforcement Learning
•
8B
•
Updated
Jul 8
•
12
ZhenghaiXue/Qwen2.5-32B-SimpleTIR
33B
•
Updated
Jul 8
•
9