Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
3
Zhenghai Xue
ZhenghaiXue
Follow
zwt963's profile picture
rodoxcasta's profile picture
21world's profile picture
4 followers
·
8 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
authored
a paper
about 16 hours ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
authored
a paper
about 23 hours ago
AgentStudio: A Toolkit for Building General Virtual Agents
upvoted
a
paper
3 days ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
View all activity
Organizations
ZhenghaiXue
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
6 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
685B
•
Updated
Mar 27
•
370k
•
•
12.7k
liked
a model
9 months ago
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B
Text Classification
•
Updated
7 days ago
•
570
•
51
liked
a model
10 months ago
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
7B
•
Updated
Oct 30, 2024
•
8
•
1