Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
2
YANG ZHOU
BAOLONGZHANSHEN
Follow
IANNXANG
AI & ML interests
RLHF and DPO
Recent Activity
authored
a paper
11 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
upvoted
a
paper
11 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
commented
on
a paper
11 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
View all activity
Organizations
None yet
Papers
2
arxiv:
2508.16949
arxiv:
2508.04026
models
0
None public yet
datasets
0
None public yet