Hugging Face
YANG ZHOU
BAOLONGZHANSHEN
IANNXANG
AI & ML interests
RLHF and DPO
Recent Activity
authored a paper 12 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
upvoted a paper 12 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
commented on a paper 12 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Organizations
None yet
BAOLONGZHANSHEN's activity
liked a dataset about 2 months ago
2077AIDataFoundation/VeriGUI
Viewer • Updated about 1 month ago • 25 • 3.09k • 21
liked a model 6 months ago
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text • 8B • Updated Jan 25 • 4.41k • 219