YANG ZHOU's picture

1 2 2

YANG ZHOU

BAOLONGZHANSHEN

IANNXANG

AI & ML interests

RLHF and DPO

Recent Activity

authored a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

upvoted a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

commented on a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

View all activity

Organizations

None yet

Papers 2

arxiv:2508.16949

arxiv:2508.04026

models 0

None public yet

datasets 0

None public yet