Li Yunshui
Wa2erGo
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for
RLVR
authored
a paper
3 months ago
Ruler: A Model-Agnostic Method to Control Generated Length for Large
Language Models
authored
a paper
3 months ago
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement
Learning