arxiv:2510.19779
Jiaxin Guo
xinyuerufei
AI & ML interests
Large Language Models
Recent Activity
upvoted a paper 17 days ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
upvoted a paper about 1 month ago
Black-Box On-Policy Distillation of Large Language Models
authored a paper 2 months ago
Reward Reasoning Model
Organizations
None yet