wenxueru's picture

5 14

wenxueru

Aunderline

·

https://github.com/wenxueru

Aunderline

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

upvoted a paper 2 months ago

Reinforcement Pre-Training

authored a paper 3 months ago

Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic

View all activity

Organizations

None yet

Aunderline 's models

None public yet