Jerry Huang's picture

3 1

Jerry Huang PRO

jerry128

·

AI & ML interests

None yet

Organizations

upvoted a paper 5 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23

upvoted a paper 8 months ago

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Paper • 2503.12759 • Published Mar 17, 2025 • 1

upvoted a paper 11 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26, 2025 • 82