Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jerry Huang's picture
3 1

Jerry Huang PRO

jerry128
smadala2's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
updated a dataset 2 months ago
jerry128/rag-rl-sft-linear
updated a dataset 2 months ago
jerry128/rag-rl-sft-min-max
View all activity

Organizations

RAG-RL's profile picture RANK-RL's profile picture Agents-MCP-Hackathon's profile picture ScaleChemistry's profile picture

upvoted a paper 1 day ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published 3 days ago • 18
upvoted a paper 3 months ago

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Paper • 2503.12759 • Published Mar 17 • 1
upvoted a paper 6 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略