Jerry Huang PRO
jerry128
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
updated a dataset 2 months ago
jerry128/rag-rl-sft-linear
updated a dataset 2 months ago
jerry128/rag-rl-sft-min-max