raymond

raymond1113

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

upvoted a paper about 1 month ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

upvoted a paper 4 months ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Organizations

None yet

Models

None public yet

Datasets

None public yet