heegyu
's Collections
Reward Modeling Datasets
updated
Viewer
•
Updated
•
37.1k
•
1.93k
•
233
Viewer
•
Updated
•
169k
•
14k
•
1.27k
Viewer
•
Updated
•
386k
•
2.38k
•
297
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
•
164k
•
4.1k
•
126
openai/webgpt_comparisons
Viewer
•
Updated
•
19.6k
•
711
•
230
openai/summarize_from_feedback
Viewer
•
Updated
•
194k
•
2.1k
•
198
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
•
187k
•
6.4k
•
274
Viewer
•
Updated
•
183k
•
996
•
287
HuggingFaceH4/stack-exchange-preferences
Viewer
•
Updated
•
10.8M
•
1.16k
•
130
HuggingFaceH4/hhh_alignment
Viewer
•
Updated
•
221
•
744
•
18
Birchlabs/openai-prm800k-stepwise-critic
Viewer
•
Updated
•
1.09M
•
103
•
44
prometheus-eval/Feedback-Collection
Viewer
•
Updated
•
100k
•
572
•
107
argilla/OpenHermesPreferences
Viewer
•
Updated
•
989k
•
1.41k
•
204
Viewer
•
Updated
•
8.11k
•
10.1k
•
87
Viewer
•
Updated
•
21.4k
•
6.94k
•
404
Magpie-Align/Magpie-Pro-DPO-200K
Viewer
•
Updated
•
207k
•
22
•
6
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
625
•
220