DPO datasets
updated
Viewer
• Updated • 7.5k • 561
• 171
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
• Updated • 7.56k • 2.4k
• 182
llamafactory/DPO-En-Zh-20k
Viewer
• Updated • 20k • 322
• 99
argilla/distilabel-intel-orca-dpo-pairs
Viewer
• Updated • 12.9k • 6.23k
• 183
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 8.12k
• 162
argilla/distilabel-math-preference-dpo
Viewer
• Updated • 2.42k • 225
• 88
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated • 7.99k • 26
• 11
jondurbin/truthy-dpo-v0.1
Viewer
• Updated • 1.02k • 394
• 136
YeungNLP/ultrafeedback_binarized
Viewer
• Updated • 63.1k • 18
• 1
shibing624/DPO-En-Zh-20k-Preference
Viewer
• Updated • 20k • 212
• 18
Preview
• Updated • 25
• 6
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated • 44.2k • 615
• 301
Viewer
• Updated • 15.3k • 23
• 19
jondurbin/gutenberg-dpo-v0.1
Viewer
• Updated • 918 • 424
• 162
CyberNative/Code_Vulnerability_Security_DPO
Viewer
• Updated • 4.66k • 1.28k
• 154
mlabonne/orpo-dpo-mix-40k-flat
Viewer
• Updated • 44.2k • 13
• 14
selimc/orpo-dpo-mix-TR-20k
Viewer
• Updated • 19.9k • 24
• 7
efederici/alpaca-vs-alpaca-orpo-dpo
Viewer
• Updated • 49.2k • 83
• 7
Viewer
• Updated • 2.42k • 26
• 10
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer
• Updated • 273k • 1.92k
• 26
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer
• Updated • 337k • 81
• 19
HuggingFaceH4/ultrafeedback_binarized
Viewer
• Updated • 187k • 12.3k
• 329
Preview
• Updated • 1.36k
• 211
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer
• Updated • 361k • 39
• 6
Viewer
• Updated • 450k • 14.2k
• 734
qihoo360/Light-R1-DPOData
Viewer
• Updated • 2.97k • 135
• 29