Function Calling PO Dataset Function Calling Preference Optimization Datasets orion-research/Aura-Tooling-DPO-v3 Viewer • Updated Dec 7, 2024 • 3.8k • 18 GreenNode/RLHF_glaive_toolcall_en Viewer • Updated Jan 7 • 1.23k • 13 AymanTarig/Qwen2.5-0.5B-FC-v1-mistakes-critiques Viewer • Updated Nov 20, 2024 • 9.71k • 9 roborovski/glaive-tool-usage-dpo Viewer • Updated Feb 29, 2024 • 42k • 28 • 2
RLHF + Code Vezora/Code-Preference-Pairs Viewer • Updated Jul 28, 2024 • 54k • 151 • 26 quangduc1112001/python-code-DPO-fine-tune Viewer • Updated Nov 4, 2024 • 2k • 32 • 2 xinlai/Math-Step-DPO-10K Viewer • Updated Jul 4, 2024 • 10.8k • 402 • 57 minfeng-ai/leetcode_preference Viewer • Updated Sep 6, 2023 • 457 • 102 • 7
Function Calling PO Dataset Function Calling Preference Optimization Datasets orion-research/Aura-Tooling-DPO-v3 Viewer • Updated Dec 7, 2024 • 3.8k • 18 GreenNode/RLHF_glaive_toolcall_en Viewer • Updated Jan 7 • 1.23k • 13 AymanTarig/Qwen2.5-0.5B-FC-v1-mistakes-critiques Viewer • Updated Nov 20, 2024 • 9.71k • 9 roborovski/glaive-tool-usage-dpo Viewer • Updated Feb 29, 2024 • 42k • 28 • 2
RLHF + Code Vezora/Code-Preference-Pairs Viewer • Updated Jul 28, 2024 • 54k • 151 • 26 quangduc1112001/python-code-DPO-fine-tune Viewer • Updated Nov 4, 2024 • 2k • 32 • 2 xinlai/Math-Step-DPO-10K Viewer • Updated Jul 4, 2024 • 10.8k • 402 • 57 minfeng-ai/leetcode_preference Viewer • Updated Sep 6, 2023 • 457 • 102 • 7