Research Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Travel bitext/Bitext-travel-llm-chatbot-training-dataset Viewer • Updated Aug 22, 2024 • 31.7k • 138 alexlawtengyi/travel_agentv1 Viewer • Updated Nov 22, 2024 • 691 • 59 • 1 yananchen/travelplanner_faft_filter_label45_pos517_neg1959 Viewer • Updated Nov 18, 2024 • 2k • 38 osunlp/TravelPlanner Viewer • Updated Jul 14, 2024 • 1.23k • 2.08k • 51