AI & ML interests
None yet
Organizations
Datasets: 26
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref • Viewer • Updated • 48.8k • 12
august66/hh_qwen2.5_1.5b_with_bias_bt_pref • Viewer • Updated • 18k • 10
august66/hh_qwen2.5_1.5b_with_bias • Viewer • Updated • 18k • 11
august66/drpo_hh_qwen2.5_1.5b • Viewer • Updated • 43.8k • 5
august66/dpo_reward_dist_pi_theta_prompt_3 • Viewer • Updated • 5k • 5
august66/dpo_reward_dist_pi_theta_prompt_2 • Viewer • Updated • 5k • 12
august66/dpo_reward_dist_pi_theta • Viewer • Updated • 5k • 6
august66/reward_distribution_2_tldr_openassist_pi_ref • Viewer • Updated • 5k • 10
august66/reward_distribution_2_tldr_openassist_pi_theta • Viewer • Updated • 5k • 7
august66/reward_distribution_tldr_openassist_pi_theta • Viewer • Updated • 5k • 9