HerrHruby/online_acemath_rl_4b_inst_hard_16k_thinking_vanilla_like_step_90 4B • Updated 6 days ago • 18
HerrHruby/online_acemath_rl_4b_inst_hard_16k_thinking_no_summ_thinking_step_90 4B • Updated 7 days ago • 51
HerrHruby/2k_hard_5_05_8_steps_e2e_explore_small_boxed_all_bonus_pos_turn_num_2_mbzs_stable_230_steps 2B • Updated 27 days ago • 1.2k
HerrHruby/2k_hard_5_05_8_steps_e2e_explore_small_boxed_all_bonus_pos_turn_num_2_mbzs_stable_170_steps 2B • Updated 27 days ago • 298
HerrHruby/2k_hard_5_05_8_steps_e2e_explore_small_boxed_all_bonus_pos_turn_num_2_mbzs_stable_130_steps 2B • Updated 27 days ago • 27
HerrHruby/reasoning_cache_deepscalr_16k_1p7b_sft_e2e_summaries_2048_18k Viewer • Updated Sep 19 • 18.4k • 8