wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step50_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 8 hours ago
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-25_06-29-13_nvidia_balanced 4B • Updated 2 days ago • 10
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_step280_2026-01-25_06-28-54_nvidia_balanced 4B • Updated 2 days ago • 11
wenwenD/qwen3-4b-codeexp_grpo_with_prior_think_step280_2026-01-24_07-19-57_nvidia 4B • Updated 3 days ago • 14
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-24_07-21-36_nvdia 4B • Updated 3 days ago • 14
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_discount_always1_step175_2026_01_23_21_40_33 4B • Updated 3 days ago • 8
wenwenD/qwen7B-instruct-repo_sft_3epcs_w_context-synthetic_multiturn_sft_3epcs 8B • Updated Jun 16, 2025 • 1