zhangchenxu/Llama-3.2-3B-Instruct-t2_25k_v2_tag4_processed Text Generation • 3B • Updated about 3 hours ago
zhangchenxu/Llama-3.1-8B-Instruct-t2_25k_v2_tag4_processed Text Generation • 8B • Updated about 3 hours ago
zhangchenxu/Qwen2.5-3B-Instruct-t2_25k_v2_tag4_processed Text Generation • 3B • Updated about 4 hours ago
zhangchenxu/Qwen2.5-Coder-7B-Instruct-t2_25k_v2_tag4_processed Text Generation • 8B • Updated about 4 hours ago
zhangchenxu/Qwen2.5-Coder-3B-Instruct-t2_25k_v2_tag4_processed Text Generation • 3B • Updated about 5 hours ago
zhangchenxu/Qwen2.5-7B-Instruct-t2_25k_v2_tag4_processed Text Generation • 8B • Updated about 5 hours ago
zhangchenxu/Qwen2.5-14B-Instruct-KimiK2_1T_SFT-LR2.0e-5-EPOCHS2 0.0B • Updated about 14 hours ago • 3
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step312 4B • Updated 22 days ago • 20
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step288 4B • Updated 22 days ago • 19
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step256 4B • Updated 22 days ago • 77
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step224 4B • Updated 22 days ago • 18
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step192 4B • Updated 22 days ago • 19
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step160 4B • Updated 22 days ago • 19
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step128 4B • Updated 22 days ago • 17
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step96 4B • Updated 22 days ago • 20
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step64 4B • Updated 22 days ago • 19
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step320 8B • Updated 22 days ago • 19