TrandeLik/augmentedrewardtrainer-qwen-qwen2.5-7b-instruct-trl-lib-ultrafeedback_binarized-n_epochs2-bs16 Updated 10 days ago
lemonhat/Qwen2.5-7B-Instruct-agenttuning_v4_15k_tag5-mini Text Generation • 8B • Updated 7 days ago • 10
agurung/v4_savebestearly_sft_qwen7B_25percent_lr_1e3_bptt_offset Text Generation • 8B • Updated 4 days ago • 4
agurung/v4_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset Text Generation • 8B • Updated 2 days ago • 35
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_6400_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 10
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_1280_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 10
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_3200_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 9
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_1600_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 9
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_8000_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 6
mlfoundations-dev/Qwen-7B-Inst_flas-attn_fa2_pack_Fals_clau_3_7_2025_tben_trac_shar_cuto-len_4000_rope-scal_yarn Text Generation • 8B • Updated 3 days ago • 7