clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.1K-steps_merged_fp16
Text Generation
• 8B • Updated
• 1
clembench-playpen/llama3.1_D40005_DPO_1neg_Aborted_best_models_old_LA_clicgpu5
Updated
clembench-playpen/llama_70B_DPO_1neg_Aborted_best_models_FINAL
Updated
clembench-playpen/Mistral_DPO_1neg_Aborted_best_models_FINAL
Updated
clembench-playpen/H100-L8B_DPO_1neg_Aborted_best_models_FINAL
Updated
clembench-playpen/A100-L8B_DPO_1neg_Aborted_best_models_FINAL
Updated
clembench-playpen/llama-3.1_D40005_DPO_2neg_Aborted_old_LA
Updated
clembench-playpen/llama-3.1_D40005_DPO_2neg_Aborted_same_family_model_old_LA
Updated
clembench-playpen/llama-3.1_D40005_DPO_2neg_Aborted_best_models_old_LA
Updated
clembench-playpen/llama-3.1_D40005_DPO_1neg_Aborted_best_models_old_LA
Updated
clembench-playpen/llama-3.1_D40005_DPO_1neg_Aborted_same_family_model_old_LA
Updated
clembench-playpen/llama-3.1_D40005_DPO_1neg_Aborted_old_LA
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_KTO_FINAL_FINAL
Updated
clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_merged_fp16_DFINAL_0.6K-steps_KTO_FINAL_FINAL
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16
Text Generation
• 71B • Updated
• 6
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.6K-steps
Updated
• 16
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.5K-steps
Updated
• 17
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.4K-steps
Updated
• 373
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.3K-steps
Updated
• 21
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.2K-steps
Updated
• 25
clembench-playpen/llama-3.1-8B-Instruct_playpen_ablation_SFT_DFINAL_0.1K-steps
Updated
• 20
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_fp16_DPO_REF_ALL_PL2_DPO_FINAL
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_fp16_DPO_REF10_PL2_DPO_FINAL
Updated
clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_merged_fp16_DFINAL_0.6K-steps_DPO_FINAL
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_KTO_FINAL
Updated
clembench-playpen/meta-llama-3.1_DPO_1neg_Aborted_best_models_END_07
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_fp16
Text Generation
• 8B • Updated
• 5
clembench-playpen/meta-llama_3.1_KTO_Aborted_best_models_old_and_new_endParallel
Updated
clembench-playpen/meta-llama-3.1_DPO_1neg_Aborted_best_models_END
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_F
Updated