MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated about 5 hours ago
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated about 5 hours ago
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated about 5 hours ago
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-80 Reinforcement Learning • 1B • Updated about 4 hours ago
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-100 Reinforcement Learning • 1B • Updated about 4 hours ago