why are the RL checkpoints smaller than the `safe` checkpoints?

#1
by rdesc - opened

Hi I just wanted some clarification on the different models. What are the base models for the rl and safe model checkpoints?

Sign up or log in to comment