Mistral-Small-3.2-AntiRep-24B:

  • Exactly what it says on the tin, Orpo'd Mistral Small 3.2 to remove repetition.
  • Trained to reduce infinite repetition, repetition of structure and sentences in multi turn conversation, and repetition within responses.
  • Got really annoyed with all of my Mistral Small test models having repetition issues, so I decided to whip this up.
  • Produced by doing orpo with Qwen 3 8B at 0 temp + .7 rep pen (<1 increases repetition) as rejected vs V3 03/24 as chosen.
  • The LoRA is also available too, if you want to use it to reduce repetition on other MS3.2 tunes.

Enjoy!

Downloads last month
121
Safetensors
Model size
23.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ConicCat/Mistral-Small-3.2-AntiRep-24B

Finetuned
(34)
this model
Finetunes
1 model
Quantizations
5 models

Dataset used to train ConicCat/Mistral-Small-3.2-AntiRep-24B

Collection including ConicCat/Mistral-Small-3.2-AntiRep-24B