Pastiche crown clown logo

CorticalStack/pastiche-crown-clown-7b-dare-dpo

CorticalStack/pastiche-crown-clown-7b-dare-dpo is a DPO fine-tuned version of CorticalStack/pastiche-crown-clown-7b-dare using the jondurbin/truthy-dpo-v0.1 dataset.

LoRA

  • r: 16
  • LoRA alpha: 16
  • LoRA dropout: 0.05

Training arguments

  • Batch size: 4
  • Gradient accumulation steps: 4
  • Optimizer: paged_adamw_32bit
  • Max steps: 200
  • Learning rate: 5e-05
  • Learning rate scheduler type: cosine
  • Beta: 0.1
  • Max prompt length: 1024
  • Max length: 1536
Downloads last month
51
Safetensors
Model size
7.24B params
Tensor type
FP16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for CorticalStack/pastiche-crown-clown-7b-dare-dpo

Finetuned
(1)
this model
Finetunes
2 models
Merges
13 models
Quantizations
3 models

Spaces using CorticalStack/pastiche-crown-clown-7b-dare-dpo 8