Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
FDeRubeis
/
araft_trained_dpo
like
0
PEFT
Safetensors
Generated from Trainer
arxiv:
2210.03629
Model card
Files
Files and versions
Community
Use this model
main
araft_trained_dpo
/
checkpoint-48
1 contributor
History:
1 commit
FDeRubeis
Upload folder using huggingface_hub
bbfcc31
verified
11 months ago
dpo_trained
Upload folder using huggingface_hub
11 months ago
reference
Upload folder using huggingface_hub
11 months ago