Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
FDeRubeis
/
araft_trained_dpo
like
0
PEFT
Safetensors
Generated from Trainer
arxiv:
2210.03629
Model card
Files
Files and versions
Community
Use this model
main
araft_trained_dpo
/
dpo_trained
1 contributor
History:
1 commit
FDeRubeis
Upload folder using huggingface_hub
bbfcc31
verified
11 months ago
adapter_config.json
Safe
655 Bytes
Upload folder using huggingface_hub
11 months ago
adapter_model.safetensors
Safe
33.6 MB
LFS
Upload folder using huggingface_hub
11 months ago