Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
FDeRubeis
/
araft_trained_dpo
like
0
PEFT
Safetensors
Generated from Trainer
arxiv:
2210.03629
Model card
Files
Files and versions
Community
Use this model
main
araft_trained_dpo
/
checkpoint-72
Commit History
Upload folder using huggingface_hub
bbfcc31
verified
FDeRubeis
commited on
Apr 8, 2024