smadala2/CS443_RLHF_v1
Safetensors
CS443_RLHF_v1/best_model.pth
Commit History
Uploading PPO LLM
07c1b19 (verified)
smadala2 committed on May 5, 2024