license: apache-2.0
datasets: mdb
language: English
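This repo is an attempt to implement the SFT-RW-PPO pipeline from InstructGPT.
It uses GPT2 as the SFT model. Text generated by GPT2 is evaluated by DistilBERT, the score for the positive class is taken as the reward, and the model is then optimized with PPO.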
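The reward and optimization steps can be illustrated with a minimal, dependency-free sketch. It is not the training code of this repo: the function names `positive_score` and `ppo_clipped_objective`, the assumption that the positive class sits at index 1, and the clip value 0.2 are all hypothetical choices made for illustration. The first function turns the classifier's raw logits into the positive-class probability used as the reward, and the second evaluates the standard PPO clipped surrogate for a single sample.

```python
import math


def positive_score(logits, positive_index=1):
    """Softmax probability of the positive class from raw classifier logits.

    positive_index=1 is an assumption; check the classifier's label mapping.
    """
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    return exps[positive_index] / sum(exps)


def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """PPO clipped surrogate for one sample (to be maximized).

    logp_new / logp_old are log-probabilities of the same generated text
    under the current policy and the policy that produced the sample.
    """
    ratio = math.exp(logp_new - logp_old)
    clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
    return min(ratio * advantage, clipped_ratio * advantage)


# Example: a sample the classifier rates as clearly positive.
reward = positive_score([-1.0, 2.0])
objective = ppo_clipped_objective(logp_new=-3.0, logp_old=-3.2, advantage=reward)
```

In a full setup the logits would come from the DistilBERT classifier and the log-probabilities from GPT2, with the advantage usually computed from the reward minus a learned value baseline rather than the raw reward used here.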