
license: apache-2.0

datasets: mdb

language: English

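The goal of this repo is to implement the SFT-RW-PPO pipeline from InstructGPT (supervised fine-tuning, reward modeling, PPO).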

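This repo uses GPT2 as the SFT model. Text generated by GPT2 is evaluated by DistilBERT, the score for the positive class is taken as the reward, and the model is then optimized with PPO.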
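The reward and optimization steps described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the code used in this repo: the function names `positive_score` and `ppo_clipped_objective`, the label name `"POSITIVE"`, the list-of-dicts input shape, and the clip range `0.2` are all hypothetical choices made for the example.

```python
import math


def positive_score(class_scores, positive_label="POSITIVE"):
    """Return the score assigned to the positive class.

    Assumption: `class_scores` is a list of {"label": str, "score": float}
    dicts, one per class, as a sentiment classifier might report them.
    The label name "POSITIVE" is also an assumption.
    """
    for entry in class_scores:
        if entry["label"] == positive_label:
            return entry["score"]
    raise KeyError(f"label {positive_label!r} not found in classifier output")


def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """Clipped PPO surrogate objective for a single action (to be maximized).

    ratio = pi_new(a|s) / pi_old(a|s), computed from log-probabilities.
    The objective takes the more pessimistic of the unclipped and clipped terms.
    """
    ratio = math.exp(logp_new - logp_old)
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    return min(ratio * advantage, clipped_ratio * advantage)


# Hypothetical classifier output for one generated sentence.
scores = [
    {"label": "NEGATIVE", "score": 0.1},
    {"label": "POSITIVE", "score": 0.9},
]
reward = positive_score(scores)

# The new policy doubles the probability of the sampled token; with a
# positive advantage the gain is capped by the clip range.
objective = ppo_clipped_objective(math.log(2.0), 0.0, advantage=reward)
```

In a full pipeline the advantage would normally come from a value estimate rather than the raw reward; the raw reward is used here only to keep the sketch short.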
