mariosirt/EleutherAI-gpt-neo-125m-detoxified-perspective Reinforcement Learning • Updated Jun 11, 2023 • 2
Evan-Lin/Bart-RL-many-entailment-attractive-keywordmax Reinforcement Learning • Updated Jul 13, 2023 • 12
nlp-lab-2023-seq2seq/R-best-fine-tuned-bart-base-full-ft-reward_short_sentences_and_words-2023-07-13T06-49-08 Reinforcement Learning • Updated Aug 20, 2023 • 15 • 1
Evan-Lin/Bart-RL-many-entailment-attractive-epoch1 Reinforcement Learning • Updated Jul 14, 2023 • 14
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment Reinforcement Learning • Updated Jul 15, 2023 • 13
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward1 Reinforcement Learning • Updated Jul 15, 2023 • 12
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward2 Reinforcement Learning • Updated Jul 15, 2023 • 13
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_v2 Reinforcement Learning • Updated Jul 15, 2023 • 13
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward5 Reinforcement Learning • Updated Jul 16, 2023 • 13