POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated 21 days ago • 10
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 90
Reward Bench Collection Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated 13 days ago • 9
view article Article Accelerated Inference with Optimum and Transformers Pipelines May 10, 2022 • 2