PURE
Collection
PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
•
4 items
•
Updated
🚨 This repo does not include the Process Reward Model (PRM). For access to the PRM, please refer to here.
This repository hosts a fine-tuned LLM optimized for better mathematical reasoning capabilities via only process rewards.