🚨 This repo does not include the Process Reward Model (PRM). For access to the PRM, please refer to here.

This repository hosts an LLM fine-tuned to improve mathematical reasoning, trained using only process rewards.
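Since the repository hosts causal-LM weights, a minimal loading sketch with the Hugging Face `transformers` library might look like the following. The repository ID is the one shown on this page. The prompt wording in `build_prompt`, the helper names, and the generation settings are illustrative assumptions, not a format documented by the authors. `device_map="auto"` additionally assumes the `accelerate` package is installed.

```python
MODEL_ID = "jinachris/Qwen2.5-7B-PURE-PRM"  # repository name as shown on this page


def build_prompt(question: str) -> str:
    """Wrap a math question in a plain step-by-step instruction.

    The wording is an assumed, generic prompt, not an official template.
    """
    return (
        "Solve the following problem step by step, "
        "then state the final answer.\n\n"
        f"Problem: {question.strip()}\n\nSolution:"
    )


def generate(question: str, max_new_tokens: int = 512) -> str:
    """Load the model and return its greedy completion (hypothetical helper)."""
    # Imported lazily so build_prompt works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # the page lists BF16 as the tensor type
        device_map="auto",           # assumption: `accelerate` is installed
    )
    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens and decode only the newly generated continuation.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("What is the sum of the first 10 positive integers?")` would download the weights on first use and return the model's continuation as a string. Greedy decoding is chosen here only to keep the sketch deterministic.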

Format: Safetensors

Model size: 7.62B params

Tensor type: BF16

Model: jinachris/Qwen2.5-7B-PURE-PRM

Base model: Qwen/Qwen2.5-7B