Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Carbon Emissions

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Models

111

Full-text search

Active filters: reward model

Qwen/Qwen2.5-Math-7B-PRM800K

Text Classification • Updated Jan 17 • 1.52k • 13

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated Jan 17 • 36.9k • 56

berkeley-nest/Starling-LM-7B-alpha

Text Generation • Updated Mar 20, 2024 • 16.6k • 558

CallComply/Starling-LM-11B-alpha

Text Generation • Updated Mar 4, 2024 • 1.84k • 13

johnsnowlabs/JSL-MedMNX-7B

Text Generation • Updated Apr 18, 2024 • 2.71k • 5

nvidia/Nemotron-4-340B-Reward

Updated Jun 19, 2024 • 40 • 116

internlm/internlm2-1_8b-reward

Text Classification • Updated Jul 15, 2024 • 9.33k • 12

internlm/internlm2-20b-reward

Text Classification • Updated Oct 9, 2024 • 285 • 24

Qwen/Qwen2.5-Math-RM-72B

Text Classification • Updated Oct 31, 2024 • 18k • 74

nvidia/Llama-3.1-Nemotron-70B-Reward

Updated Oct 15, 2024 • 40 • 71

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated Oct 15, 2024 • 926 • 79

Qwen/Qwen2.5-Math-PRM-72B

Text Classification • Updated Jan 17 • 1.05k • 68

internlm/internlm-xcomposer2d5-7b-reward

Any-to-Any • Updated 27 days ago • 715 • 7

mradermacher/Starling-LM-11B-alpha-GGUF

Updated 14 days ago • 298 • 1

mradermacher/Starling-LM-11B-alpha-i1-GGUF

Updated 13 days ago • 526 • 1

nicholasKluge/RewardModelPT

Text Classification • Updated Jun 18, 2024 • 42

nicholasKluge/RewardModel

Text Classification • Updated Jun 18, 2024 • 59

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 7 • 23

fnlp/moss-rlhf-reward-model-7B-en

Updated Jul 13, 2023 • 9

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 14 • 102

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 7

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 7 • 1

LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 8 • 2

LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 8 • 1

LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2

Text Generation • Updated Nov 27, 2023 • 8 • 2

TheBloke/Starling-LM-7B-alpha-GGUF

Updated Nov 28, 2023 • 1.33k • 94

TheBloke/Starling-LM-7B-alpha-AWQ

Text Generation • Updated Nov 28, 2023 • 87 • 9

second-state/Starling-LM-7B-alpha-GGUF

Text Generation • Updated Mar 20, 2024 • 58 • 3

TheBloke/Starling-LM-7B-alpha-GPTQ

Text Generation • Updated Nov 28, 2023 • 36 • 9

bartowski/Starling-LM-7B-alpha-old-exl2

Text Generation • Updated Nov 28, 2023