Models: 14

Active filters: Reward

internlm/POLAR-1_8B-Base

Text Classification • Updated Jul 15 • 107 • 1

internlm/POLAR-7B

Text Classification • Updated Jul 15 • 212 • 24

internlm/POLAR-7B-Base

Text Classification • Updated Jul 15 • 77 • 5

SultanR/SmolTulu-1.7b-RM

Text Classification • 2B • Updated Dec 17, 2024 • 11 • 2

mradermacher/SmolTulu-1.7b-RM-GGUF

2B • Updated Dec 17, 2024 • 91

mradermacher/SmolTulu-1.7b-RM-i1-GGUF

2B • Updated Dec 17, 2024 • 243

TEEN-D/squiral_maze

Reinforcement Learning • Updated Mar 30

internlm/POLAR-1_8B

Text Classification • Updated Jul 15 • 99 • 8

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated 3 days ago • 13

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated 3 days ago • 6

mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-GGUF

3B • Updated 2 days ago • 1.14k

mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-i1-GGUF

3B • Updated 2 days ago • 2.39k

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-GGUF

8B • Updated 2 days ago • 2.67k

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-i1-GGUF

8B • Updated 2 days ago • 3.54k