wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel
Text Generation
Safetensors
English
llama
Reward
RewardModel
RewardReasoning
Reasoning
RLHF
Best-of-N
conversational
arxiv: 2509.02492
License: apache-2.0
main
Commit History
21503fb  Update README.md  (verified; wangclnlp committed 7 days ago)
3eb74f3  Upload README.md with huggingface_hub  (verified; wangclnlp committed 11 days ago)
fe0561b  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
9bdbf53  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
68da516  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
a559cc7  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
a9a7dc9  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
5be04c0  Upload folder using huggingface_hub  (verified; wangclnlp committed 11 days ago)
b84282e  Upload folder using huggingface_hub  (verified; wangclnlp committed 12 days ago)
03cbf0d  initial commit  (verified; wangclnlp committed 12 days ago)