Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
wangclnlp
/
GRAM-RR-LLaMA-3.1-8B-RewardModel
like
0
Text Generation
Safetensors
English
llama
Reward
RewardModel
RewardReasoning
Reasoning
RLHF
Best-of-N
conversational
arxiv:
2509.02492
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
47fb536
GRAM-RR-LLaMA-3.1-8B-RewardModel
Commit History
Update README.md
47fb536
verified
wangclnlp
commited on
10 days ago
Upload README.md with huggingface_hub
5b5261b
verified
wangclnlp
commited on
13 days ago
Upload folder using huggingface_hub
0013266
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
706ea74
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
1d97403
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
6c53225
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
46fabba
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
4af9bd0
verified
wangclnlp
commited on
14 days ago
Upload folder using huggingface_hub
0060ca9
verified
wangclnlp
commited on
14 days ago
initial commit
337d41a
verified
wangclnlp
commited on
15 days ago