Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
wangclnlp
's Collections
GRAM-RR
RoVRM
GRAM
GRAM-RR
updated
6 days ago
Self-Training Generative Foundation Reward Models for Reward Reasoning
Upvote
-
wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel
Text Generation
•
8B
•
Updated
3 days ago
•
13
wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel
Text Generation
•
3B
•
Updated
3 days ago
•
6
wangclnlp/GRAM-RR-TrainingData
Updated
3 days ago
•
24
Upvote
-
Share collection
View history
Collection guide
Browse collections