Hugging Face
jkrs
0 followers · 1 following
AI & ML interests
Reinforcement Learning
Recent Activity
Upvoted a paper (6 days ago): VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction
Upvoted a paper (5 months ago): Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
Liked a dataset (over 1 year ago): Anthropic/hh-rlhf
Organizations
None yet
Models (1)
jkrs/output (updated Oct 20, 2022)
Datasets (0)
None public yet