Hugh Zhang
hugh-scale
0 followers · 2 following
AI & ML interests
None yet
Recent Activity
Authored a paper 27 days ago: Humanity's Last Exam
Authored a paper 6 months ago: Chain-of-Thought Reasoning is a Policy Improvement Operator
Authored a paper 6 months ago: Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
Organizations
Papers (7)
arxiv:2501.14249
arxiv:2409.03733
arxiv:2408.15221
arxiv:2406.04520
models
None public yet
datasets (1)
hugh-scale/hugh • Updated Feb 22, 2024 • 6