Hugging Face
Wenkai Yang
Keven16
5 followers · 3 following
https://keven980716.github.io/
keven980716
AI & ML interests
None yet
Recent Activity
Upvoted an article, 8 days ago: Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
Commented on a paper, 11 days ago: ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Published a model, about 1 month ago: Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview
Organizations
None yet
Keven16's datasets (2)
Keven16/DeepCritic-RL-Data • Viewer • Updated May 13 • 55k • 6
Keven16/DeepCritic-4.5K • Preview • Updated May 13 • 12