Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenkai Yang's picture
3 8

Wenkai Yang

Keven16
VanTricht's profile picture dongguanting's profile picture LloydAndersen's profile picture
·
https://keven980716.github.io/
  • keven980716

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
commented on a paper 11 days ago
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
published a model about 1 month ago
Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview
View all activity

Organizations

None yet

Keven16 's datasets 2

Keven16/DeepCritic-RL-Data

Viewer • Updated May 13 • 55k • 6

Keven16/DeepCritic-4.5K

Preview • Updated May 13 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略