zhang's picture
6 1

zhang

kekueknu2
·

AI & ML interests

None yet

Recent Activity

Organizations

san's profile picture san's profile picture

kekueknu2's activity

upvoted an article 9 days ago
view article
Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

By NormalUhr •
• 11
upvoted an article 5 months ago
view article
Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

• 170