zhang

kekueknu2

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published Jan 26 • 126

upvoted an article about 1 year ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Article • Feb 4, 2025

upvoted an article over 1 year ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • Dec 9, 2022 • 404

upvoted a collection almost 2 years ago

LLM papers

Collection

A collection of papers useful for studying LLMs. • 14 items • Updated Apr 3, 2024 • 15

upvoted a paper almost 2 years ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

upvoted 2 collections almost 2 years ago

Foundation AI Papers

Collection

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 36

Reading Papers

Collection

231 items • Updated Jul 28, 2025 • 13
