Jiang Jiwen

jjw0126
·

AI & ML interests

RL, LLM

Recent Activity

Organizations

ucas's profile picture ELM Team's profile picture PLM-Team's profile picture

jjw0126's activity

upvoted 2 articles 5 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

771
view article
Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
42