Yihe Deng PRO
ydeng9
AI & ML interests
LLM post-training
Recent Activity
upvoted
a
paper
3 days ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
new activity
about 1 month ago
ydeng9/OpenVLThinker-7B-v1.2:Add project page link to model card
published
a dataset
about 1 month ago
openvlthinker/OpenVLThinker_SFT_iter1