arxiv:2512.13607
Yangyi Chen
YangyiYY
AI & ML interests
Multimodal, Large Language Models
Recent Activity
upvoted
a
paper
about 8 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
liked
a model
21 days ago
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
authored
a paper
21 days ago
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized
Toolsets
Organizations
None yet