Yangyi Chen's picture

11 12

Yangyi Chen

YangyiYY

·

https://yangyi-chen.github.io/

AI & ML interests

Multimodal, Large Language Models

Recent Activity

upvoted a paper about 8 hours ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

liked a model 21 days ago

nvidia/Nemotron-Cascade-8B-Intermediate-ckpts

authored a paper 21 days ago

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

View all activity

Organizations

None yet

Papers 11

arxiv:2512.13607

arxiv:2507.06448

arxiv:2505.08971

arxiv:2407.06438

models 13

YangyiYY/Qwen2.5_sft_tabmwp_textreason_RL

3B • Updated Mar 13, 2025 • 3

YangyiYY/Qwen2.5-VL_sft_tabmwp_textreason_RL

Updated Mar 11, 2025

YangyiYY/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 11, 2025

YangyiYY/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 10, 2025

YangyiYY/Qwen2.5-1.5B-Open-R1-Distill

Updated Mar 10, 2025

YangyiYY/model_output_sft_llama_preferred_mixed

Text Generation • 8B • Updated Aug 12, 2024 • 2

YangyiYY/model_output_sft_llama_rejected

Text Generation • 8B • Updated Aug 12, 2024 • 6

YangyiYY/model_output_sft_llama_preferred

Text Generation • 8B • Updated Aug 10, 2024 • 1

YangyiYY/model_output_dpo_llama_data

Text Generation • 8B • Updated Aug 10, 2024 • 4

YangyiYY/model_output_dpo2

Text Generation • 8B • Updated Aug 8, 2024 • 3

datasets 2

YangyiYY/VLM-SFT

Viewer • Updated Dec 3, 2024 • 1.13M • 44 • 2

YangyiYY/LVLM_NLF

Preview • Updated Nov 17, 2023 • 164 • 12