yangli

limingme

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

updated a dataset 2 months ago

limingme/ahat_task_50k_0701_v1

published a dataset 2 months ago

limingme/ahat_task_50k_0701_v1

Organizations

None yet

upvoted a paper 9 days ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published 10 days ago • 85

upvoted a paper 6 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 124