Yaoyao Qian's picture

2 6 1

Yaoyao Qian

FreaxRuby

·

https://h-freax.github.io/

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 49

upvoted a paper 3 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2 • 6

upvoted a paper 5 months ago

TextArena

Paper • 2504.11442 • Published Apr 15 • 28

upvoted an article 7 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 211

upvoted 2 papers about 1 year ago

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 5

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Paper • 2406.11740 • Published Jun 17, 2024 • 1