Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yaoyao Qian's picture
2 6 1

Yaoyao Qian

FreaxRuby
andyoung's profile picture
·
https://h-freax.github.io/
  • RubyFreax
  • h-freax
  • rubyfreax

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 49
upvoted a paper 3 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2 • 6
upvoted a paper 5 months ago

TextArena

Paper • 2504.11442 • Published Apr 15 • 28
upvoted an article 7 months ago
view article
Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr •
Feb 7
• 211
upvoted 2 papers about 1 year ago

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 5

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Paper • 2406.11740 • Published Jun 17, 2024 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略