Tao's picture

7 1

Tao

Leitian

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

upvoted a paper 10 days ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

upvoted a paper 24 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

Paper • 2509.02534 • Published 4 days ago • 22

upvoted a paper 10 days ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published 12 days ago • 46

upvoted 3 papers 24 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published 26 days ago • 44

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Paper • 2507.23751 • Published Jul 31 • 4

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published 27 days ago • 90

upvoted a paper about 2 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted a paper 4 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184