1 7 2

ChengpengLi

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

published a model 3 days ago

ChengpengLi/START

upvoted a paper about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

upvoted a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

View all activity

Organizations

None yet

ChengpengLi's activity

published a model 3 days ago

ChengpengLi/START

Updated 3 days ago

upvoted 2 papers about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 346

liked a Space 4 months ago

202

Qwen2.5 Math Demo

🧮

Describe math images and answer questions

upvoted 2 collections 5 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 75

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 50

liked a model 7 months ago

Qwen/Qwen2-Math-72B

Text Generation • Updated Aug 8, 2024 • 57 • 28

authored a paper 7 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

authored a paper 8 months ago

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 18

upvoted 2 papers 8 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 18

authored a paper 8 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16