Mingzhe Li's picture

4 2

Mingzhe Li

Mubuky

·

https://www.mubuky.com

Mubuky

AI & ML interests

RL & Agent

Recent Activity

upvoted a paper 26 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

upvoted a paper 26 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 27 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted 2 papers 26 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27 • 83

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 27 days ago • 237

upvoted a paper 27 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 28 days ago • 93

liked a dataset about 2 months ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 9 days ago • 4.9k • 1.09k • 12

authored a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 210

upvoted a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 210

updated a dataset 2 months ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 9 days ago • 4.9k • 1.09k • 12

liked a model 3 months ago

Qwen/WorldPM-72B

Text Classification • 73B • Updated May 17 • 101 • 80