weize's picture

7 13 9

weize

weizechen

·

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

openbmb/MiniCPM-SALA

upvoted a paper 3 days ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

liked a model 9 days ago

openbmb/MiniCPM-o-4_5

View all activity

Organizations

upvoted a paper 3 days ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Paper • 2602.04811 • Published 8 days ago • 2

upvoted a paper 3 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

upvoted 5 papers 5 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 22

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11, 2025 • 80

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 31

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76

upvoted a paper 8 months ago

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models

Paper • 2506.03517 • Published Jun 4, 2025 • 13

upvoted 3 papers about 1 year ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5, 2025 • 24

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Paper • 2412.07720 • Published Dec 10, 2024 • 31

upvoted 2 papers over 1 year ago

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

Paper • 2410.08115 • Published Oct 10, 2024 • 8

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Paper • 2407.07061 • Published Jul 9, 2024 • 28