Zijie Chen

Zijie-chen

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

upvoted a paper 4 days ago

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

upvoted a paper 5 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Organizations

None yet

upvoted 2 papers 4 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published 5 days ago • 57

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Paper • 2602.02477 • Published 13 days ago • 10

upvoted a paper 5 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published 6 days ago • 39

upvoted 2 papers 6 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published 13 days ago • 32

upvoted a paper 11 days ago

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published 13 days ago • 8

upvoted 2 papers 3 months ago

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Paper • 2511.23319 • Published Nov 28, 2025 • 24

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

upvoted a paper 4 months ago

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30

upvoted a paper 9 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

upvoted a paper 11 months ago

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published Mar 20, 2025 • 14

upvoted 2 papers over 1 year ago

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Paper • 2401.13919 • Published Jan 25, 2024 • 32

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 18

upvoted a paper almost 2 years ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 49