4 291 121

Zhenran Xu

imryanxu

AI & ML interests

fishing in lab while working on language agents

Recent Activity

upvoted a paper 1 day ago

Why Language Models Hallucinate

upvoted a paper 7 days ago

rStar2-Agent: Agentic Reasoning Technical Report

upvoted a paper 7 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

View all activity

Organizations

upvoted a paper 1 day ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 5 days ago • 117

upvoted 4 papers 7 days ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published 12 days ago • 101

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 7 days ago • 175

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 7 days ago • 79

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 7 days ago • 112

upvoted a paper 2 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 234

upvoted 4 papers 3 months ago

AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Paper • 2506.10540 • Published Jun 12 • 38

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11 • 54

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 78

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Paper • 2505.19000 • Published May 25 • 43

upvoted 10 papers 4 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 121

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 72

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 23

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15 • 26

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13 • 42

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 186

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

Paper • 2505.00212 • Published Apr 30 • 9

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30 • 10

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 203

Zhenran Xu

AI & ML interests

Recent Activity

Organizations

imryanxu's activity