10 117 21

Jiaheng Liu

CheeryLJH

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

upvoted a paper 1 day ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

upvoted a paper 10 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

View all activity

Organizations

upvoted a paper 1 day ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published 2 days ago • 44

upvoted a paper 10 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 13 days ago • 77

upvoted a paper 12 days ago

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

Paper • 2508.16402 • Published 15 days ago • 14

upvoted a paper 17 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published about 1 month ago • 123

upvoted a paper 19 days ago

DINOv3

Paper • 2508.10104 • Published 24 days ago • 237

upvoted a paper 25 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published 26 days ago • 44

upvoted a paper 26 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 26 days ago • 105

upvoted 2 papers about 1 month ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6 • 156

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 129

upvoted 5 papers about 2 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10 • 47

upvoted 3 papers 2 months ago

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Paper • 2507.04952 • Published Jul 7 • 9

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 63

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17 • 35

upvoted 3 papers 3 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 70

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 62

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

Jiaheng Liu

AI & ML interests

Recent Activity

Organizations

CheeryLJH's activity