minghao

Liam-Liu

AI & ML interests

LLM, AD

upvoted 5 papers 3 days ago

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

Paper • 2509.01596 • Published 5 days ago • 1

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 6 days ago • 59

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 4 days ago • 101

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 4 days ago • 76

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 4 days ago • 144

upvoted a paper 6 days ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published 12 days ago • 192

upvoted a paper 18 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 123

upvoted a paper 25 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published 29 days ago • 173

upvoted 2 papers about 1 month ago

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6 • 156

upvoted 2 papers about 2 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 90

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published Jul 8 • 42

upvoted a paper 3 months ago

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Paper • 2505.13032 • Published May 19 • 2

upvoted 2 papers 4 months ago

DMind Benchmark: The First Comprehensive Benchmark for LLM Evaluation in the Web3 Domain

Paper • 2504.16116 • Published Apr 18 • 12

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5 • 32

upvoted a paper 5 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 45

upvoted a paper 6 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 70

upvoted a paper 7 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 106

upvoted a paper 10 months ago

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published Oct 27, 2024 • 41

upvoted a paper over 1 year ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 49
