seojinlee

sjlee311

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

upvoted a paper 4 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

upvoted a paper 4 days ago

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

View all activity

Organizations

None yet

upvoted a paper 2 days ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published 6 days ago • 47

upvoted 2 papers 4 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 5 days ago • 155

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published 10 days ago • 15

upvoted a paper 13 days ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published 16 days ago • 132

upvoted a paper 16 days ago

Deep Think with Confidence

Paper • 2508.15260 • Published 17 days ago • 81

upvoted a paper 17 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 123

upvoted a collection 25 days ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.2k

upvoted 3 papers 26 days ago

upvoted 7 papers about 1 month ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 123

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 234

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 129

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 89

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 112

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20 • 46

upvoted 3 papers about 2 months ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 119

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

Paper • 2507.11407 • Published Jul 15 • 57

seojinlee

AI & ML interests

Recent Activity

Organizations

sjlee311's activity