seojinlee

sjlee311

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

upvoted a paper 4 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

upvoted a paper 4 days ago

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

View all activity

Organizations

None yet

upvoted a paper 2 days ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published 6 days ago • 45

upvoted 2 papers 4 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 5 days ago • 155

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published 10 days ago • 15

upvoted a paper 13 days ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published 16 days ago • 132

upvoted a paper 16 days ago

Deep Think with Confidence

Paper • 2508.15260 • Published 17 days ago • 81

liked a model 17 days ago

MoritzLaurer/DeBERTa-v3-large-mnli-fever-anli-ling-wanli

Zero-Shot Classification • 0.4B • Updated Apr 11, 2024 • 136k • • 103

upvoted a paper 17 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 123

liked a model 20 days ago

google/gemma-3-270m

Text Generation • 0.3B • Updated 24 days ago • 144k • 747

liked a model 24 days ago

simplescaling/s1.1-7B

Text Generation • 8B • Updated Mar 9 • 3.86k • • 6

upvoted a collection 25 days ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.2k

liked a model 25 days ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 212k • • 349

liked a dataset 26 days ago

ReasoningTrap/AIME

Viewer • Updated May 27 • 34 • 125 • 1

upvoted 3 papers 26 days ago

upvoted 4 papers about 1 month ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 123

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 234

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 129

liked a model about 1 month ago

openai/gpt-oss-20b

Text Generation • 22B • Updated 12 days ago • 9.1M • • 3.43k

seojinlee

AI & ML interests

Recent Activity

Organizations

sjlee311's activity