yang
dearaj23
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
14 days ago
Memory in the Age of AI Agents
liked
a dataset
about 1 month ago
openai/gsm8k
upvoted
a
paper
about 2 months ago
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding
Organizations
None yet
deep research
-
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Paper • 2509.13305 • Published • 91 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 106 -
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Paper • 2510.14438 • Published • 13
LLM
survey
RL
multi-agent
-
MALT: Improving Reasoning with Multi-Agent LLM Training
Paper • 2412.01928 • Published • 45 -
Multi-Agent System for Comprehensive Soccer Understanding
Paper • 2505.03735 • Published • 25 -
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation
Paper • 2510.09116 • Published • 96
CoT
memory
RL
deep research
-
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Paper • 2509.13305 • Published • 91 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 106 -
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Paper • 2510.14438 • Published • 13
multi-agent
-
MALT: Improving Reasoning with Multi-Agent LLM Training
Paper • 2412.01928 • Published • 45 -
Multi-Agent System for Comprehensive Soccer Understanding
Paper • 2505.03735 • Published • 25 -
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation
Paper • 2510.09116 • Published • 96
LLM
CoT
survey