-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 85 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 95 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 91 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 24
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
liked
a dataset
about 18 hours ago
facebook/natural_reasoning
upvoted
a
paper
1 day ago
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Organizations
Collections
4
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 78 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 55 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 106 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 97
models
2
datasets
None public yet