MCP - a jbejar86 Collection

jbejar86 's Collections

MCP

MCP

updated 2 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 16 days ago • 44
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

Paper • 2508.01780 • Published Aug 3 • 18
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs

Paper • 2304.08244 • Published Apr 14, 2023 • 1
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published 15 days ago • 131
Memp: Exploring Agent Procedural Memory

Paper • 2508.06433 • Published 29 days ago • 33
MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models

Paper • 2507.12806 • Published Jul 17 • 19
Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 96
AgentBench: Evaluating LLMs as Agents

Paper • 2308.03688 • Published Aug 7, 2023 • 25
PlanGenLLMs: A Modern Survey of LLM Planning Capabilities

Paper • 2502.11221 • Published Feb 16 • 1
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Paper • 2506.14728 • Published Jun 17
Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First

Paper • 2509.00997 • Published 6 days ago • 2