ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web • Paper • 2601.08276 • Published 6 days ago • 5
RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction • Paper • 2601.06966 • Published 8 days ago • 7
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning • Paper • 2601.06943 • Published 8 days ago • 202