Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning • Paper • 2601.06943 • Published 6 days ago • 201
GVE • Collection • Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3, 2025 • 20
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation • Paper • 2510.14902 • Published Oct 16, 2025 • 15