HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Paper • 2501.14729 • Published Jan 24 • 3
NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding Paper • 2510.27481 • Published Oct 31 • 1
Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution Paper • 2511.19430 • Published 13 days ago • 7