Scaling and Beyond: Advancing Spatial Reasoning in MLLMs Requires New Recipes Paper • 2504.15037 • Published Apr 21, 2025
Optimization-Guided Diffusion for Interactive Scene Generation Paper • 2512.07661 • Published Dec 8, 2025 • 2
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published Oct 28, 2025 • 22
SimScale: Learning to Drive via Real-World Simulation at Scale Paper • 2511.23369 • Published Nov 28, 2025 • 38
SimScale: Learning to Drive via Real-World Simulation at Scale Paper • 2511.23369 • Published Nov 28, 2025 • 38
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving Paper • 2506.09800 • Published Jun 11, 2025 • 1
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper • 2505.02835 • Published May 5, 2025 • 28
view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11, 2025 • 105
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model Paper • 2504.10068 • Published Apr 14, 2025 • 30
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models Paper • 2504.03641 • Published Apr 4, 2025 • 14