SemanticGen: Video Generation in Semantic Space Paper • 2512.20619 • Published 12 days ago • 88
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Paper • 2505.23359 • Published May 29, 2025 • 38
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper • 2505.23747 • Published May 29, 2025 • 68
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Paper • 2312.16256 • Published Dec 26, 2023 • 18
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025 • 126
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 433
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published Dec 2, 2024 • 84
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Paper • 2411.16856 • Published Nov 25, 2024 • 13
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published Nov 14, 2024 • 77