DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation Paper • 2512.06022 • Published Dec 4, 2025 • 3
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published Apr 21, 2025 • 78