The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts • Paper • 2409.00447 • Published Aug 31, 2024
Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling • Paper • 2510.16751 • Published Oct 19, 2025
Efficient Test-Time Scaling for Small Vision-Language Models • Paper • 2510.03574 • Published Oct 3, 2025
AutoQ-VIS: Improving Unsupervised Video Instance Segmentation via Automatic Quality Assessment • Paper • 2508.19808 • Published Aug 27, 2025