OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published 3 days ago • 36
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 17 days ago • 8.12k • 423
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 24 days ago • 62
view article Article How to make NeuTTS-air generate over 200 seconds of audio in a single second. Nov 21, 2025 • 22
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published Dec 15, 2025 • 64