CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation Paper β’ 2601.10061 β’ Published 3 days ago β’ 26