Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper
•
2511.22699
•
Published
•
145
ARC mainly focuses on areas of computer vision, speech, and natural language processing, including speech/video generation, enhancement, retrieval, understanding, AutoML, etc. Considering research developments and industry trends, ARC consistently pursues exploration, innovation, and breakthroughs in technologies.
ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time