SophiaWang (Zixia Wang)

upvoted a paper 3 months ago

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation

Paper • 2508.20470 • Published Aug 28 • 75

upvoted a paper 8 months ago

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8 • 138

upvoted 15 papers 9 months ago

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published Dec 10, 2024 • 55

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 43

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 47

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published Jan 21 • 48

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 54

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 56

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6 • 55

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 95

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Paper • 2502.08639 • Published Feb 12 • 43

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 55

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 106

FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis

Paper • 2503.13265 • Published Mar 17 • 15

Zixia Wang

AI & ML interests

Organizations

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Parallelized Autoregressive Visual Generation

GenEx: Generating an Explorable World

1.58-bit FLUX

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

The GAN is dead; long live the GAN! A Modern GAN Baseline

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Goku: Flow Based Video Generative Foundation Models

FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis

Zixia Wang

AI & ML interests

Organizations

SophiaWang's activity