Xiaohui Shen's picture

2

Xiaohui Shen

XiaohuiShen

AI & ML interests

None yet

Organizations

None yet

authored a paper 8 months ago

Vidi: Large Multimodal Models for Video Understanding and Editing

Paper • 2504.15681 • Published Apr 22 • 14

authored 2 papers 11 months ago

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Paper • 2501.07730 • Published Jan 13 • 18

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 85

authored a paper about 1 year ago

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published Nov 1, 2024 • 18

authored 3 papers over 1 year ago

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Paper • 2406.09416 • Published Jun 13, 2024 • 29

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published Jun 11, 2024 • 60

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12, 2024 • 30