Swap faces in images using a source image
Swap faces in videos
Generate videos from text prompts
Generate images from a face image and a text prompt
Generate product shots using text or reference images
Fill and modify images using a mask and prompt
Erase objects from images using masks
Generate high-resolution images from text prompts
FitDiT is a high-fidelity virtual try-on model
High-fidelity virtual try-on